Arduin Findeis

Hi there! I am a machine learning (ML) researcher who likes to build software. My research focuses on the evaluation of ML systems: which systems are “better” or “worse”. In particular, for language model and reinforcement learning applications. Recent projects include Feedback Forensics, Inverse Constitutional AI (ICAI) and Beobench. I am a PhD candidate in the Department of Computer Science at the University of Cambridge and member of the AI4ER CDT, and was recently ML intern at Apple. Reach out by sending an email or scheduling a call.

🗞️ News

07/2025: 👋 I’ll be at ACL 2025 in Vienna, let’s meet!

05/2025: 🎉 LLM-as-a-Judge tool-use paper accepted at ACL 2025

04/2025: New blog post: Analysis of Llama 4 on Chatbot Arena

03/2025: 🚀 Launch of Feedback Forensics app: try it now!

01/2025: 🎉 Inverse Constitutional AI accepted at ICLR 2025