I’m a machine learning engineer with 4 years experience training transformers, including frontier LLMs at ChatGPT scale—I build multimodal and bilingual models and run distributed training across NVIDIA and AMD hardware.
In recent roles, I ran model training efforts for next-generation LLMs. I helped design training curricula, tune optimization strategies, and guide data strategy for diverse multilingual and multimodal use cases. My work included scaling up distributed training on both NVIDIA and AMD accelerators, profiling kernels, and tightening data/compute pipelines to maximize throughput and downstream performance.
Before moving into AI, I spent 2 years in quantitative finance, applying statistical tools to pricing, risk, and portfolio construction. That experience informs my pragmatic approach to metrics, uncertainty, and reproducibility when shipping models with strict reliability requirements. It also gave me an introduction to the investing world, and informs my literacy with finance and dealmaking.
I earned an MSc in Advanced Computer Science from Oxford, where I concentrated on natural language processing systems. I graduated top of my Mathematics class at Notre Dame and received a BA in Honors Mathematics with a concentration in Computing.
I want to learn and make you money. I personally produce or review every deliverable and ensure our clients have what they need to make great decisions.