Alexander George Padula

alexander.george.padula@gmail.com
PadLex
alex-padula
EU & US Citizen

EDUCATION

ETH Zurich
2024 - 2026
MSc Computer Science
(ML major)
Maastricht University
2021 - 2024
BSc Data Science and AI
Summa Cum Laude

SKILLS

Python
(PyTorch, JAX, VERL, TensorFlow)
HPC
(SLURM, Grace-Hopper chips)
JavaScript
(React, Node.js, TypeScript)
C
(CUDA)
Java, C#

PUBLICATIONS

We implemented a new board-game DSL in JAX: games compile to XLA, enabling RL training entirely on the GPU. Offers up to 10x speedup against previous SoTA DSL and lays groundwork for training general game-playing models.
We designed a system to generate novel board games in a DSL using a LLM as a mutation operator. Automated evaluation through play-through sampling between general game-playing agents, with heuristics to score runs.
In my BSc thesis, I explored novel PPO variations to stabilize LLM training with explicit rewards (now known as RLVR). Restored KL-divergence to the classic RL formulation and introduced a novel batch-entropy exploration bonus.

PROFESSIONAL EXPERIENCE

Mistral (incoming) - Research Internship
Paris, FR Sep 2026 - *
I'll work on reward-based post-training for code at Mistral and could choose to stay for an industry PhD.
SID.ai - Research Fellowship
San Francisco, USA Jun 2026 - Aug 2026
Developing a new Policy Gradient method for agentic search.
Apertus AI - Research Assistant
Zurich, CH Oct 2026 - Jun 2026
Contributed to the online RL pipeline for Apertus, a fully open-source 70B LLM. Improved code sandbox stability and reduced training time 6x.
Apple - Research Internship
Seattle, USA July 2024 - Sep 2024
Authored an internal paper on code generation with LLMs. Presented research to 60+ employees, gaining traction across departments with the potential to impact the next product cycle. Learned to share complex ideas concisely.
Maastricht University - Honors Program
Maastricht, NL Sep 2022 - Sep 2023
Developed a coding assistant for the Ludii DSL. Trained a LLM and overcame data scarcity using grammar constrained decoding; had to develop a parser for incomplete code that could mask illegal tokens during inference.
Maastricht University - TA & Tutor
Maastricht, NL Nov 2021 - July 2024
Covered tuition and rent by serving as a university teaching assistant and tutoring privately in computer science.
Vivid Vision - Contract Work
Remote Jun 2021 - Oct 2021
Built web interface currently used by glaucoma patients and their doctors to manage data and view test results. Worked with founders and dev team. Limited technical debt and managed state across complex React applications.
CELI Language Technology - Internship
Turin, IT Nov 2018 - Aug 2019
Built and deployed a content-based image search engine. Mentored by Dr. Bolioli’s team, I trained BERT and Inception V3 to embed images and descriptions in a shared latent space. This project underpinned my high school thesis.
Programize LLC - Internship
Athens, GR Aug 2018 - Sep 2018
Worked in the dev team of a mobile digital payment application. Debugged Node.js API and built a prototype in React. Operated in a large codebase and loved working in a team. Received an offer to return and a recommendation letter.

PROJECTS

IBM Research: Machine Learning on Analog Devices
Zurich, CH Feb 2026 - Jun 2026
In collaboration with IBM, I'm developing a post-training method to make LLMs more robust to the noise introduced by analog computations. This is a critical step toward commercializing IBM's in-memory-compute devices.
Zurich, CH Oct 2025 - Jan 2026
We derived Muon's preconditioner and found that Muon operates beyond the classical edge of stability. We also empirically validated that the flat minima hypothesis holds for Muon’s solutions in a realistic setting.
Reinforcement Learning Agent for Diplomacy-style game
Zurich, CH Sep 2025 - current
Exploring RL for massively multi-agent, mixed-motive games. Developing open-source gym API for Open Front.
Zurich, CH Apr 2025 - Jun 2025
Only course project selected by IVIA lab for publication (under review). We built a LLM-assisted study environment that syncs recordings and exercises into a visual knowledge graph to quickly navigate between multimodal-sources.
EDMO
Maastricht, NL Sep 2023 - Feb 2024
Designed novel data-passthrough joints and contributed to a ML model for inferring a modular robot’s configuration.
Open-source Laser Cutting Tool
Treviso, IT Jan 2020 - Jan 2021
Built 3D printer and transformed it into a DIY laser cutter. Developed the open-source Python library svg-to-gcode. The tool’s GUI has garnered over 20,000 downloads and a community that helps maintain it.

PASSIONATE

about hiking, rock climbing, cooking, and robotics. Native English and Italian speaker.