Talks

A collection of my invited talks, presentations, workshops, and panel discussions. Click on the years to navigate.

Case Study: How Does DeepSeek’s FlashMLA Speed up Inference

Toronto Machine Learning Summit, 2025

Gave a talk in the Inference Scaling track on how the Flash Multi-Head Latent Attention algorithm for NVIDIA Hopper GPUs works.

Scaling Large Language Models: Getting Started with Large-Scale Parallel Training of LLMs

TMLS Workshop, 2025

Conducted a workshop on distributed training (data parallelism, FSDP, Tensor Parallelism) of Large Language Models (LLMs) using the MinText JAX library I wrote.

A Practitioner's Guide to Safeguarding Your LLM Applications

Toronto Machine Learning Summit, 2024

Conducted workshop to explore safeguarding of Large Language Models (LLMs) in production for data scientists, researchers, and CTOs.

Implementing Structure Mapping in Deep Learning Models for Abstract Reasoning

Analogical Minds Seminar Spring Series, 2022

Presented seminar on my paper Neural Structure Mapping (NSM) for systematic analogical reasoning to an audience of cognitive scientists, psychologists, and computer scientists.

Breaking into AI: Industry Speaker Panel

University of Toronto Machine Intelligence Unit Panel, 2021

Appeared on a panel to educate University of Toronto undergraduate students about the different career avenues available to pursue within Artificial Intelligence