Resume

Jonathan Simkin, PhD

Data and analytics leader and PhD epidemiologist. I build and lead across applied AI and ML, population-scale data systems, and the research and reporting they make possible, mostly in healthcare and other high-stakes, regulated settings.

PhD-trained data and analytics leader with 10+ years in regulated, high-stakes healthcare environments. My recent work is applied AI: NLP and language-model systems running in production. It sits on a longer arc of transforming population-scale data systems and turning them into research and reporting people act on. A player-coach: I lead a 20+ person multidisciplinary pod of data scientists, data engineers, and subject matter experts while staying close to model design, evaluation, and deployment. I pair hands-on technical depth with Responsible AI governance, and I translate technical trade-offs into clear decisions for executives.

GenAI & LLMs

LLMs RAG Agents Fine-tuning Copilot

Applied ML & NLP

NLP & transformers Classification Ensembles Evaluation & calibration

Production ML & MLOps

Model lifecycle Drift & performance monitoring Gated promotion Human-in-the-loop Model audits

Statistics & inference

Experimental & study design Bayesian & Poisson regression Time-series Spatial models (BYM2) Forecasting

Responsible AI & governance

Model risk governance Bias & fairness evaluation Interpretability Privacy-preserving ML Data stewardship & privacy

Cloud & tooling

Azure Python SQL R PyTorch HuggingFace Shiny Git / Linux

Leadership & strategy

AI strategy Roadmap ownership Hiring & mentoring Executive advisory Grants & funding

Director, BC & Yukon Cancer Registries · Provincial Health Services Authority

May 2023 – Present
Vancouver, BC
  • Lead a 20+ person multidisciplinary pod (data scientists, ML engineers, analysts, domain experts) as a hands-on technical leader, owning roadmap, priorities, and quality standards while staying close to model design and production readiness. Manage a $2.5M operating budget.
  • Own end-to-end delivery of classification and predictive ML, putting MLOps practices in place: model versioning, gated promotion from PoC to production, and drift and performance monitoring.
  • Stood up Azure cloud environments and Copilot agents for secure AI workflows, partnering with platform teams on identity, security, and deployment for ML and generative AI workloads.
  • Established Responsible AI and model risk governance covering bias auditing, fairness evaluation across sex and ethnicity subgroups, human-in-the-loop review, and interpretability. Secured $1.3M+ in competitive funding.

Scientific Director, BC & Yukon Cancer Registries · BC Cancer

Oct 2019 – May 2023
Vancouver, BC
  • Designed and deployed a transformer-based NLP classification pipeline (PyTorch, HuggingFace) processing 5M+ records per year, replacing legacy rule-based workflows for a 100x throughput improvement in production.
  • Architected an ensemble of fine-tuned language models with dual preprocessing pipelines, achieving 98 to 99% recall in production with documented fairness across sex and ethnicity subgroups.
  • Built statistical and Bayesian modeling capability for population-level risk estimation and forecasting, including time-series, Poisson, joinpoint, and BYM2 spatial models, supporting resource allocation.
  • Led structured model audits when production performance diverged from test metrics, redesigning training data and restoring performance; served as data steward in a privacy-regulated environment.

Cancer Epidemiologist · Government of Yukon

Dec 2015 – Oct 2019
Whitehorse, YT
  • Advised the Chief Medical Officer of Health, delivering predictive analytics and statistical models to inform public health policy and resource allocation.
  • Authored the territory's recurring cancer statistics reports and dashboards, turning complex population data into clear recommendations senior decision-makers acted on.

Production NLP classification pipelines · 5M+ records/yr

Led problem framing, fine-tuning of domain-pretrained transformers, ensemble design, evaluation, and production deployment with drift monitoring and human-in-the-loop review. Embedded fairness evaluation across demographic subgroups to meet regulated-environment requirements. Operational since 2023.

PyTorch·HuggingFace·Azure·MLOps

Generative AI pilot with Responsible AI governance

Designed and governed a generative AI pilot for a high-priority workload manual processes couldn't keep up with, using retrieval-augmented generation grounded in vetted content to limit hallucination risk. Secured authorization to run generative AI with privileged data access in a privacy-regulated environment.

RAG·LLMs·Governance

Real-time diagnostic data platform

Secured $418K and oversaw delivery of a streaming platform that captures every provincial diagnostic imaging report, cutting ingestion latency and enabling ML-based health-system analytics.

Streaming·Azure·Data integration
Adjunct Professor, School of Population and Public Health
University of British Columbia · 2025 – Present
Founder & Lead, Pan-Canadian AI in Cancer Community (PACC-AI)
2023 – Present
Co-Chair, Executive Committee, Canadian Council of Cancer Registries
2021 – Present
Canadian Representative, High-Level Strategic Group, NAACCR
2024 – Present
Academic peer-reviewer, health and data science journals
2016 – Present
PhD, Population and Public Health
University of British Columbia · 2016 – 2022
Master of Public Health (MPH)
University of British Columbia · 2014 – 2016
BSc, Cell Biology and Genetics
University of British Columbia · 2009 – 2014