AI/ML Clinical Data Science Intern

Lexeo Therapeutics

Posted 3 months ago

Internship

New York, New York

In Person

Smart Summary

Responsibilities

You will design and execute data-driven analyses using clinical study data to generate insights on disease progression and gene therapy mechanisms. This involves building analytical pipelines, training machine learning models, and collaborating with scientific stakeholders to interpret findings.

Qualifications

Lexeo Therapeutics is seeking an AI/ML Clinical Data Science Intern to analyze clinical study data and generate insights into rare cardiovascular conditions and gene therapies. The ideal candidate will be pursuing a degree in a quantitative field (e.g., Computer Science, Statistics) or a life science with AI/ML experience, and have proficiency in Python, R, or SQL. Experience with clinical data and AI model development is a plus.

Job Description

Lexeo Therapeutics is a clinical-stage genetic medicine company headquartered in New York City, pioneering cardiac genetic medicine candidates to treat the root causes of inherited cardiovascular diseases. Our lead program, LX2006, targets cardiomyopathy associated with Friedreich’s Ataxia and anchors a broader pipeline addressing genetically defined conditions such as hypertrophic and arrhythmogenic cardiomyopathies. Backed by a strong financial foundation, Lexeo is positioned to translate groundbreaking science into durable clinical impact.  

\nRole Summary

This internship sits at the intersection of artificial intelligence, machine learning, and clinical drug development. You will work directly with Lexeo’s clinical and scientific leadership to design and execute data-driven analyses using real clinical study data. The overarching goal is to generate novel insights that inform our understanding of:

  • Disease burden and progression in rare cardiovascular conditions
  • Mechanisms of action of AAV-based gene therapies in human subjects
  • Novel clinical endpoints and biomarkers that could strengthen future study designs

This is hands-on, hypothesis-driven R&D work. You will not be running pre-packaged reports or prompting general-purpose AI tools — you will be building and deploying analytical pipelines, training models, and contributing to scientific interpretation alongside domain experts.

Primary Responsibilities
  • Write scripts in Python, R, SQL, and/or Cypher to extract, join, and transform clinical data from internal and external sources
  • Build and work within structured databases and data lakes to organize multi-modal clinical datasets
  • Perform rigorous data validation, cleaning, and QC to ensure analytical readiness
  • Train, validate, and deploy ML models — including supervised, unsupervised, and generative AI approaches — applied to clinical and biomarker data
  • Apply advanced statistical techniques relevant to small-n clinical datasets, including mixed-effects models, survival analysis, and dimensionality reduction
  • Evaluate model performance, interpretability, and clinical relevance in collaboration with scientific stakeholders
  • Use analytical platforms such as Power BI, Excel, and Maxis to generate summaries, dashboards, and visualizations for cross-functional audiences
  • Synthesize findings from LEXEO’s internal clinical studies alongside relevant external data sources
  • Participate in working sessions with clinical scientists, statisticians, and CMC colleagues to discuss analytical direction and results
Required Skills and Qualifications
  • Currently enrolled in a college or university (undergraduate or graduate level) in Computer Science, AI/ML, Statistics, Biostatistics, Bioinformatics, or a related quantitative field, OR  enrolled in a biological or life science program with demonstrable, substantive experience in applied ML/AI through coursework, research, or prior internships
  • Hands-on experience building, training, or deploying machine learning or AI models in an academic project, research lab, or prior work setting
  • Proficiency in at least one of: Python, R, SQL, or Cypher for data manipulation and analysis
  • Ability to work independently on open-ended analytical problems, make methodological decisions, and communicate trade-offs clearly
Nice to have: 
  • Familiarity with clinical or biomedical data (EHRs, biomarkers, imaging, clinical trial datasets)
  • Experience with Power BI or similar BI/visualization platforms
  • Background in cardiovascular biology, rare diseases, or gene therapy
  • Prior exposure to generative AI model development or large language model fine-tuning
  • Experience working within regulated or GxP-adjacent data environments
\n$50 - $50 an hour\n

Lexeo Therapeutics

Based in New York City, Lexeo Therapeutics is a clinical-stage genetic medicines company dedicated to transforming healthcare by applying pioneering science to fundamentally change how disease is treated. Building on groundbreaking research from Weill Cornell Medicine and the University of California San Diego, Lexeo partners with preeminent institutions on the cutting edge of gene therapy research. Using a stepwise development approach, Lexeo is leveraging early proof-of-concept functional and biomarker data to advance a pipeline of cardiovascular and APOE4 associated Alzheimer's disease programs, and is led by pioneers and experts with decades of collective experience in genetic medicines, rare disease drug development, manufacturing and commercialization. For more information, please visit www.lexeotx.com.
Runway Icon
Boost Your Interview Chances

With Runway

See Your Fit for This Role

1-5 min

Your Score

?

Top Applicants

90%

Your Job Search Advantage

Key Gaps & Next Steps:

Address these in your resume & Interview

Top Strengths For This Role

Highlight these in your cover letter & interview

Your Interview Guide

A Personalized Interview Strategy

Freshest Opportunities

Never Miss a Good Fit

Get notified when jobs mach your criteria