Research

(* denotes equal contribution; dagger denotes corresponding author)

Preprints

Journal Papers

Conference Papers

Miscellaneous

Software

  • OASIS_stat: python package installable via pip. Created 2024 based on OASIS: pypi, documentation, Github.

  • SPLASH: optimized package for reference-free genomic analysis. Created 2023, Github.

Presentations and talks

  • “Genomic inference without a reference genome.” MIT Computational and Systems Biology (CSB) Seminar 2025.

  • “Statistical Integration of Bulk and Single-Cell Sequencing for Improved TCR Repertoire Analysis.” Schmidt Center Symposium on Biomedical Science and AI: poster + lightning talk 2025.

  • “Statistical Integration of Bulk and Single-Cell Sequencing for Improved TCR Repertoire Analysis.” Cold Spring Harbor Laboratory, Biological Data Science: poster + lightning talk 2024.

  • “Reference-free Genomic Inference for Unbiased Discovery.” ClearNote Health (a liquid biopsy cancer detection company), 2024.

  • “SPLASH: A statistical, reference-free genomic algorithm unifies biological discovery.” RECOMB Highlight, 2024.

  • “Unbiased biological discovery without a reference genome.” MIT Health Science Technology Conference, 2024.

  • “Statistical and algorithmic challenges in reference-free analysis.” Broad MIA Seminar (video) 2024.

  • “SPLASH: A statistical, reference-free genomic algorithm unifies biological discovery.” DF/HCC Celebration of Early Career Investigators in Cancer Research 2024. Best Poster award.

  • “A statistical reference-free genomic algorithm subsumes common workflows and enables novel discovery.” Cold Spring Harbor Laboratory: Biological Data Science meeting 2022. Platform presentation.

  • “Bandit-based Monte Carlo Optimization.” Cornell ORIE Young Researcher's workshop 2021. Poster.

  • “Bandit-based Monte Carlo Optimization for Nearest Neighbors.” Baylearn 2020 (Symposium). Poster.

  • “Adaptive Monte Carlo Optimization: Ultra Fast Medoid Identification via Correlated Sequential Halving.” Baylearn 2019 (Symposium). Poster.

  • “DAMN fast: DNA Alignment using Multi-armed baNdits: Spectral Jaccard Similarity for long-read alignment.” Intelligent Systems for Molecular Biology (ISMB/ECCB 2019). Poster.

  • “Ultra Fast Medoid Identification via Correlated Sequential Halving.” 2019 North American School of Information Theory (School). Poster.

Teaching

  • Information Theory (Stanford, EE276/Stats376a): Winter 2021-2022

  • Probability & Random Processes (Berkeley EECS 126): TA in Spring 2017, Head TA in Spring 2018

  • Discrete Math and Probability, Efficient Algorithms and Intractable Problems (Berkeley CS70 and CS170): reader in Fall 2015, Spring 2016 respectively