profile.jpg

Jacob Si

PhD Student at Imperial College London

Hi, I’m Jacob, a Computer Science PhD student at Imperial College London advised by Yingzhen Li and Marek Rei. My research interests span various domains including tabular learning, generative models, variational inference, language agents and post-training alignment. Previously, I was a Research Intern at Samsung AI Research (2025).

Before joining Imperial, I grew up in Singapore and Malaysia, then earned my Bachelor’s from the University of Toronto and my Master’s from the University of California, Los Angeles. I was fortunate to be advised by Rahul Krishnan at the Vector Institute, and Jonathan Kao at UCLA SEAS. Lastly, I am grateful to my instructors, Jimmy Ba and David Duvenaud, for sparking my initial interest in machine learning.

If you are interested in research or would like to collaborate, please feel free to email me!

Email:

‘y.si23‘ @ ‘imperial.ac.uk‘

‘jacobyhsi‘ @ ‘ucla.edu‘

‘jacobyhsi‘ @ ‘cs.toronto.edu‘

Selected Publications [full list]

(*) denotes equal contribution

  1. NeurIPS
    Variational Uncertainty Decomposition for In-Context Learning
    I. Shavindra Jayasekera*, Jacob Si*, Filippo Valdettaro, Wenlong Chen, Aldo Faisal, and Yingzhen Li
    In the 39th Conference on Neural Information Processing System, 2025.
  2. ICMLSpotlight
    InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation
    Jacob Si, Wendy Yusi Cheng*, Michael Cooper*, and Rahul Krishnan
    In the 41st International Conference on Machine Learning, 2024.
    Spotlight Presentation [top 3.5%]
  3. TMLR
    TabRep: Training Tabular Diffusion Models with a Simple and Effective Continuous Representation
    Jacob Si, Zijing Ou*, Mike Qu*, Zhengrui Xiang*, and Yingzhen Li
    In the Transactions on Machine Learning Research, 2026.
  4. Workshop
    TabRAG: Improving Tabular Document Question Answering for Retrieval Augmented Generation via Structured Representations
    Jacob Si*, Mike Qu*, Michelle Lee, Marek Rei, and Yingzhen Li
    In the 39th Conference on Neural Information Processing System, AI for Tabular Data Workshop, 2025.
  5. Submission
    Diffusion Alignment Beyond KL: Variance Minimisation as Effective Policy Optimiser
    Zijing Ou, Jacob Si, Junyi Zhu, Ondrej Bohdal, Mete Ozay, Taha Ceritli, and Yingzhen Li
    In submission.
  6. Book Chapter
    Assessing Infant Mortality Rate: Problems stemming from Household Living Conditions, Women’s Education and Health
    Jacob Si, and Rohan Alexander
    In "Telling Stories with Data: With Applications in R" by Rohan Alexander