Sam Rosen - CV
Education
- Summer Research Fellowship (2025)
- Honorable Mention for Teaching Assistant of the Year (2023)
- Bass Connections Fellowship (2022)
Work Experience
Machine Learning Engineering Intern
- Trained and tuned deep learning models for content-to-content recommendation systems.
- Conducted successful A/B testing to show efficacy of changes on production systems.
Machine Learning Acceleration Intern
- Added robust standard error calculations to internal simulation metrics.
- Implemented Kernel Density Estimation and Bandwidth Selection for automatically analyzing performance degradation.
Research and Development Intern
- Lead development of epiviz.gl, a JS framework for visualizing genomic data with WebWorkers and WebGL.
- Developed data selection, rendering, navigation with a pseudo grammar-of-graphics implementation.
Software Engineering Intern (Cloud Integrations team)
- Optimized and bolstered Azure crawlers responsible for crawling millions of data points an hour.
- Debugged and implemented fixes for issues found by customers in production for crawled metrics.
DevOps Intern
- Created a scalable application for live tracking of release branches to production using AWS Lambda.
- Designed serverless architecture scalable to arbitrary codebase size with complete up-to-date release data.
- Designed DynamoDB schema and frontend with React for a responsive efficient API and user interface.
Software Engineering Intern (Large-scale analytics group)
- Developed graph analytics using Java and functional programming with Gremlin for compliant graph databases.
- Programmed low-memory implementations of machine learning algorithms for training on arbitrarily large data.
- Created analytics for graph multi-edge merging, time-series, and data fusion using Java and MapReduce.
- Developed random forest algorithm on a distributed data system for classifying attributes on graph vertices.
Software Engineering Intern
- Worked with a partner to build a microservice acting as a REST backend to serve PDF’s with Spring.
- Created a service to read generated reports through a REST API on a dynamic front-end with React.
Publications
-
Sam Rosen and Jason Xu (Aug. 2026). “Constrained Weighted Bayesian Bootstrap”. In: Conference on Uncertainty in Artificial Intelligence 42, Accepted and to appear. arXiv: 2606.04237 [stat.ME]. Github.
-
Sam Rosen and Jason Xu (May 2026). “Affinity Graph Connectivity in Convex Clustering“. In Submission. arXiv: 2605.24673 [stat.ML]. Github.
-
Sam Rosen, Eric C. Chi, and Jason Xu (Apr. 2026). “Biconvex Biclustering“. In Submission. arXiv: 2604.03936 [stat.ML]. Github.
-
Jiachang Liu, Sam Rosen, Chudi Zhong, and Cynthia Rudin (Dec. 2023). “OKRidge: Scalable Optimal k-Sparse Ridge Regression”. In: Advances in neural information processing systems 36. NeurIPS 2023 Spotlight Paper, pp. 41076–41258. arXiv: 2304.06686 [cs.LG]. Github.
Teaching Experience
- Developed extensive original material
- Developed material/guest lecturer as part of BASS Connections
Selected Projects
- Developed for Genentech (advised by: Jayaram Kancherla) to visualize genomic data seamlessly via declarative specifications and WebGL.
- Designed to visualize millions of data points and entire chromosomes at 60 FPS with high precision.
- Created a no-dependency package to make asynchronous logging easy with a highly customizable API.
- Published on PyPI with complete test code coverage, continuous integration, and extensive documentation.
- Completed semester long project researching the NP-HARD problem of the most efficient way to pack circles.
- Formulated an algorithm which packs circles in linear time achieving competitive densities near 70 percent.
Miscellaneous
- Programming Languages: Python***, R***, Java***, JavaScript***, Julia**, C/Cpp**, Matlab*, Scala*, WebGL*
- Reviewer: 2026 AAAI MURE Workshop
- Research Interests: Statistical Computing, Convex Optimization, Clustering, High-performance Computing