ML Engineer

Swaminathan Sankaran

I design and ship end-to-end ML systems — from model training through drift monitoring to production deployment.

About

A bit about me

I'm a graduate student at the University at Buffalo pursuing an MS in Data Science. I got into ML because I wanted to build things that actually work — not just models that perform well in a notebook, but systems that hold up in production.

That mindset led me to MLOps, drift monitoring, and containerized deployments during my time as an ML engineer at Zolvit, and it drives the projects I work on now — from multimodal drug discovery to medical imaging security.

I'm currently looking for ML engineering roles where I can bring production-grade systems thinking to real problems.

Projects

Selected Work

MLOps / Production Systems

Drift-Aware MLOps Pipeline

End-to-end system that detects silent model degradation through statistical drift monitoring and triggers automated retraining — recovering performance without manual intervention.

  • Seven-stage Airflow DAG: data extraction, drift detection, retraining, smoke testing, model promotion
  • Full containerized stack — FastAPI, MLflow, Evidently, Prometheus, Grafana — via Docker Compose
  • Automated retraining recovered +0.1178 ROC-AUC on recession-shifted data

Multimodal Deep Learning

Molecule Similarity Prediction

Tri-modal contrastive learning system for drug discovery — fuses 2D molecular images, 3D atomic geometry, and chemical fingerprints to predict pairwise similarity.

  • Pretrained three encoders (ResNet-18, SchNet, MLP) on 291k molecules using NT-Xent contrastive loss
  • Achieved ~0.92 Pearson correlation on expert-annotated pairs, beating the Tanimoto baseline
  • Late-fusion architecture with per-modality cosine similarities for interpretable decomposition

Medical Imaging

Patch-Level CT Tamper Classification

Tamper detection for volumetric CT scans that identifies localized manipulation at the patch level — a harder problem than whole-scan forgery detection.

  • 3D convolutional compressor converts 16-slice patches into 2D feature maps for ImageNet-pretrained ResNet-18
  • Trained on 169 volumetric lung CT scans with 5-fold cross-validation
  • Achieved 0.95 validation AUC, outperforming 2.5D, full 3D, and projection baselines

Experience

Where I've Worked

Machine Learning Engineer, Intern

Zolvit

Feb 2024 — Aug 2024
  • Eliminated ~23 hours/week of manual data entry by building an OCR + T5-large extraction pipeline for legal documents
  • Improved legal retrieval precision by ~28% across 10,000+ documents using hybrid Elasticsearch keyword + Pinecone vector search
  • Built a document routing classifier achieving 92% accuracy on TF-IDF/Doc2Vec features, automating intake workflows
  • Containerized ML services with Docker and deployed on AWS EC2, maintaining 99.9% uptime with Grafana monitoring

Education

Academic Background

University at Buffalo

MS in Engineering Science (Data Science)

Aug 2025 — Dec 2026

Relevant Coursework

Statistical Learning I & II Machine Learning Probability Theory Database Data Science Data Models & Query Languages Numerical Methods

Vellore Institute of Technology

B.Tech in Computer Science and Engineering (AI & ML)

Aug 2019 — Jul 2023

Relevant Coursework

Machine Learning Deep Learning Reinforcement Learning Computer Vision Applied Linear Algebra Statistics Data Structures & Algorithms Database Management Systems

Skills

Technical Toolkit

Languages & Databases

Python
C+C/C++
SQLSQL
RR
PostgreSQL
MySQL
Elasticsearch
PNPinecone
Snowflake
Linux

AI & Machine Learning

PyTorch
TensorFlow
Scikit-learn
XGXGBoost
Hugging Face
LCLangChain
LGLangGraph
RGRAG
TBTensorBoard
NumPy
Pandas

MLOps & Cloud

Docker
Kubernetes
AWSAWS
Airflow
MLflow
EVEvidently AI
Prometheus
Grafana
Git
Linux

Data & Visualization

Spark
MPMatplotlib
SBSeaborn
Plotly
Streamlit
TBTableau
BIPower BI
Pandas
NumPy
Git
RDRDKit

Awards & Certifications

Recognition

Get in Touch

Let's build something that works.

Open to ML engineering roles and collaborations. Reach out through any of these channels.