Harish Balaji Boominathan headshot

Harish Balaji Boominathan

MS Computer Engineering @ NYU

Professional Experience

Google Summer of Code – Open Source Contributor
University of California, OSPO (Jun 2025 – Sep 2025)
  • Designed privacy metrics quantifying re-identification risk across structured datasets, including a normalized risk scoring system.
  • Integrated image dataset support with pre-trained vision models (CLIP, ViT, DINOv2).
  • Prototyped a RAG-based chatbot for interactive data readiness explanations (92% satisfaction, 1,000+ users).
  • Enhanced evaluation pipeline for multi-modality and visual dashboards of quality, fairness, and privacy metrics.
Graduate Research Assistant – VIDA Lab
New York University (Jan 2025 – Present)
  • Developed multi-sensor fusion tools for 12+ nodes across Brooklyn intersections, aggregating 2+ hours of multimodal data.
  • Deployed HRNet-based pose estimation on traffic video feeds (1,000+ pedestrians, safety risk analysis).
  • Built ML pipelines to classify 5,000+ objects and compute pedestrian speed with 95%+ accuracy.

Featured Projects

El Silencio Acoustic Explorer – Edge AI for Biodiversity Monitoring
  • Multi-branch pipeline: RawAudioCNN, EfficientNetB3 (LoRA), ResNet50, PANNs-based embeddings.
  • Achieved 385% throughput increase (13,100 FPS), model size 197.5MB, latency 23ms/sample (RTX6000).
  • Edge deployment on Raspberry Pi 5, real-time metrics with Prometheus/Grafana (<75ms latency, 8+ users).
  • Integrated MLflow for experiment tracking and edge reliability monitoring.
RouteWise – Intelligent Route Optimization Engine
  • Five algorithmic solvers: Greedy, Brute Force, Dynamic Programming, Constraint Programming (OR-Tools), hybrid genetic-annealing.
  • Reduced planning time by 85% with smart algorithm selection and weighted preference scoring.
  • Minimized API usage by 80% via caching and batch requests to Google Distance Matrix APIs.
  • Route generation with Google Maps Static API, supporting multi-day, preference-weighted itineraries.
DailyPod – Automated News-to-Audio Intelligence Platform
  • Aggregates multilingual news from NewsAPI, deduplicates with custom NLP, ranks by relevance/recency.
  • Summarizes articles using GPT-3.5 with language-aware prompts.
  • Text-to-speech audio delivery via WhatsApp Business API.
  • Backend: Flask, Celery, Redis for async processing, scaling, and retry logic. Monitoring via dashboards.
XGChurn – Predictive Analytics Platform
  • XGBoost classifier (87% accuracy, 85% ROC-AUC) with SHAP interpretability.
  • Streamlit dashboard for churn probability exploration (2,000+ customers).
  • Feature engineering: credit score drop, transaction frequency, etc.

Education

New York University
Master of Science in Computer Engineering (Sep 2024 – May 2026)
GPA: 4.00 / 4.00
  • Research: Fine-tuned RoBERTa on 120K+ news articles with LoRA adapters (92%+ accuracy, 0.5M trainable params).
  • Core: Machine Learning, Deep Learning, ML Systems Engineering, Applied Matrix Theory
SASTRA University
Bachelor of Technology in Computer Science and Engineering (Aug 2020 – Jun 2024)
  • Technical Foundation: Algorithms, Operating Systems, Machine Learning, Computer Networks, Databases

Technical Expertise

Languages & Infrastructure: Python, C++, SQL, Bash, FastAPI, Git, Docker, Kubernetes, Linux
ML & Data Science: PyTorch, TensorFlow, ONNX, MLflow, Streamlit, Prometheus, Grafana
Specializations: Distributed Systems, System Design, Model Optimization, Edge Deployment, CI/CD, Privacy-Preserving ML, MLOps

Connect

This portfolio showcases the systems, models, and tools I've architected—from privacy-focused ML infrastructure to scalable backend pipelines and edge-deployed intelligence. Explore the repositories or reach out for collaboration opportunities.