Job Experience

DeepNeuronic

Software Engineer / ML Researcher - 1 Year:

  • Created a custom theft action dataset and benchmarked hundreds of Large Video Language Models (LVLMs) for theft detection in surveillance applications.
  • Researched and implemented multi-camera tracking methods to improve robustness across multiple views..
  • Built a large-scale age and gender dataset using public, manual annotated, and synthetic data generated via generative models; trained and deployed an age and gender classification model.
  • Created and annotated a personal protective equipment (PPE) dataset and trained a computer vision detection model to ensure compliance in real-world scenarios.
  • Optimized core code for efficiency, reduced dependencies, vulnerabilities and documented the codebase for maintainability and onboarding.

Projects and Publications

Survey of Demonstration Learning

Comprehensive survey of demonstration learning, covering problem formulation, learning methods, benchmarks, applications, and open research challenges.

Project 1

Hierarchical Decision Transformer

Advanced state-of-the-art sequence modeling in offline reinforcement learning using a hierarchical architecture.

  • High-level and low-level transformers for task decomposition.
  • High-level model proposes milestone states for the low-level model to reach.
  • Milestones guide the policy to achieve complex tasks without prior knowledge or user interaction.

Decision Mamba Architectures

Further advanced the state-of-the-art sequence modeling in offline reinforcement learning by employing the Mamba architecture.

  • Decision Mamba and Hierarchical Decision Mamba improve upon transformer predecessors in performance, size, and inference speed.
  • Decision Mamba offers the best performance, while only employing one model.
  • The evolutionary parameter provides the guidance signal missing in transformers.
Project 3

DEFENDER

Algorithm to increase the safety of RL policies.

  • Significantly reduces crash rates while improving overall policy performance.
  • Uses small sets of safe and unsafe demonstrations.
  • Compares the current trajectory with each demonstration.
  • Terminates unsafe trajectories and disincentivizes dangerous behavior.
  • Significantly reduces crash rates while improving overall policy performance.
DEFENDER

Multi-View Contrastive Learning from Demonstrations

Framework to learn task-relevant visual representations from demonstrations.

  • Leverages demonstrations recorded from multiple viewpoints.
  • Trains an image encoder to produce viewpoint-invariant features.
  • Uses these features as environment states to train reinforcement learning policies.
  • Policies perform across unseen viewpoints and can extend to variations in illumination or background.
CLfD

Music to Dance as Language Translation

Transformer and Mamba based framework to translate music to dance.

  • Developed two Architectures: Mamba and Transformer based.
  • Conducted ablation study on the effects of different music features.
  • Proposed an algorithm to translate to translate music features to dance poses.
  • Applied to a UR3 robotic arm and generalized to unseen choreographies.

About Me

I am a Machine Learning researcher and engineer with a Ph.D. in Computer Science, specializing in demonstration and reinforcement learning. I have published work in top peer-reviewed venues. I have 1+ year experience of industry experience applying ML models in production environments.

I have experience developing and deploying computer vision and ML models in production, working with large-scale datasets, and evaluating state-of-the-art models across robotics and surveillance domains.

I am particularly interested in ML engineering applied research roles.

My Resume