profile

Simran Deepak Makariye

Interests : Data Science, Deep Learning and Computer Vision (Generative AI)

About Me

Hello! I'm Simran Makariye , pursuing my M.S. degree in computer science at NYU-Courant. Currently interning at Audible, I'm working on some interesting recommendation systems, employing reinforcement learning to deliver personalized content to customers. Previously, I spent over 2 years at Qualcomm as a software engineer, focusing on machine learning and location technology. Additionally, I've collaborated on numerous deep learning and computer vision projects with professors/researchers from NYU and Georgia Tech.

My primary areas of interest are deep learning, computer vision, and Generative AI. Beyond building these models to address real-world challenges, I'm interested in developing more efficient and scalable AI solutions, making them practical and accessible. Therefore, I'm actively learning about accelerating deep learning workloads and optimizing compute utilization.

I am always open to collaborate with like-minded individuals who share my passion towards deep learning. If you have any intriguing research projects or work opportunities that align with my interests, please do reach out over email at sdm8499 [at] nyu [dot] edu. I am eager to continue learning and making meaningful contributions to these exciting fields.

Projects

Hate-LLaMA: An Instruction-tuned Audio-Visual Language Model for Hate Content Detection

Fine-tuned Video-LLaMA, which effectively processes video and audio information multimodally, on a hate video dataset using 4 RTX8000 GPUs, achieving an F1-score of 73% on the unseen test dataset. Additionally, released a benchmark dataset comprising 300 labeled hate/non-hate videos to address the scarcity of labeled hate video datasets. Also, deployed a demo for Hate-LLaMA (Multimodal LLM) on HuggingFace Spaces using Gradio.

Video Frame Prediction and Semantic Segmentation with Self-Supervised Learning

Implemented a fully CNN architecture - SimVP - for video frame prediction and trained a U-Net architecture for Semantic Segmentation on a diverse synthetic dataset consisting of video clips of 3D moving objects achieving a Jaccard Index of 0.251 with minimal training and fine-tuning.

Multi-Image Fusion and Semantic Segmentation for Power Substation Delineation

Collaborating with Transition Zero, a leading clean energy technology company, to develop deep neural models that fuse multiple revisits of low-resolution Sentinel-2 L1C imagery and generate a super-resolved image and then applying semantic segmentation task to accurately delineate power substations from the super-resolved image.

Integrated Content Management Platform - New York Public Library

Developing a generalized content management pipeline for the NYPL website, incorporating deep learning techniques to enhance collection item accessibility and enable automated meta-data tagging. Also, integrating crowdsourcing technologies to further refine and optimize these solutions, resulting in improved user experience and streamlined content management processes.

Semantic Role Labeling on NomBank Dataset

Implemented a fully CNN architecture - SimVP - for video frame prediction and trained a U-Net architecture for Semantic Segmentation on a diverse synthetic dataset consisting of video clips of 3D moving objects achieving a Jaccard Index of 0.251 with minimal training and fine-tuning.

Smart Mental Health Monitoring using Wearable Sensors

Designed and implemented an LSTM-based stress detection model using time-series data consisting of both wrist features and chest features emulated as wrist features. Achieved a remarkable accuracy of 90%.

Human Neuron Inspired CNN architecture for Music Emotion Recognition

Designed a CNN-based Music Emotion Recognition model that mimics the way a human brain responds to an emotion i.e. by firing a specific set of neurons for each emotion. It yielded an accuracy of 82% with the additional benefit of fewer computational parameters.

Work Experience

September 2023 - Present

Audible, Amazon

Data Science Intern - Product & Insights Team

July 2023 - October 2023

Georgia Institute of Technology

Research Assistant, Advisor: Prof. Eva Dyer

Research focus: DL in Neuroscience

May 2023 - September 2023

New York Univesity

Research Assistant, Advisor: Prof. Vasant Dhar.

Research focus: Deep Learning in Finance

June 2020 - July 2022

Qualcomm

Software Engineer - Location Technology Team

May 2019 - July 2019

Qualcomm

Software Engineering Intern - Location Technology Team

May 2018 - July 2018

Indian Institute of Technology, Hyderabad

Research Intern - Advisor: Prof. Manohar Kaul

Education

September 2022 - Present

New York University (Courant Institute of Mathematical Sciences)

Master of Science in Computer Science

Coursework:
Large Language and Vision Models, Deep Learning, Computer Vision, Big Data and ML Systems, Cloud and Machine Learning, Natural Language Processing, Algorithms, Operating Systems.

Teaching Assistant:
1. Intro to CV (Taught by Prof. Jean Ponce) - Spring 2024
2. Deep Learning (Taught by Prof. Yann Lecun and Prof. Alfredo Canziani) - Spring 2024, Fall 2023
3. Programming Bootcamp - Math in Finance - Summer 2023

July 2016 - June 2020

National Institute of Technology, Tiruchirappalli

B.Tech in Computer Science and Engineering and Minor Degree in Management

Coursework:
Natural Language Processing, Machine Learning, Probability Theory, Linear Algebra, Optimization Techniques.

Contact Me

Call On

(+1) 929 636 9424

Follow me

LinkedIn