Navpreet Singh

Graduate Student at New York University (Courant)

About Me

I am a masters student at NYU Courant primarily interested in Statistics, Machine Learning and Deep Learning. I have an experience of 2+ years in Software Engineering including Backend Development and Machine Learning .

Education

New York University

Master of Science in Computer Science

2021 - Present

New York University (NYU) is a private research university in New York City. Founded in 1831 by a group of prominent New Yorkers as an institution to "admit based upon merit rather than birthright or social class"

I am graduated from NYU Courant with a degree of Masters in Computer Science.

Delhi Technological University

Bachelor of Technology in Computer Science

2014 - 2018

Delhi Technological University (DTU), formerly known as the Delhi College of Engineering (DCE) is a state university in New Delhi, India. It was established in 1941 as Delhi Polytechnic.

I am graduated from DTU with a degree of Bachelor of Technology in Computer Engineering.

Work Experience

Cigna ( Evicore Healthcare )

Machine Learning Engineer

May 2022 - Dec 2022

evicore.com

eviCore empowers the improvement of care by connecting patients, providers, and health plans with intelligent, evidence-based solutions to enable better outcomes.

I performed EDA to analyze distributions, find correlations and gain understanding of health data used for prior authorization to approve complex treatment or prescription by health insurance provider for the patients. I also worked on deploying machine learning models in Azure Cloud by creating pipelines.

Wissen Technology

Software Engineer

November 2020 - January 2021

wissen.com

Wissen is a leading technology consulting and solutions company practice primarily geared towards the domains of Banking & Finance, Telecom, and Healthcare.

My work focused on training and implementing a Language Model for the Intelligent Search application for our client Baker Hughes. I implemented pdf parsing on the technical pdf documents as well as the BERT Transformer language model using sentence similarity for intelligent search.

MetricStream

Member of Technical Staff - Developer

September 2018 - July 2020

metricstream.com

MetricStream is the world’s largest independent provider of governance, risk, and compliance (GRC) products & solutions.

My primary focus was on the backend development of the GRC Application for Risks and Audit modules with some work on custom configuration for the GRC application as well on the M7 platform.

Research Experience

Urban Complexity Lab, NYU

Graduate Research Assistant

April 2021 - Aug 2022

ucomp.net

Urban Complexity Lab at NYU’s Center For Urban Science + Progress is unfolding complexity of urban systems for research, innovation and applications. We leverage big urban data and cutting edge machine learning and network analysis techniques to make our cities more smart, efficient, sustainable, resilient – better places to live in.

I worked on implementing and evaluating novel Graph Neural Network architecture for learning the structure features of the networks of human mobility and interactions.

Teaching Experience

New York University

Teaching Assistant for Applied Data Science ( CUSP-GX 6001 )

September 2021 - Aug 2022

cusp.nyu.edu

CUSP is an interdisciplinary research center dedicated to the application of science, technology, engineering, and mathematics in the service of urban communities across the globe

I worked with the professor introducing new content for the class. Conducting weekly recitation introducing urban case studies, getting students familiar with data science practices and libraries like pandas, matplotlib. As well as grading homeworks and answering student doubts.

Projects

Classifying AI relevant papers from Arxiv paper dataset

github.com/navpreetnp7/Classifying-arXiv-paper-using-GPT2

GPT2 classifier using few shot learning and language model classification

Implemented zero and few shot learning using GPT2 transformer from huggingface to classify AI and non AI relevant papers with just 10 examples in the training. Also experimented with different classification layers and convolution on top of GPT2 language model output training it with 20 samples, getting better score on precision-recall metrics compared with the few shot learning approach

Generating fake images conditioned on different facial features

github.com/navpreetnp7/conditional-DCGAN

Fake image generator conditioned on facial attributes

Implemented and trained Conditional DCGAN as a part of NYU Computer Vision Course project, using the celebrity dataset with facial attributes and base deep convolutional GAN architecture with Minimax loss function. Conditioned the generator on facial attributes like hair in order to produce fake images containing such attributes.

Algorithmic Trading Bots leveraging Machine Learning ( Independent Study in collab with Citibank )

github.com/navpreetnp7/Trading-Bots

Intelligent trading bots using Machine Learning.

Designed and developed trading bots running Percent of Volume (POV) algorithm to execute trades on Eurodollars securities leveraging Machine Learning to predict the market volumes and classify them as Bull/Bear and executing trades accordingly.

Traffic vehicle tracking and speed estimation

github.com/navpreetnp7/Car-Tracking-and-speed-estimation

Track vehicles on the road and estimate their speed.

Implemented using Pretrained YOLOv4 on COCO dataset for object detection and DeepSort Model for object tracking. The application is able to detect vehicles in a video footage and estimate their speed in pixels/sec.

Simple chatbot using seq2seq encoder-decoder model

github.com/navpreetnp7/Chatbot

Simple chatbot trained on Movie Dialogues Dataset

I trained a simple seq2seq model Chatbot using Pytorch based on multi-layered Gated Recurrent Unit (GRUs) and Attention mechanism using movie scripts dataset from the Cornell Movie-Dialogs Corpus.

Automatic detection of malignant tumors in the lungs using CT scans

github.com/navpreetnp7/LUNA-Challenge

Detection of tumors using Lung Nodule Analysis in CT Scans

I trained an end-to-end complex deep neural network using Pytorch based on Lung Nodule Analysis dataset of chest CT scans, finding the nodules using simple CNN residual network architecture and then identifying the tumours using state of the art U-Net Architecture for Object Detection and Image segmentation and then using transfer learning to use the same resnet model to train and classify them as benign/malignant.

Classification of DNA Sequence based on promoter region (where protein contact with DNA helix)

I cleaned and prepared UCI DNA sequences data. Used exploratory data analysis to do feature engineering and trained the dataset on KNN, Decision Trees, SVM, Random Forest and Adaboost classifiers using pandas and sklearn to select the best performing model which was SVM.

Predicting house prices using Liner Regression trained on Ames Housing Dataset

I cleaned, prepared, and visualised Ames Housing dataset and performed feature selection and feature engineering using pandas, seaborn and scipy. Then trained the data on Linear Regression techniques lasso and gradient boosting using sklearn and used ensemble method stacking and averaging the models to produce the best fit for the data.

A Little More About Me

Alongside my interests in Software Engineering and Machine Learning, some of my other interests and hobbies are:

  • Book Reading. My recent favourites include Sapiens, Humankind: A hopeful history (Non Fiction)
  • Basketball
  • Travelling (Beaches, Mountain Trekking)