Hi, I'm
Harshit Kumar Gupta

Senior Software Engineer at Amazon

SDE II at Amazon | Machine Learning, CV, NLP Practitioner | Cloud Architect
San Diego, California, United States

A versatile Machine Learning professional building scalable and responsible AI Applications using ML models and evaluating them on various KPIs, then deploying as microservices. Passionate about learning new technologies and mastering them in a very short time span.

Career Highlight

Currently leading ML-powered anomaly detection systems at Amazon, processing High volume of Selling Partner Payouts.

Self-motivated team player
Strong analytical & problem solving
Creativity & innovation
Leadership qualities

Machine Learning Expert with specialization in Computer Vision, NLP, Deep Learning, and Large Language Model.

Current Tech Stack: Python, Java, TensorFlow, AWS Lambda, DynamoDB, Apache Spark, Docker
10+ years across Amazon, Loyalty Platforms, CognitiveScale, Intel Security, and Cadence Design Systems.

Harshit Kumar Gupta
ML
AI
🚀
Machine Learning
Deep Learning
Computer Vision (CV)
Natural Language Processing (NLP)
Anomaly Detection
Feature Engineering
Model Interpretability

Featured Works

Here are some of my most impactful projects across Amazon, Loyalty platforms, and AI systems, showcasing expertise in distributed systems, machine learning, and large-scale data processing.

🛡️
JavaLambdaGlue +12

Disbursement Gatekeeper

Intelligent Request Gating and Anomaly Detection of Seller Disbursement handling High volumne of Monthly Disbursements using ML models for failure prediction, pending anomaly detection, bank grouping, and result code clustering

Key Achievements
  • Designed scalable system predicting anomaly detection for Selling Partner Disbursements
  • Training machine learning models for predicting anomalous scores based on past patterns
  • Built Bookkeeper to keep track of failures at various indexes
JavaLambdaGlueStep FunctionHerdApolloRedisDynamoDBHubbleRedshiftSageMakerLogistic RegressionNeuralProphetTop2VecFuzzyWuzzy
PythonSageMakerLambda +6

Smart Charging Time Recommendation System

ML-powered charge timing optimization system for Amazon Seller Debt Manager (SDM) to improve debt recovery rates and charge success rates through intelligent retry scheduling

Key Achievements
  • Designed statistical voting classifier ensemble model to predict optimal charge retry times for failed transactions
  • Improved overall charge success rate from 40% baseline using temporal and regional features (Region, Card Type, Hour, Day)
  • Built end-to-end ML pipeline with SageMaker training, inference endpoints, and Lambda integration
PythonSageMakerLambdaStatistical Voting ClassifierEnsemble MethodsHerd WorkflowsStep FunctionsGlueAppConfig
🎯
JavaSpring BootDynamoDB +7

Gravty (SaaS for Customer Engagement and Loyalty Management)

Designing offers and campaigns for customer attraction with cashback and points management. Customer targeting using on-fly offer creation and dynamic rules. Building Predictive and analytics model for customer, location retain and product, offer recommendation.

Key Achievements
  • Designed scalable transaction layer for realtime and batch processing
  • Designed Drools based execution engine to execute custom rules created from blockly
  • Designed Incremental Data Processing Pipeline in Data Lake
JavaSpring BootDynamoDBLambdaFargateDroolsBlocklyApache HudiPySparkAWS
📈
Pythonscikit-learnApache Livy +3

Debt Risk Hotspots ML Model

Discover patterns of features of accounts that are conducive to bad debt.

Key Achievements
  • Interpreting model to understand similar bad debt accounts
  • Defining similarities of accounts quantitatively by using feature importance
  • Stratification to understand characteristics of clusters
Pythonscikit-learnApache LivySparkCatBoostSHAP
👥
PythonPandasOR-Tools +1

Patient Scheduling Constraint Optimizer

Recommend ideal patient schedules based on patient preferences and optimal use of resources.

Key Achievements
  • Building a Patient Scheduling model to efficiently schedule medical appointments
  • Using patient preferences, medical rules, time slots and facility availability as constraints
  • Solving complex combinatorial optimization using OR-Tools
PythonPandasOR-ToolsDocker
📈
Pythonpandasscikit-learn +3

Bad Debt Risk Advisor ML Model

Building predictive model to determine bad debt for Hospital Billing and Professional Billing

Key Achievements
  • Building a model for predicting bad debt accounts
  • Developing KPI for measuring model performance
  • Building jobs for distributed data processing in Hive Using Spark
Pythonpandasscikit-learnApache LivySparkCatBoost
🧠
JavaPythonXgBoost +3

Predictive User Engagement on Tax Filing System

Improve user engagement and increase Tax Filing by using the ML model.

Key Achievements
  • Building model to handle historical data and click through data both
  • Building a predictive model for user intervention
  • Improving model response time in Production to handle peak season load
JavaPythonXgBoostMongoDBDockerAWS

Technologies & Tools I Work With

Python
Java
AWS Lambda
DynamoDB
Apache Spark
Docker
TensorFlow
Scikit-learn
Redis
CloudWatch
Step Functions
Machine Learning

Professional Experience

My journey through the tech industry, from software engineering to machine learning, across leading companies in AI and enterprise software.

Senior Software Engineer (SDE II)

Amazon.com Services LLC
July 2022 - Present
San Diego, California, USA

Leading development of intelligent disbursement systems and anomaly detection for seller payments failures.

Key Projects
Disbursement Gatekeeper

Intelligent Request Gating and Anomaly Detection of Seller Disbursement handling High volumne of Monthly Disbursements using ML models for failure prediction, pending anomaly detection, bank grouping, and result code clustering

JavaLambdaGlueStep Function +11 more
  • Designed scalable system predicting anomaly detection for Selling Partner Disbursements
  • Training machine learning models for predicting anomalous scores based on past patterns
Smart Charging Time Recommendation System

ML-powered charge timing optimization system for Amazon Seller Debt Manager (SDM) to improve debt recovery rates and charge success rates through intelligent retry scheduling

PythonSageMakerLambdaStatistical Voting Classifier +5 more
  • Designed statistical voting classifier ensemble model to predict optimal charge retry times for failed transactions
  • Improved overall charge success rate from 40% baseline using temporal and regional features (Region, Card Type, Hour, Day)

Senior Software Engineer (SDE II)

Amazon Development Centre (India)
September 2020 - July 2022
Hyderabad, Telangana, India

Designed scalable systems for journal processing and AWS native service migration with focus on distributed data processing.

Key Projects
SPURSH

Aggregates transaction details into journal entries that are posted to the General Ledger

JavaFargateBatchOFA +3 more
  • Designed scalable system for JournalPostmanService and JournalPostmanPreprocessor
  • Removed Journal status publishing and ticketing dependency from OFA

Product Engineering Architect

Loyalty Juggernaut
June 2019 - September 2020
Hyderabad, Telangana, India

Architected cloud platform for customer engagement and loyalty management with predictive analytics, building ML models for customer retention and product recommendation.

Key Projects
Gravty (SaaS for Customer Engagement and Loyalty Management)

Designing offers and campaigns for customer attraction with cashback and points management. Customer targeting using on-fly offer creation and dynamic rules. Building Predictive and analytics model for customer, location retain and product, offer recommendation.

JavaSpring BootDynamoDBLambda +6 more
  • Designed scalable transaction layer for realtime and batch processing
  • Designed Drools based execution engine to execute custom rules created from blockly

Senior Software Development Engineer

CognitiveScale
November 2017 - June 2019
Hyderabad, Telangana, India

Building scalable and responsible AI Applications using ML models and evaluating them on various KPIs, then deploying as microservices.

Key Projects
Debt Risk Hotspots ML Model

Discover patterns of features of accounts that are conducive to bad debt.

Pythonscikit-learnApache LivySpark +2 more
  • Interpreting model to understand similar bad debt accounts
  • Defining similarities of accounts quantitatively by using feature importance
Patient Scheduling Constraint Optimizer

Recommend ideal patient schedules based on patient preferences and optimal use of resources.

PythonPandasOR-ToolsDocker
  • Building a Patient Scheduling model to efficiently schedule medical appointments
  • Using patient preferences, medical rules, time slots and facility availability as constraints
Bad Debt Risk Advisor ML Model

Building predictive model to determine bad debt for Hospital Billing and Professional Billing

Pythonpandasscikit-learnApache Livy +2 more
  • Building a model for predicting bad debt accounts
  • Developing KPI for measuring model performance
Predictive User Engagement on Tax Filing System

Improve user engagement and increase Tax Filing by using the ML model.

JavaPythonXgBoostMongoDB +2 more
  • Building model to handle historical data and click through data both
  • Building a predictive model for user intervention

Senior Software Development Engineer

Cadence Design Systems
February 2017 - October 2017
Noida, Uttar Pradesh, India

Developed EDM (Enterprise Data Management) solutions for collaborative library and design data management.

Key Projects
Allegro EDM Solutions

Collaborative Library and design data management system

J2SESwingTinker PopSQLg +2 more
  • Involved in development of EDM (Enterprise Data Management)
  • Designed DAO layer to support SQL, NoSQL and Graph Databases

Senior Software Development Engineer

Intel Security (McAfee)
July 2015 - February 2017
Gurgaon, Haryana, India

Worked on Application Control and Change Control with Global Threat Intelligence systems.

Key Projects
Application Control and Change Control

Global Threat Intelligence system for application security

J2EEMFS (Spring based Framework)JavaScriptMS SQL
  • Drove several features independently and contributed to end-to-end delivery
  • Led feature to support SHA-256 for Rule Groups and Policy Discovery

Research Projects

Academic research in Machine Learning, Computer Vision, and NLP during my M.Tech at IIT Delhi and independent ML research projects with open-source contributions.

M.Tech Thesis
🎓

Assessment of Autism Spectrum Disorder in Toddlers using Speech Features

Supervisor: Dr. Santanu Chaudhury, Dept. of Electrical Engg, IIT Delhi

Designing an Android App to collect voice sample and store in the cloud

Key Contributions
  • Analysis of Speech Samples using Spectrogram and Scalogram
  • Feature Extraction using Discrete Wavelet Transform and Discrete Wavelet Packet Analysis
  • Classification of speech samples using SVM, Random Forest, HMM, CNN classifiers
  • Application of Deep Learning Convolutional Neural Network for Feature Learning and Classification
Pythonnumpyscipyscikit-learncaffepylearn
Fake Profile Detection using Machine Learning
Research Project

Fake Profile Detection using Machine Learning

Detect fake profiles in online social networks using multiple machine learning techniques

Key Contributions
  • Implemented Support Vector Machine, Neural Network, and Random Forest algorithms
  • Developed comprehensive fake profile detection system for social media platforms
  • Created Jupyter notebooks for interactive analysis and model comparison
  • Achieved high accuracy in distinguishing authentic vs fake social media profiles
Pythonscikit-learnpandasnumpymatplotlibpybrain
Twitter Sentiment Analysis using Machine Learning
Research Project

Twitter Sentiment Analysis using Machine Learning

Sentiment analysis of tweets using machine learning and natural language processing techniques

Key Contributions
  • Implemented Naive Bayes and SVM models for sentiment classification
  • Developed comprehensive text preprocessing pipeline with stop words removal
  • Created lexicon-based sentiment analysis using positive/negative word dictionaries
  • Built scalable sentiment prediction system for real-time Twitter data analysis
Pythonnltkscikit-learnpandasnumpy

Research Specializations

🧠
Machine Learning
Deep Learning, CNN, SVM, Random Forest, HMM classifiers for autism detection
🎵
Signal Processing
Speech analysis using Spectrogram, Scalogram, and Wavelet Transform
💬
NLP & Social Media
Sentiment analysis, profile detection, and social media analytics
10+
Years Experience
4
Research Projects
11
ML models in Production
IIT
Delhi Alumni
Open Source Contributions
67+ GitHub Stars
87+ Forks
Open Source Projects

Academic Projects

Course projects and academic assignments covering web development, mobile applications, security systems, and data processing during my academic journey.

📸
Academic Project

Image Encryption & Decryption and Transformation

Secure & Compressed image transfer system

Key Features
  • Implemented secure image transfer with compression
JavaCryptography
👩‍⚕️
Academic Project

Aayush

Web platform for interaction of Doctors and Patients using J2EE with Struts Framework

Key Features
  • Built complete doctor-patient interaction platform
J2EEStrutsWeb Development
🌤️
Academic Project

Weather Forecasting Application

Using Yahoo Weather API and support for Offline Queries

Key Features
  • Implemented weather forecasting with offline support
API IntegrationMobile Development
📸
Academic Project

Illuminance Correction

Android App for Illuminance Correction on Image using OpenCV for Android

Key Features
  • Developed mobile app for image correction
AndroidOpenCVImage Processing

Academic Skills Developed

🌐
Web Development

J2EE, Struts Framework, Full-stack development

📱
Mobile Development

Android apps, OpenCV integration, Image processing

🔐
Security & Encryption

Cryptography, Secure data transfer, Compression

🛠️
API Integration

External APIs, Data processing, Offline capabilities

Want to see more projects?

Recommendations & Shout Outs

What colleagues, managers, and collaborators say about working with me across Amazon, AI startups, and enterprise software companies.

KT
🧠
Kranthi Tej

Program Manager

CognitiveScale Former Manager
"

Harshit is a very talented individual who comes up with innovative methodologies to solve ML problems. He is very good at implementing these models in live applications. He did an amazing job in enhancing risk prediction solution with significant amount of operational constraints. Very capable individual to be on a team dealing with challenging problems.

"
Key Skills Highlighted:
Machine LearningRisk PredictionInnovation
PK
🧠
Prajna Kandarpa

Engineering Manager

CognitiveScale Former Manager
"

Harshit is a delightful engineer and one of the most amicable people I've worked with. His experience in signal processing using deep learning techniques came in quite handy for our work at CognitiveScale. I must add that he singlehandedly built an entire data science pipeline capable of handling multiple hundred requests/sec traffic throughput. A great problem solver and very reliable. Any software engineering team would love having a member like Harshit on their roster.

"
Key Skills Highlighted:
Deep LearningSignal ProcessingData Pipeline
PJ
🛡️
Pankaj Joshi

System Test & DevSecOps Strategist

Intel Security Senior Colleague
"

Harshit joined my team 1.5 years back, Harshit came with very strong fundamental and conceptual knowledge, and soon acquired good understanding of the product & process. I found him always motivated to grab the complex work and he has shown his innovative ways to simplify things. He is a problem solver and an asset to my team.

"
Key Skills Highlighted:
Problem SolvingDevSecOpsInnovation
AC
🛡️
Amit Chopra

Senior Program Manager

Intel Security Team Colleague
"

Harshit is very committed, exhibits true enthusiasm at work and is a very good team player. He is technically very sound. He was a tremendous asset to our group and was always capable of handling multiple assignments. He is quick to understand things and has good debugging skills. He also shows sense of urgency and is able to complete his work on time. Harshit at many times has stretched to meet tight deadlines.

"
Key Skills Highlighted:
Team LeadershipDebuggingProject Management

🎉 Internal Shout-Outs

Recognition from Amazon colleagues for exceptional work

📦
Udit Khimesra
Senior Software Engineer • Amazon
🏆 Internal Recognition
October 27, 2021
"

Thanks Harshit for doing seamless delivery of JPPS Optimization Change. It was a complex change and required working closely with AE team.

"
Amazon Leadership Principles Demonstrated:
Deliver ResultsOwnershipInsist on the Highest Standards
2 likes
🎯 Complex Delivery Achievement

Professional Resume

Comprehensive overview of my qualifications, certifications, and achievements in Machine Learning and Software Engineering.

Technical Skills

🤖 Machine Learning & AI
Machine LearningDeep LearningComputer Vision (CV)Natural Language Processing (NLP)Anomaly DetectionFeature EngineeringModel Interpretability
📚 ML Frameworks
TensorFlowScikit-learnPyTorchKerasXgBoostCatBoost
💻 Programming Languages
PythonJava 8JavaScriptR
☁️ AWS Cloud
LambdaFargateGlueStep FunctionsCloudWatchCDKBatch
🗄️ Databases
PostgreSQLMongoDBRedisDynamoDB +3 more
📊 Big Data & Analytics
Apache HiveApache SparkApache HudiPySparkHadoop Ecosystem
🚀 Specializations
Distributed SystemsMicroservicesData Processing PipelinesReal-time AnalyticsPredictive Modeling

Education

🎓
2013-2015
M.Tech in Computer Technology

Indian Institute of Technology, Delhi

GPA: 8.43/10

Thesis: Assessment of Autism Spectrum Disorder in Toddlers using Speech Features
🎯
2009-2013
B.Tech in Computer Science & Engineering

Kamla Nehru Institute of Technology, Sultanpur

Grade: 81.16%

Certifications

📜
AI Agents Fundamentals
Industry Certification 2024
📜
Machine Learning
Stanford University, Coursera 2015
📜
Algorithms: Design and Analysis, Part 1
Stanford University, Coursera 2015
📜
Algorithms: Design and Analysis, Part 2
Stanford University, Coursera 2015
📜
Image and Video Processing
Duke University, Coursera 2015
📜
J2EE Struts with Hibernate
Professional Certification 2016
📜
Getting and Cleaning Data
Data Science Certification 2016
📜
R Programming
Statistical Computing Certification 2016

Awards & Achievements

🏆

BugBash for MAC 8.0 Award - Intel Security

🏆

3rd prize winner in Mind-Hunters event, National Level Techfest, Effluence

🏆

2nd prize winner in Fill Up The Code event, Tech Carnival By Computer Society Of India

🏆

Consolation prize in Paper Presentation on "WEB 3.0", organized by I.E.I.

🏆

Consolation prize in Technical Wordsworth event organized by Computer Society of India

🏆

State Level project on Water Resource Conservation in National Children's Science Congress

Professional Qualities

🧠

Self-motivated team player with strong analytical, problem solving, planning and resource optimization skills

💡

Possess creativity & innovation, flexibility & adaptability and interpersonal skills with leadership qualities

🚀

Passionate about learning new technologies and tools and mastering those in a very short time span

Download Full Resume

Get the complete PDF version of my resume with detailed project descriptions and technical specifications.

Download PDF Resume