Resume

Experience

epiFi Technologies Pvt. Ltd.

Fintech, Neobanking Startup by Gpay Founders

Data Scientist [June 2020 - Present]

I joined here as a part of the top@50 early members of the organization and second in the data science team. I built core features and structured the data science team, making the new-generation millennial banking experience more personalized and user-friendly.
Setup MLOps components: Airflow for Automated Model Pipelines, Data Version Control, BentoML for Model Inferencing system, Kibana for Logging
Responsibilities
- Hiring best talent for the company
- Mentoring Interns and Freshers in Data Science team
- Business requirements understanding, communicating metrics and helping leadership team in taking data-driven decisions
Problems Solved:
- Ranking of Users on Fi Waitlist using Multi-Criteria Decision Making Article Link
- Retaining Users on Waitlist by sighting them with our feature Insights powered by NLP Data Science Model. Here we look up the user's merchant mails and extract price using a NER Model. Example of an Insight from this feature: "You have spent 10000 in the last 3 months at Amazon." making users aware and conscious of their online spending. Product Demo Link
- Understanding customer support requirements and helping customer support agents by automating the issue categorization process. Issue categorization helps us know the friction points of our product. Powered with Zero-Shot-Classifier Model and list of predefined issue tags.
- Improving in-app help using data by making changes to the help screen so one can find answers faster. To make it easier to find the right answer after landing on the wrong search result, added 'Related articles.' such that related articles are semantically similar to the one you are currently reading. Then this entire flow of finding related articles is automated by Airflow.
- Based on user-features Demographics, Geographics, Technographics Features, Income, etc. group users to find accurate user peer groups using K-Prototypes clustering Model. Outcome: Over this peer group show personalised user-insights.

1mg Technologies Pvt. Ltd.

HealthTech, e-pharmacy platform

Data Scientist [Feb 2019 - June 2020]

Problem Solved:

On product pages of 1mg, we showed a Similar Products List which was to improve platform diversification by allowing a user's to explore and exploit the platform. This similar product widget was powered by BERT-word embedding on meta-data of the product.
Personalized Recommendation of Products using Neural Collaborative Filtering, this personalized recommended list in comparison to the Popular Products list increased Add To Cart by 32% and CTR by 38%.
Worked on the entire homepage metric pipeline from setting up CTAS queries on Athena to Scheduling Script on Airflow with a flock webhook that posts daily metrics.
Worked on Trending Products Widget on Search, which is on ZScore Moving Average idealogy. It helped us gain 60% orders by this widget when compared in A/B test with a popular product list. Medium Blog
Finding Disease Trends time series analysis.
Ranking reviews by relevance: sorting reviews by relevance. Research Paper Link
Worked on search algorithm enhancement, reduced unsupported search by 20% using NLP methods, LDA, TextRank and TF-IDF on product descriptions to extract related keywords and phrases.
Worked on Product Filter Tags automatic generation for products that helps customers to easily find a product on our page. Filter Tags on Left Side of Category Page
Multilabel Classification, Auto Categorization of Products to different Categorizes using product features like color, name, uses, product form, ingredients, etc.
Creating an NLP library that has the ability to pre-process data, extract features, do phonetic matching, string matching, get readability score, etc.
Awarded First Data Science Sprint Hero @1MG Link

InterviewBit/Scaler Academy

Edtech platform

Data Scientist (SME) Subject Matter Expert [Nov 2018 - June 2020]

InterviewBit is an E-learning Platform (Top #10 startups of India by LinkedIn) which also conducts hiring challenges.
For Machine Learning/ Data Science challenges, I have been their problem setter and subject matter expert. This experience taught me a lot, I got to work on different problems like Text Classification, Text Summarization, Keyword Extraction from Text, Regression and Classification problems. Created 14+ problems for InterviewBit which were used in Pan India competitions of which in one of them, more than 200k+ developers participated.

Skills

Programming Languages	Framework, Tools and Services	Data Platforms used	Professional Skills
Python	Airflow	PostgreSQL	Natural Language Processing
SQL	Data Version Control	Athena	Time Series Analysis
C++	BentoML and Cortex	Snowflake	Automation with Selenium
PySpark	MLFlow	S3	Machine Learning
	FastAPI	Cockroach DB	Data Analysis
	AWS	DynamoDB	Data Visualisation
	Kibana: Monitoring and Logging
	Metabase: Analytics tool

Education

Jaypee Institude of Information Technology

Bachelors of Technology (Hons.) - Computer Science (CGPA - 8.6)

Research Papers

Uppal, S., Jayal, A., & Arora, A. (2019). Pairwise Reviews Ranking and Classification for Medicine E-Commerce Application. LINK
Uppal, S., Jain, A., & Arora, A. (2020, January). Comparative analysis for keyterms extraction methods for personalized search engines. In 2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence) (pp. 358-363). IEEE. LINK

Data Science Video Courses by Shaurya

Awards

Got awarded 4 times sprint champion at 1mg
Best paper presentation award in IEEE Conference: Confluence 2020
Awesome Open Source Projects - PyWhatsapp LINK