About

Analytics and Data Science professional with a keen interest in machine learning and statistics. By comprehensive exposure to the underlying concepts and applying them vividly to various projects, my love for these domains came into being. I am a passionate individual who thrives to build and apply models to solve real-world industry problems.

As a way to recharge and pursue my personal interests, I often take time off to engage in activities such as motorbike riding, hiking, kayaking, and swimming. I also engage in more structured physical activities, such as playing badminton, football and enjoy watching my favorite sports games. To stay informed about developments in my field, I also make it a point to read blogs on business and technology trends.

  • Birthday: 26 September 1997
  • Phone: +1 (000)-000-0000
  • City: New York
  • Email: sunilkamkar97@gmail.com

Interests

Mathematics

Machine Learning

Deep Learning

Natural Language Processing

Data Engineering

Visualization

Statistics

Analytics

Education

MS in Business Analytics

July 2022 - May 2023
Relevant Coursework
  • Advanced Optimisation
  • Advanced Machine Learning
  • Marketing Analytics

PG Diploma in Data Science

July 2018 - July 2019
Relevant Coursework
  • Advanced Big data
  • Data Visualization
  • Natural Language Processing

BS in Computer Science, Mathematics, Statistics

June 2015 - May 2018
Relevant Coursework
  • Applied Statistics
  • Linear Algebra
  • Probability Distributions

Experience

2Blocks Analytics

January 2023 - May 2023

Machine Learning Engineer

  • Built 91% accurate classification models using GPT-3 to streamline business document processing
    • Fine-tuned Transformers and LLMs namely S-BERT, MPNET, and GPT-3 to embed unstructured text into vector representations, enabling semantic document comparisons and classification
  • Collaborated with start-up founders to develop text analytics product and deploy it on AWS EC2 instances
Tech: Python, SQL, GPT-3, AWS, Transformer Neural Nets, Natural Language Processing, Anvil, Model Pipeline and Deployment

Genpact

June 2021 - June 2022

Data Scientist

  • Led 4+ real-world evidence (RWE) projects independently and completed 11 projects in 12 months as founding data science team member, enabling team expansion from 3 to 6 members. Mentored colleagues to elevate best practices in coding.
    • Created the customer profiles to evaluate the impact of events to perform common statistical analysis, propensity score matching and survival analysis (Cox-PH model and Kaplan Maier Estimation)s.
  • Designed and devised a dynamic R shiny dashboard; providing real-time insights and enabling the client's analytics team to make informed decisions quickly and effectively resulting in saving hours of efforts everyday.
    • Created robust data models using AWS, handling 10 data sources, and efficiently managing over 100M rows of data.
  • Optimized market sizing estimation through implementation of spline regression, improving predictive accuracy by 8% and strengthening risk projections.
Tech: Python, R, R Shiny, SQL, Excel, Power BI

Eversana

Feb 2021 - June 2021

Associate Data Analyst

  • Developed an Alteryx workflow to analyze customer transitions between physicians and compute the average time gap for switching products, providing insights into physician performance and patient behavior
  • Automated and optimised SQL codes for GPO inventory tracking
  • Performed promotional analysis for product to evaluate market access, performance, and market share using python
Tech: Python, PostgreSQL, Alteryx, AWS

Iqvia

April 2019 - Feb 2021

Data Analyst

  • Led a team of 3+ members and developed quantitative models to design solutions that help analyze penetration and sales impact across targeted & mass media channels: Emails, Direct Mails, Mobile Alerts, Banners, Newsletters, Paid Search etc.
    • Delivered significant ROI growth through successful implementation of Marketing Mix Modeling (MMM) techniques for optimizing marketing campaigns utilizing regression techniques, log transformations, ad stock and lag effects.
    • Drove 10% sales lift by optimizing and measuring omnichannel campaigns through experimental studies (test & control) and tracking KPIs for personnel and non- personnel campaigns for 6+ brands across its life cycle .
    • Designed analytics solution with Python, SQL and Tableau (Dashboard) that automated engagement analysis, campaign effectiveness and ROI insights, reducing time from inception-to-delivery delay by 75%.
  • Developed machine learning models (Random Forest, XgBoost) identifying key drivers of product adoption with 73% accuracy, enabling clients to optimize marketing strategy and boost product adoption.
  • Architected scalable customer segmentation framework integrating K-means clustering and rule-based bucketing; optimized framework slashed analysis time by 50%.
Tech: Python, SQL, Tableau, Excel

Projects

Stylized Speech Synthesis

click here to know more

Cereal Recommender

click here to know more

Fifa'22 Topic Modeling

click here to know more

RFM segmentation and Recommender

click here to know more

ML Pipeline and Churn prediction

click here to know more

Cancer Tumor Keyword Identification

DoorDash order delivery

Applications using Python

WIP

Skills

  • Skills
  • Skills Evaluation

Languages and Databases

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

Frameworks

vectorlogo.zone vectorlogo.zone upload.wikimedia.org

Tools

vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone vectorlogo.zone

85%

Python

81%

R

91%

SQL

62%

HTML

95%

Scikit-learn

72%

Tensorflow

55%

OpenCV

50%

Flask

80%

Tableau

83%

MS-Excel

87%

Alteryx

68%

Heroku