
Rahul Sharma

Senior Data Scientist

Professional Summary
8 years of experience in Machine Learning/AI

Technical Skills

NLP, LLMOps, PEFT, LoRA, Fine-Tuning, Generative AI (LLM), GPT-4, LangChain, Learning Algorithms,
Large Language Models, Advanced RAG, Chatbot Development, Machine Learning, CNN, OpenCV,
Deep Learning, Airflow, Git, Docker, Hugging Face, AWS ML, Azure, GCP, Cloud Practices, PyTorch,
TensorFlow, Predictive Modelling, Feature Engineering, Statistics, Business Analytics, Data
Engineering, Communication, Leadership, Python, Agile environment, DevOps.

Work Experience:
• Working as Senior Data Scientist at Nucleocoders Technologies Pvt Ltd from Aug 2023
to present.
• Worked as Software Engineer at Onward Technologies from March 2020 to July 2023.
• Worked as Software Engineer at Twyst Technologies Limited from Aug 2016 to April 2020.
• Worked as Business Analyst at Parkzap Labs from July 2013 to July 2016.

Projects

Project 1

Description: Chatbot on Open-Source McKinsey documents with Advanced RAG strategies


Duration: Aug 2023 to present
Technologies Used: Generative AI (LLM), LangChain, Large Language Models, Azure, OpenAI, Data
Warehousing, Synapse DW, Agile
Roles & Responsibilities:
• Developed a POC chatbot with guardrails and an evaluator, using Cohere embeddings,
Cohere reranking through a contextual compressor, and the GPT-3.5 Turbo 16k LLM.
• A working demo on Streamlit is available. (Streamlit, EC2, Cohere, OpenAI, NeMo
Guardrails, TruLens)

Project 2

Description: FAQBot on Insurance Agent Question Answers


Duration: Jan 2023 to July 2023
Technologies Used: Azure, Hive, Generative AI (LLM), Data Warehousing, Synapse DW,
AWS, Agile, Python, DevOps
Roles & Responsibilities:
• Created an FAQ bot using CSV data as the source, sentence-transformers embeddings,
FAISS indexing, and OpenAI as the LLM.
• The goal was to make the FAQ page of the website interactive. (AWS,
sentence-transformers, SageMaker, Lambda)

Project 3

Description: CourseBot
Duration: May 2020 to Dec 2022
Technologies Used: AWS ML, Cloud Practices, PyTorch, TensorFlow, Agile, Predictive Modelling, Azure,
Hive, Data Warehousing, Synapse DW, Python
Roles & Responsibilities:
• Designed and developed data pipelines, data warehouses and data marts to integrate
new data sets from different sources into a data platform.
• Built an advanced bot capable of answering users' questions about tabular course
data, with fields such as course name, number of lectures, rating, and description.
(LangChain, PGVector, Postgres, OpenAI, Pinecone)

Project 4

Description: Unveiling Multifaceted Insights through NLP: Analysing Newspaper Articles


Duration: Jan 2019 to April 2020
Technologies Used: Azure, Hive, Data Warehousing, Synapse DW, NLP, DevOps, Algorithms
Roles & Responsibilities:
• Designed and developed data pipelines, data warehouses and data marts to integrate
new data sets from different sources into a data platform.
• Analysed the articles in depth by integrating a variety of NLP techniques, from text
summarization and sentiment analysis to topic modelling and named entity recognition.
• Text Summarization with BERT, Sentiment Analysis with DistilBERT, Topic Modelling with
BERTopic, Weekly Sentiment Analysis, Named Entity Recognition (NER) for Key Figures.

Project 5

Description: Transplant Wait Time Prediction


Duration: Aug 2016 to Dec 2018
Technologies Used: Azure, ETL, Data Warehousing, Big Data, DevOps, Algorithms, Python, Agile
Roles & Responsibilities:
• Created a statistical model to predict waiting time for prospective transplant recipients
using survival analysis methods.
• Performed rigorous statistical tests and feature engineering on the dataset.
• Applied the Cox proportional hazards model to achieve a respectable c-index of 0.68.

Project 6

Description: Pneumonia Detection on RSNA CXR


Duration: Aug 2013 to July 2016
Technologies Used: Azure, ETL, PySpark, Python, Big Data, Algorithms, Agile
Roles & Responsibilities:
• Analysed DICOM medical images and performed EDA on the challenging
RSNA dataset.
• Used various models, including InceptionV3, YOLOv5, YOLOv7, a modified version of U-Net,
and Mask R-CNN, to classify, detect, segment, and instance-segment opacity regions.

Education:

IIIT Delhi - PG in Computer Science and Artificial Intelligence.


IIM Indore - Integrated Program in Business Analytics
IIT Roorkee - B.Tech
