SaugataPaul DS AIML

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Saugata Paul

DATA SCiENTiST · COMPUTER ViSiON, NLP, TiME‑SERiES FORECASTiNG, MLOPS


16, Niharika Abasan, Bhuban Mohan Roy Road, Kolkata, West Bengal, India. PIN - 700008
 (+91) 973-982-0993 |  (+91) 896-140-1973 |  saugata.paul2020@gmail.com |  saugatapaul1010 | 

saugata-paul-06413b126/ |  saugata.paul1010

Summary
Experienced Data Scientist with experience in building, serving, and monitoring data science applications with experience in designing/implementing
cloud based solutions. I have a strong understanding of various statistical and machine/deep learning techniques, with extensive experience in Python
and SciPy‑Panda‑NumPy‑Sklearn‑Tensorflow‑Keras stacks. I have experience in version control, git, containerization/virtualization of applications using
Docker, Kubernetes. I have fluent English communication skills and a good ability to work closely with team members, senior management and business
stake holders.

Work Experience
Atos‑Syntel, R&D Bangalore, India
ASSOCiATE CONSULTANT ‑ DATA SCiENCE Apr. 2021 ‑ Present
• Develop and operationalize algorithms, models, and applications to support machine learning research and product development.
• Partnered closely with domain experts to develop a Time‑Series forecasting tool for forecasting and preventing crucial batch jobs from failing.
• Conceptualized the architecture and developed an NLP‑based CI/CD pipeline in Azure DevOps and with ML‑OPS SDKs. Created an end‑to‑end appli‑
cation in Django, for one‑click model training & deployment.
• Developed a BERT‑based question‑answering framework using Facebook Haystack. Built a Sentiment Analysis pipeline using voice data from a leading
call center which can identify and highlight negative sentiments in real‑time.
• Apply data storytelling skills to translate data insights into business recommendations, & help organizations make data‑driven decisions.
ThirdEye Data Kolkata, India
DATA SCiENTiST Jan. 2021 ‑ Apr. 2021
• Spearheaded the development of a system for Southern California Edison, that can identify faulty electric poles across the state of California, thereby
reducing the risks of forest wildfires that might result from non‑maintenance of these poles. Images were captured using drones. Developed object
detection and image segmentation models and deployed the system using Microsoft Azure.
• Built a system for Glas Trösch Switzerland, by leveraging AI/ML technologies, to predict the metrology results of coated glass, performed predictive
analysis & maintenance, and give recommendations for predictions that falls outside an acceptable quality range.
Straive (SPI Global) Chennai, India
DATA SCiENTiST Sep. 2019 ‑ Dec. 2020
• Part of a computer vision team that successfully generated $ 2M project titled ’E‑Crash’, for a leading USA client. Did requirement analysis, prototyping,
stakeholder discussions & deciding KPIs.
• Initiated & developed computer vision and text processing pipelines for ’E‑Crash’, to extract information from Traffic Accident crash reports for all
the US counties. Used TensorFlow Object Detection API to build object detection models, which is used to predict more than 500 unique objects.
Deployed the models using Docker. Reduced the current business workflow by more than 70%.
• Developed and managed a Computer Vision System for a leading firm in India, to detect objects in credit card images, which is used by the business
team to create databases and check for loan defaulters, automatically.
• Developed a deep learning system for a leading transportation company in the USA, that can detect traffic signals & road signs from images captured
by a business team, and create a map database that stores all this information along with their geo‑location coordinates.
Applied Roots, AAIC Technologies Hyderabad, India
MACHiNE LEARNiNG INTERN Jul. 2019 ‑ Sep. 2019
• Formulated Deep Learning pipelines for Image Classification and used it to design, develop & deploy a web application for medical image analysis that
can classify 6 types of diseases in real‑time ‑ Malaria, Brain Tumor, Pneumonia, Diabetic Retinopathy, Breast Cancer, Optical Coherence Tomography.
• Developed a classification model that can identify & classify malware into 9 classes by analyzing their ASM and Byte file details. 500+ GB of highly
imbalanced malware data were used to analyze and train the model. Used SelectKBest for feature selection.
• Implemented a research paper to build and optimize a multi‑label text classification algorithm for predicting movie tags/genres using Plot Synopsis
Movie Dataset with high micro averaged F‑1 scores.
• Implemented a research paper published by Nvidia, to build an end‑to‑end Self Driving Car system using Open‑CV, using data collected from San
Francisco Bay Area. Trained a CNN to map raw pixels from a single front‑facing camera directly to steering commands. The system operates at 30
frames per second.
Infosys Bangalore, India
SENiOR SYSTEM ENGiNEER Dec. 2015 ‑ May. 2018
• Developing and maintaining a web service application for a leading legal firm in the USA, contributed to its coding, testing, building, development,
integration, defect fixing and prevention, write test cases in selenium.
• Researched and built ML Inference Pipelines for classification of ServiceNow Ticket severity, analysis, and deriving insights from production logs using
regex‑based text processing techniques and ML algorithms like K‑Means clustering, DBSCAN, frequency distribution, word clouds, etc.
Tools
Tools & Skills
• Python, Django, Flask, GIT, Linux, RESTful API, Docker, Microsoft Azure, Keras, TensorFlow, SK‑Learn, Matplotlib, Seaborn, SQL, Pandas.
• Classification, Regression, Clustering, Time‑Series Forecasting, Object Detection, Image Segmentation, Deep Learning, Text Extraction.
• Neural Networks, TensorFlow Object Detection API, Fast R‑CNN, Faster R‑CNN, YOLO, OpenCV, PIL, Mask R‑CNN, LabelImg, Pytesseract, PyPDF2, OCR,
Poppler, Scikit‑Image, SciPy, Numpy, Nvidia CUDA.
• NLTK, Spacy, Gensim, Hugging‑Face, BERT, LSTM, Facebook Prophet, XGBoost, Splunk.
• Hyperas, Hyperopt, TPOT, K‑Fold, Hypothesis Testing, Statistics, Feature Engineering, Feature Selection, Data Analysis.
• Docker, Microsoft Azure ML Studio, Azure DevOps, Azure MLOps CI/CD, AKS, Heroku, MLFlow, DVC Studio.

Blogs & Personal Projects


Medium Kolkata, India
CONTENT WRiTER Nov. 2018 ‑ Present
• Creating and deploying Deep Learning based android and web applications for medical image analysis.
• A case study on Malaria detection using cell images and Deep Convolution Neural networks in Keras.
• A detailed case study on Multi‑Label Classification with Machine Learning algorithms and predicting movie tags based on plot summaries.
• Ensemble Learning — Bagging, Boosting, Stacking and Cascading Classifiers in Machine Learning using SKLEARN and MLEXTEND libraries.
GitHub Kolkata, India
DEVELOPER & CONTRiBUTOR Aug. 2016 ‑ Present
• An end‑to‑end deep learning‑based web and android application for medical image analysis.
• Open‑Source Deep Learning Pipeline for Image Classification.
• Microsoft Malware Detection using BYTE and ASM files.
• Multi‑label text classification framework for predicting movie tags/genres using Plot Synopsis Movie Dataset.
• Building an end‑to‑end Self Driving Car algorithm for US Roads, using data obtained from San Francisco Bay Area.
• Clustering algorithm combined with Time‑Series networks, to forecast yellow taxi demand prediction in 10‑minute intervals, in New York City.
• Personalized Cancer Detection framework using classical Machine Learning.
• Human Activity Recognition, using data obtained from FitBit watch sensors.
• Amazon Fine Food reviews Classification and Sentiment Analysis.
• Netflix Movie Recommendation Engine using Truncated‑SVD and Matrix Factorization.

Certifications & Courses


Certifications
• Microsoft Certified Azure AI 900.
• AAIC Certified Applied AI Engineer.
Coursers
• Artificial Intelligence A‑Z.
• Deep Learning A‑Z.
• Machine Learning A‑Z.

Education
Government College of Engineering and Ceramic Technology (MAKAUT | WBUT) Kolkata, India
B.TECH. iN COMPUTER SCiENCE AND ENGiNEERiNG Aug. 2011 ‑ May. 2015
• CGPA ‑ 8.07
Vivekananda Mission School Kolkata, India
ISC, INDiAN SCHOOL CERTiFiCATE Apr. 2008 ‑ May. 2010
• Score ‑ 91.25 %
Vivekananda Mission School Kolkata, India
ICSE, INDiAN CERTiFiCATE OF SECONDARY EDUCATiON Apr. 1994 ‑ Mar. 2008
• Score ‑ 92.60 %

Hobbies & Interests


• Passionate about Landscape Photography, Long Exposure Photography, Post Processing.
• Extremely passionate about travelling, visiting different places and exploring different cultures.

You might also like