Data Analyst - Data - Scientist - ML - Engineer
Associate Data Scientist / Data Analyst +91-9163775855/9883269497
https://www.linkedin.com/in/pranab-kumar-manna-7436a3b9/ Kolkata, West Bengal, India
EXPERIENCE
Experience Summary:
Total 3 years of experience in the IT industry using Python, R, AWS technologies (Databricks, Athena, S3,
Kinesis, Data Pipeline, Redshift, SageMaker), MySQL, NoSQL, Hive, Tableau, NLP, Graph Database (Neo4j),
Image Processing, Deep Learning, Data Modeling, Django, Scala, Statistics, ML, AI, Data Frameworks (Spark,
Hadoop), TensorFlow, scikit-learn, Google Dialogflow, Operations Research.
Experienced in automating reports, building predictive models, and analyzing large volumes of pharma data to
discover trends and patterns.
Good work ethic with excellent communication, presentation, and interpersonal skills.
Quick to pick up new and emerging technologies.
Able to work well both in a team and independently.
Working Experience:
Company Name                   Location    Designation                      Tenure
AstraZeneca India Pvt. Ltd.    Chennai     Data Science Junior Associate    2 Years 3 Months
Accenture Solution Pvt. Ltd.   Bangalore   Data Science Analyst             11 Months
Domain Knowledge:
Pharma
PROFESSIONAL EXPERIENCE
Role Developer
Team Size 4
Company Name Accenture Solution Pvt Ltd
Responsibilities
Identify valuable data sources and automate collection processes.
Preprocess unstructured data.
Build a mixed-effects regression model to estimate the coefficient for each channel.
Propose data-driven solutions and strategies to business challenges.
Fix bugs.
Estimate effort for new enhancements.
Description: Developed a marketing mix analysis across marketing channels by building machine learning models to
estimate the contribution and impact of media channels at the national level, along with a tool that optimizes current
marketing spend to maximize revenue. The goal was to estimate the effectiveness of each marketing channel and to
optimize marketing investment across channels for the generic market.
Clinical Trial Assessment Optimization
Duration July 2021 – Sep 2021
Technologies Python, pyLDAvis, scikit-learn, gensim, spaCy, VADER, seaborn, wordcloud, NLP
Role Developer
Team Size 2
Company Name Accenture Solution Pvt Ltd
Responsibilities
Identify valuable data sources and automate collection processes.
Preprocess unstructured data.
Build a topic model to identify clusters.
Present findings and showcase the topics customers discuss most.
Propose solutions and strategies to business challenges.
Fix bugs.
Estimate effort for new enhancements.
Description: Collected unstructured data from Reddit, automated the collection using the Reddit API, and ran the code
overnight on GCP. We then applied LDA (Latent Dirichlet Allocation) to create topic clusters, chose the optimal number
of clusters from the coherence graph, and prepared a PowerPoint deck with insights to present to the stakeholders.
HCP Segmentation
Duration Nov 2019 – Feb 2020
Technologies Python, Neo4j, AllenNLP, PubMed data source, BeautifulSoup
Description: Collected unstructured HCP (Health Care Professional) data from sources such as PubMed, cleaned the
data, and used feature engineering to extract important features. We then visualized the data to identify Key Opinion
Leaders (KOLs) and promoted our brand directly to them. This approach reduces cost and man-hours while reaching the
maximum audience.
Reporting Framework
Duration July 2019 – July 2021
Technologies Python, pandas, NumPy, python-pptx, python-docx, openpyxl
Role Developer
Team Size 2
Company Name AstraZeneca India Pvt Ltd
Responsibilities
Understand data sources and business logic.
Preprocess, cleanse, and wrangle structured and unstructured data.
Automate the PowerPoint, Excel, and docx reports.
Analyze large amounts of data to discover trends and patterns.
Propose solutions and strategies to business challenges.
Fix bugs.
Estimate effort for new enhancements.
Description: Automated the pptx, Excel, and docx reports using Python scripts. The reports were initially created
manually in Excel. Automation reduced the effort by 10-12 man-hours per report.
EDUCATION
Pursuing Online B.Sc. in Programming and Data Science
IIT Madras, Madras, Jan 2022 – Jan 2025
Post Graduate Diploma in Big Data Analytics (PGDBDA)
Centre for Development of Advanced Computing (CDAC), Pune, Aug 2018 – Feb 2019
Master in Computer Application (MCA)
Rajabazar Science College (Calcutta University), Rajabazar, Kolkata, Aug 2012 – July 2015
B.Sc. (Honours) in Mathematics
Thakurpukur Vivekananda College (Calcutta University), Thakurpukur, Kolkata, June 2009 – July 2012