Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 5

Examining DocDigitizer AI startup status and future expansion.

Birzhan Iskakov
ALY6980 – Capstone
Dr. Matt
22.01.222
Introduction or background.

Even though increase of data is beneficial, organizing and mining all of this information
remains a challenge. Despite the fact that artificial intelligence has been a significant part of data
analysis over the last decade, organizing such data remains a tough challenge (Mansmann et al.,
2010). As a result, the research focuses on data analysis techniques of mining data for the AI
start-up of DocDigitizer and evaluating it to provide useful information or reports that will assist
the DocDigitizer in determining the position in the competitive world of business. This will
provide DocDigitizer with an option as to what has to be addressed in detail and what still needs
to be improved for market sustainability.

Purpose statement.

The importance of discovering future firm expansions connected to data consumption and
processing for the Sponsor, who works with artificial intelligence processing focused on
documents, cannot be understated. Their flagship solution uses natural language processing
(NLP) to process and automate documents with near-perfect accuracy using AI/ML algorithms.
The issue in this scenario is that the sponsor needs to extract crucial information from papers,
which falls under the area of document imaging. To solve the problem with artificial intelligence,
we may use optical character recognition technology in conjunction with natural language
processing to extract the crucial elements required.

Computer vision technologies and neural networks, which are part of deep learning, are
used in optical character recognition. Recurrent neural networks are the sort of neural network
that is being used. For improved performance, these neural networks are multilayered. Using the
matplotlib package, one may view what picture was extracted from the document after the
extraction process. To measure the accuracy of the neural network output, we may compare it to
the predicted output and determine the error rate, after which we can modify the model for
optimal performance. Tuning can be accomplished by adjusting the threshold, modifying the
activation function, or adding more layers to the neural network. The retrieved information can
then be digitized and used in the data analysis stage of the analytical process.

Research question.
The study aims at looking into examining the condition of the business given the relevant
information obtained through mining the data by use of AI/ML algorithms. Therefore, the
information provided will be analyzed using the NLP because of the different languages used by
the customer needs and then give a report on the status and condition of the business in the
market completion. Therefore, the research question will be;

Does the information obtained through data mining for DocDigitizer startup have a positive
impact on the business expansion?

Methods and procedures.

AI/ML algorithms will be employed to mine data and analyze the data for examining the
status of DocDigitizer startup. NLP will also be employed for easy interpretation of the different
version of languages used. The project will collect the required resources from the DocDigitizer
start-up in order to provide a solution to the challenge of determining the firm's market
condition. This indicates that NLP models will be added in order to provide the most suitable
manner of comprehending the many languages that exist.

Literature review.

When AI, advanced analytics, and IoT work together, they deliver rapid, accurate data
about existing and future customers' wants and preferences, which feeds new product ideas. They
developed robots capable of replacing people in industries like as manufacturing, restaurants,
retail, and banking. They created IBM's Watson, which can sift through millions of pages of
research in seconds to provide doctors with data about diagnosis and treatment options, resulting
in better, more affordable healthcare (Kaplan, 2015, p. 150), and Google's Deep Mind program,
which can read lips more accurately than human lip readers (Kaplan, 2015, p. 150). (Chui,
George, & Miremadi, 2017, p. 1). "Automated trading algorithms are now responsible for over
two-thirds of stock market trades," according to the financial industry (Ford, 2015, p. 56). In
customer service, Amazon is testing Echo Look, a device equipped with a camera and
microphone that will provide feedback on how pieces of clothing seem on you. In terms of
goods, 3D printing is used to create a toupee, which is a biomaterial scalp prosthesis that matches
skin and hair color, as well as hair curl and thickness.
Manyika et al. (2013) state that "the advancement of artificial intelligence, machine
learning, and natural user interfaces (e.g., voice recognition) is making it possible to automate
knowledge worker operations that were previously deemed to be impossible or prohibitive for
computers to execute." This innovation has assisted several sectors in decoding the keywords
utilized; nonetheless, it is vital to assess how effectively it performs in languages other than
English.
Reference.
Manyika, J., Chui, J., Bughin, J., Dobbs, R., Bisson, P., & Marrs, A. (2013, May). “Disruptive
Technologies: Advances that will Transform Life, Business, and the Global Economy.”
McKinsey Global Institute. Retrieved from
www.mckinsey.com/business-functions/digital-mckinsey/.../disruptivetechnologies
Chui, M., George, K., & Miremadi, M. (2017, July). “A CEO Action Plan for Workplace
Automation.” McKinsey Quarterly. Retrieved from http://www.mckinsey.com/global-
themes/digital-disruption/a-ceoaction-plan-for-workplace-automation
Kaplan, J. (2015). Humans Need Not Apply: A guide to Wealth and Work in the Age of
Artificial Intelligence. New Haven, Connecticut: Yale University Press.
Keim, D., Kohlhammer, J., Ellis, G., & Mansmann, F. (2010). Mastering the information age:
solving problems with visual analytics.

You might also like