Professional Documents
Culture Documents
Pass 3 - Expected Volume and Database
Pass 3 - Expected Volume and Database
Pass 3 - Expected Volume and Database
To calculate the size of the target market in corporations, we begin by looking at the
number of corporations in the Philippines. DTI reported a total of 957,620 business
enterprises in the Philippines. This includes large corporations which make up 0.49% and
MSMEs which make up 99.51%. However, the platform will be more suitable for small,
medium, and large corporations. This is given that micro corporations only have 1-9
employees which may not find much value in the platform, since the platform is mainly
utilized for meetings. Accounting for this, it leaves 107,493 corporations from large to small
enterprises as potential users. The total number of employees from these sectors amounts
to 6,064,164 people. However, certain sectors may not utilize the platform like
manufacturing, and not all employees/departments need to use the platform. Other
corporations may have other reasons such as not being able to afford a platform or lack the
technological adoption to implement it in their organization. This means user penetration
should be much lower at about 50%. This leaves about 3,032,082 employees who serve as
potential users of the platform.
For schools, there are around 60,744 schools across different educational levels in
the Philippines. Assuming that public schools will be unable to afford the platform(unless a
partnership is done with the government or CHED) due to a lack of budget, this leaves about
13,132 schools (from private, LUC/SUC, and PSO). The DepEd estimate for the number of
teachers and personnel in these schools in 2020 was placed at 300,000. However, as with
any platform Transcribe.Ai will not necessarily be able to gain 100% user penetration in this
segment. These schools may a variety of reasons to not use the platform such as choosing
other alternatives, lacking budget, or lack of technology adoption. As such, an optimistic
estimate would be a 70% penetration for the platform at its peak. Multiplying this user
penetration to the remaining schools and teachers would leave about 9192 schools as
potential subscribers to the platform which leads to 210,000 personnel and teachers as
users.
The last target market would be users using the platform for personal use. However,
this will mainly be composed of students, since it is this type of user profile that would have
the most value for the use case of the platform such as for note-taking and summaries.
Considering that the platform may be too expensive for most public school students, only
non-public school students will be used in the estimate. According to DepEd, this estimates
3,332,054 students. However, not all students would use the platform for a variety of reasons
such as lack of budget, not caring enough to utilize the platform for studies, and more. As
such, the penetration for the platform would be around 40%, which leaves 1,332,822
students as potential users.
Overall, the total addressable market of the platform in the Philippines would be
estimated at 4,574,904 potential users. However, given that the platform will only begin
launching this means that it will unable to fully capture the entire market, and will likely
experience users churning at the earlier stages of the platform (due to bugs and user
interface being improved). For estimation purposes, Transcribe.Ai targets to have 10,000
users by year 1. This is because it will still be in its beta phase as it focuses on the
development of the platform, improving the user interface, and fixing bugs encountered. It
will then follow a 145% annual user growth rate following the industry average of Saas
startup, which will gradually decrease as the company reaches maturity. As such, in
succeeding years, the annual user growth rate will decrease at a constant rate (i.e. 145% in
Year 2, 135% in Year 3, 123% in Year 4). In addition, the churn rate of users will be included
in the estimate following the industry average with an annual churn rate of 8% observed in
subscription services. If the company finds success in the Philippines, it may begin its
expansion into other countries such as South East Asia.
Figure 1. User Forecast for Transcribe.Ai
Field Characters
Password 10**
Phone Number 10
The next big files would be those related to the account or platform features. This
means user specific features that are provided by the platform, and the general features of
transcribing files of the platform. For Account Plan, this was calculated by summing the
character lengths of the different plans available on the platform (Basic, Pro, Business and
Enterprise), and then dividing it by 4 which would be 6.5 characters. This estimate can be
adjusted in the future depending on the number of users who avail of each plan. Next, the
custom or personal vocabulary is a platform feature where the user can create their custom
word to be transcribe(Ex. “FF” can be a customer word which in gaming context means
surrender). Given that the average english word is around 4.5 characters, this was rounded
up and then multiplied by an estimate of 10 custom words per user. This is followed by the
account referral link where users can refer other people to the platform for benefits or
promotions. The account referral link was estimated to be 28 characters, following the length
of typical tinyurl url. In addition, file names would also be included for the transcribed files
with character lengths of up to 16, and with an average files of 20 per user. In line with the
transcribe files which are categorized by content, there will be an average characters per
category would be 10, with an average of 5 categories per user.
Field Characters
In order to minimize costs of storing audio files in the platform’s database, a hard limit
of 2GB will be implemented as the maximum capacity for all the audio files. According to
Standford, 1MB converts into around 1 minute of playtime in a mp3 file. This 2GB maximum
storage capacity per account then converts into around 2000 minutes of audio playtime. If a
storage restriction was not implemented this may lead to excess cost in storing these files in
the database. However, there are two alternatives to increase the storage cost. First,
Transcribe.Ai can offer plans or add-ons which increase the storage capacity of the account.
This allows Transcribe.Ai to not loss profits through carrying excess amounts of costs in
storing files. Second, Transcribe.Ai can integrate with your existing google drive or dropbox
account through which the audio can be stored through. This allows users to not have to pay
an increase amount for going over the storage limit, while also reducing costs on the side of
Transcribe.Ai. Instead the cloud storing platform such as google drive or dropbox will bear
the cost of storing these files. With these alternatives, an estimate of 1GB out of 2Gb will be
utilized by users on average. This is also considering the fact that 1GB converts to 1000
minutes of playtime or 16.7 hours, which not all users will be able to fully utilize (due to its
lengthy playtime). In addition, a 10MB will be allocated for the personal voiceprint of the
user, which uses machine learning to be able to more accurately transcribe the users voice.
10MB was allocated since user will be asked to read a 10 minute pre-determined script
which the computer can utilize in better analyzing and detecting the users voice.
Table 3. Audio Files Sizing Estimate
Audio Files
The total costs required in order to have the storage capacity that can fill Transcribe.Ai’s userbase has been forecasted below. For using
a cloud database such as AWS, the AWS calculator was used to estimate the total cost required to carry the storage capacity. For each year,
the AWS calculator was used to estimate the total cost (with the url listed in the references below). As for using the a physical hard disk, a per
unit basis of 8TB for a single hard disk from Alibaba was used which cost $259. This is because buying in terms of the 8TB was the most
optimal in terms of pricing where 1TB = $32.4 as compared to other variants such as the 1TB hard drive where 1TB = $49. In terms of only
storage, it seems that have a physical storage with hard disk are relatively cheaper to having a server through AWS. It can be see from Year 1
that alone that the Hard Disk costs was only at P14,913 vs the P732,498 of AWS. As such, in terms of purely cost, Hard Disk would be the best
choice. However, other factors are yet to be considered such as benefits of cloud, maintenance cost, physical location, and more. This will be
explored in Pass 4.
Borysko, N. (2021, April 5). Average saas growth rate: Brief guide for startups. Eleken.
Retrieved May 11, 2022, from
https://www.eleken.co/blog-posts/average-saas-growth-rate-brief-guide-for-startups#:~:
text=It's%20typical%20for%20many%20startups,%2Dto%2Dyear%20growth%20range
.
Llego, M. A. (2021, August 24). DepEd Basic Education Statistics for school year
2019-2020. TeacherPH. Retrieved May 11, 2022, from
https://www.teacherph.com/deped-basic-education-statistics-school-year-2019-2020/
https://www.business2community.com/strategy/how-to-beat-the-average-churn-rate-for-subs
cription-service-enterprises-02158484#:~:text=Average%20churn%20rates%20for%20subsc
ription,behind%20your%20current%20churn%20rate%3F
https://resources.infosecinstitute.com/topic/beyond-password-length-complexity/#:~:text=Mo
st%20of%20the%20passwords%20(61,numbers%20and%200.2%20special%20characters.
https://www.researchgate.net/figure/First-names-and-last-names-lengths-distributions_fig1_3
28894441#:~:text=The%20median%20was%206.5%20characters,characters%20for%20the
%20last%20names.
https://baymard.com/blog/form-field-usability-matching-user-expectations
https://www.wyliecomm.com/2021/11/whats-the-best-length-of-a-word-online/#:~:text=The%
20average%20word%20in%20the%20English%20language%20is%204.7%20characters.
https://www.gmrtranscription.com/blog/interesting-transcription-facts-and-statistics-at-a-glanc
e
https://web.stanford.edu/class/cs101/bits-gigabytes.html
https://calculator.aws/#/estimate?fbclid=IwAR1X2jqB8tpR1H8cJlxBeR0YdHXgs2pKihtBU-2R
y8oGaSn9nGF1EupwqtY
https://calculator.aws/#/estimate?id=e843b18ac8af12fafae1454b616b0d8b33ffea7b
https://calculator.aws/#/estimate?id=6c3431d65ca21637ca0a41e012dc9435b9528a12
https://calculator.aws/#/estimate?id=91576dedf93faf8ef87218be793857f6d94e4359
https://calculator.aws/#/estimate?id=d839bbffe2cd8d57a58f246310d6847795b249c6
https://calculator.aws/#/estimate?id=2b1837812d12b65560e820b2fb4415551429e47a
https://calculator.aws/#/estimate?id=86d39bdd74d8f57fdebb5395225e22542108f7db
https://calculator.aws/#/estimate?id=13fc036fa1f078dcaaab03ff8cf4c62520504723
https://calculator.aws/#/estimate?id=aaa9aec600e0aeaee1a92bd75d219ddf6c4f9cb6
https://calculator.aws/#/estimate?id=80ca289f3791dabb19e7cc08d1a327089a175f91
https://calculator.aws/#/estimate?id=066b61d7ac35f9448a4f004b4162abbfce2f2fa3