Professional Documents
Culture Documents
Report
Report
Markus Tenghamn
Kvarnbacksvägen 5
72233 Västerås
+46 73 995 06 88
mtn11005@student.mdh.se
1. SUMMARY 4. TECHNOLOGY
This report summarizes the work of Markus Tenghamn on Several different languages were looked at for this project
a web parsing application for Cleaning Services. Cleaning to see which might be the most efficient at accomplishing
Services were in need of a tool to make it easier to gather the task at hand. To begin with, a simple test to try to find
information used for marketing. This application is a result out which language was the best was laid out. This
of that need and close work with the company. The involved starting an instance of the application or script
application is written with PHP and utilizes several external needed to run the parser, downloading the html from a
libraries and tools to accomplish the tasks presented. The specific URL and finding an element in the DOM. 100 tests
result is a fully functional web application which went were done for each language with 1 initialization and 100
beyond the initial requirements. requests each time. Requests were performed on the same
computer to attempt to eliminate as many anomalies or
2. INTRODUCTION errors as possible. The averages of the findings are shown
A web application was created with PHP which uses below.
external websites and API’s to gather information which
can later be exported and used for marketing. The PHP
application uses parsers to gather specific information from Average time to initialize: 1.057 sec
these websites which it then stores in a database along with Average time for a request: 0.971 sec
information gathered from various API’s. The information Python
can then be exported in an Excel, PDF, or text format Average time to initialize: 1.443 sec
which follows the format used by Posten to import shipping Average time for a request: 0.161 sec
addresses.
C#
3. BACKGROUND Average time to initialize: 2.480 sec
The project requirements were laid out by Cleaning Average time for a request: 0.062 sec
Services whom needed a better way of collecting
information, used for marketing, from real estate listings.
This had previously been done by hand and the company
The results were surprising as C# seemed to be the fastest
was now looking to build a web application that would
at completing the task once it had been initialized and PHP
automate the process.
being the fastest time to initialize but much slower request
times. The expected result was that all of the languages
would have almost equal performance times. The data may
not be completely accurate as internet speeds and loads on
the tested website may have affected the time of requests.