Professional Documents
Culture Documents
Synopsis Major Ipuranklist
Synopsis Major Ipuranklist
On
IPU Ranklist
B.TECH.
(COMPUTER SCIENCE ENGINEERING)
Submitted By:
EKTA
Roll No. 41215002716
BHUMIKA SINGH
Roll No. 02215002716
ANKUSH GARG
Roll No. 01415002716
The semester wise result that we get on university website is in form of pdf which shows
result of a branch in entire university. The idea is to create a parser that extracts marks of
each student in the pdf and store them in a database. This will help in aggregating each
student’s marks at an application and thus form a rank-list semester wise/college
wise/university wise.
Introduction
The university/college results are declared in form of marks subject-wise in pdf format. So,
the data from pdf is to be scraped using any programming language. The raw-strings of data
from the pdf has no meaning. To give a meaning to it, a parser has to be designed which will
kill the painful task of aggregating the data object-wise, stream-wise etc. The parsed data, i.e.,
array of objects(students) is then stored in a database from which API call will be made to
show results on UI. Therefore, there two main components in the project for showing the
result – pdf scraper and parser.
● PDF Scraper: Pdf scraper is a tool that extracts or imports data/information from pdf
document in a local file. PDF.js library in JavaScript is a scraper library that renders
PDF files using the web standards-compliance HTML5 canvas. PDF.js can work as a
part of a website or of a browser. It is also available as a browser extension.
After semester exams, we had to calculate our total marks and then percentage from the list in
large pdf result file. Again, it was cumbersome and almost impossible to find one’s
class/branch/university rank just by looking at the pdf file because it contains hundreds of
pages. This brings us to the idea of generating an automated list of marks and ranks of each
student from the pdf file. The main issue again is to bind all kind of data from pdf file, i.e.,
consistent, inconsistent (some pages have fully filled tables, some don’t or some pages
contain only information about subjects not actual marks).
[1] In this paper, we describe our experiences using Twitter Bootstrap and Angular JS
frameworks to address this balance between design and implementation, empowering
developers to create better styled and easily maintainable websites.
[2] In this paper, we propose a UML profile for AngularJS for building a model of an
AngularJS web application, and a set of transformations that transform the model into a code
template. The developer can then fill in the template to make a complete workable web
application.
[3] says that node.js is one of the more interesting developments recently gaining popularity
in the server-side JavaScript space is Node.js. It's a framework for developing
high-performance, concurrent programs that don't rely on the mainstream multithreading
approach but use asynchronous I/O with an event-driven programming model.
The goal of paper [4] is to capture the essence of TypeScript by giving a precise definition of
this type system on a core set of constructs of the language. JavaScript remains a poor
language for developing and maintaining large applications. TypeScript is an extension of
JavaScript intended to address this deficiency.
The paper discussed in [5] shows the variation in the performance along with the change in
the modeling style with reference to normalization and embedding and it gives the base to
find the extent of normalization and embedding for reducing query execution time.
In paper [6], the characteristics of the data logic model of NoSQL database MongoDB and
the mode design principles were analyzed, then a method based on anti-normal form model
was proposed.
Objective
Objective of IPU Ranklist is to create a minimalist result website where students can see their
subject-wise marks, total marks, class-wise rank and percentage. It would save a lot of time
for students as well as teachers to account their class results.
● PDF Scraper(PDF.js)
● Parser
● MongoDB
● Web Application
[3] Stefan Tilkov ; Steve Vinoski, “Node.js: Using JavaScript to Build High-Performance
Network Programs”, IEEE Internet Computing , Volume: 14 , Issue 6 , Nov-Dec 2010.
[5] Anuradha Kanade ; Arpita Gopal ; Shantanu Kanade, “A study of normalization and
embedding in MongoDB”, IEEE International Advance Computing Conference, 2014.
[6] Yunhua Gu ; Shu Shen ; Jin Wang ; Jeong-Uk Kim, “Application of NoSQL database
MongoDB”, IEEE International Conference on Consumer Electronics – Taiwan, 2015.