Reference Parsing


A reference is a source of information used to verify a stated fact. References appear in books,
articles, research papers, patents, and similar documents. Reference parsing means splitting a
given reference into its parts, such as authors, conference name, journal name, article title, and
publication year.

In this problem, you are given an input file containing sample references. You have to split
each reference into the following categories:
1. Author: First name and last name of every author listed; there may be two or more
authors.
2. Article Title: The title of the article, extracted from the reference text
3. Conference/Journal: The name of the conference or journal in which the article is
published
4. Volume: Volume of the journal if any
5. Page: Page range of the journal if any
6. Year: Publication year
7. Other: Any other information

Note: Feel free to add any category if you find it important enough.
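
One possible way to hold these categories in code is a small record type. This is only a sketch: the class names `Author` and `ParsedReference` and all field names are hypothetical and not part of the problem statement. Fields that may be absent from a reference default to `None`.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Author:
    first_name: str
    last_name: str


@dataclass
class ParsedReference:
    # One field per category in the list above (names are illustrative).
    authors: List[Author] = field(default_factory=list)
    title: Optional[str] = None
    venue: Optional[str] = None    # conference or journal name
    volume: Optional[str] = None   # only if the reference gives one
    pages: Optional[str] = None    # page range, e.g. "103-106"
    year: Optional[int] = None
    other: Optional[str] = None    # anything that fits no other category
```

Any extra category added under the note above would simply become another optional field.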

Input File: sample_references.txt

Sample input:
Beaubouef, T., & Mason, J. (2005). Why the high attrition rate for computer science students: some
thoughts and observations. ACM SIGCSE Bulletin, 37(2), 103-106.

Sample output:
Author 1
First name T.
Last name Beaubouef
Author 2
First name J.
Last name Mason
Title Why the high attrition rate for computer science students: some thoughts and observations
Conference/Journal ACM SIGCSE Bulletin
Volume 37
Issue 2
Page 103-106
Publication Year 2005
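
The sample above can be reproduced with a regular expression, sketched below. It assumes every reference follows the same layout as the sample, that is `Authors (Year). Title. Journal, Volume(Issue), Pages.`, which is an assumption and not something the problem guarantees. References in other styles would need further patterns or a different technique. The names `REFERENCE_RE`, `AUTHOR_RE`, and `parse_reference` are hypothetical.

```python
import re

# Assumed layout, taken from the sample only:
#   Authors (Year). Title. Journal, Volume(Issue), Pages.
REFERENCE_RE = re.compile(
    r"^(?P<authors>.+?)\s*\((?P<year>\d{4})\)\.\s*"
    r"(?P<title>.+?)\.\s+"
    r"(?P<venue>[^,]+),\s*"
    r"(?P<volume>\d+)(?:\((?P<issue>\d+)\))?,\s*"
    r"(?P<pages>\d+\s*[-\u2013]\s*\d+)\.?\s*$"
)

# One author written as "Last, F." or "Last, F. M."
AUTHOR_RE = re.compile(r"(?P<last>[^,&]+),\s*(?P<first>(?:[A-Z]\.\s*)+)")


def parse_reference(text):
    """Return a dict of fields, or None if the text does not fit the layout."""
    text = " ".join(text.split())  # join wrapped lines, collapse whitespace
    match = REFERENCE_RE.match(text)
    if match is None:
        return None
    authors = [
        {"first_name": a.group("first").strip(),
         "last_name": a.group("last").strip()}
        for a in AUTHOR_RE.finditer(match.group("authors"))
    ]
    return {
        "authors": authors,
        "title": match.group("title"),
        "venue": match.group("venue").strip(),
        "volume": match.group("volume"),
        "issue": match.group("issue"),  # None when no issue is given
        "pages": match.group("pages"),
        "year": int(match.group("year")),
    }


sample = (
    "Beaubouef, T., & Mason, J. (2005). Why the high attrition rate for "
    "computer science students: some\n"
    "thoughts and observations. ACM SIGCSE Bulletin, 37(2), 103-106."
)
result = parse_reference(sample)
print(result["authors"][1]["last_name"])  # Mason
print(result["venue"], result["volume"], result["issue"], result["pages"])
# ACM SIGCSE Bulletin 37 2 103-106
```

The title is taken as the text up to the first period followed by whitespace, so a title that itself contains such a period would be cut short. That limitation is one reason this is a starting point and not a complete solution.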
