Capstone Reading Format


Flordeline A. Cadeliña
IT Research Methods
October 22, 2016
Research Summary


Revised 2.2
Title: NEAR REAL-TIME PROCESSING OF PROTEOMICS DATA USING HADOOP
by: Chris Hillman, Yasmeen Ahmad, Mark Whitehorn, and Andy Cobley

Area Context: Near real-time processing solution using MapReduce and Hadoop.

Type of Research: Qualitative research

# of Data Examined: 4 (Hadoop, Java code, MapReduce, 2D and 3D peaks)

Method(s) Used: Investigation using techniques from big data. Raw mass spectrometer files in the vendor binary format are converted into XML and mzML formats for processing. The process coded in the MapReduce framework allows timings to be taken and compared across platforms and Hadoop configurations.

Theoretical Foundation: XML file conversion, 2D peak picking, de-isotoping of 2D peaks, 3D peak picking, and 3D isotopic envelopes.

Problem/Issue Being Addressed: The data management and processing problem facing the life sciences community: mass spectrometer data must be preprocessed before it yields any biological insight.

Results of the Study: The 2D and 3D peak-picking process fits very well into the MapReduce programming framework. Data is redistributed on a dataset that has already been greatly reduced by the map task.

Additional Knowledge/Insights: Many areas of this process remain to be researched, including SILAC pair/triplet detection and, importantly, the database search that identifies peptides by their mass and ties them to a given protein.

Recommendation: A properly designed and researched process will allow future work to take advantage of technical developments without having to revalidate and redesign the methodology for processing raw mass spectrometer data into actionable information.
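
Illustration: the point that peak picking fits the MapReduce model, with the map task greatly reducing the data before it is redistributed, can be sketched in a few lines of plain Python. This is a minimal sketch under stated assumptions, not the authors' implementation: it uses a simple local-maximum test in place of the paper's peak-picking method, runs in memory rather than on Hadoop, and the names map_pick_peaks, shuffle, reduce_group and the 0.1 m/z binning are hypothetical choices made only for this example.

```python
from collections import defaultdict


def map_pick_peaks(scan_id, points):
    """Map step (assumed): keep only local intensity maxima of one spectrum.

    points is a list of (mz, intensity) tuples sorted by mz.
    Emits (mz_bin, (scan_id, mz, intensity)) pairs; the 0.1 m/z bin
    used as the key is an illustrative assumption.
    """
    emitted = []
    for i in range(1, len(points) - 1):
        mz, intensity = points[i]
        if intensity > points[i - 1][1] and intensity >= points[i + 1][1]:
            emitted.append((round(mz, 1), (scan_id, mz, intensity)))
    return emitted


def shuffle(pairs):
    """Stand-in for the framework's shuffle: group emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_group(mz_bin, values):
    """Reduce step (assumed): summarise one m/z bin across scans."""
    apex = max(values, key=lambda v: v[2])
    return {
        "mz_bin": mz_bin,
        "scans": sorted(v[0] for v in values),
        "apex_intensity": apex[2],
    }


# Two tiny made-up spectra (scan id -> list of (mz, intensity)).
spectra = {
    1: [(500.0, 1.0), (500.1, 8.0), (500.2, 2.0), (500.3, 1.0)],
    2: [(500.0, 2.0), (500.1, 12.0), (500.2, 3.0), (500.3, 1.0)],
}

mapped = [pair for sid, pts in spectra.items() for pair in map_pick_peaks(sid, pts)]
peaks = [reduce_group(key, values) for key, values in shuffle(mapped).items()]
```

In this toy input, eight raw data points shrink to two emitted peaks before the shuffle, which is the kind of reduction the summary attributes to the map task; grouping by m/z bin across scans stands in for the step from 2D to 3D peaks.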
