Open navigation menu

Welcome to Scribd!

Project Progress Report

Uploaded by

John Louis Reyes Aguila

0% found this document useful (0 votes)

28 views2 pages

The report summarizes a project using two lung cancer gene expression datasets from The Cancer Genome Atlas and bioconductor to identify genetic features related to lung cancer. The datasets have been preprocessed but integrating them poses challenges. The team aims to unify the datasets, compare findings, and use edgeR to analyze differential expression in the integrated data to strengthen understanding of the relationship between the human genome and lung cancer. There is a risk the datasets cannot be fully integrated if required information is unavailable.

Original Description:

Project progress report

Original Title

Project_Progress_Report

Copyright

© © All Rights Reserved

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

The report summarizes a project using two lung cancer gene expression datasets from The Cancer Genome Atlas and bioconductor to identify genetic features related to lung cancer. The datasets have been preprocessed but integrating them poses challenges. The team aims to unify the datasets, compare findings, and use edgeR to analyze differential expression in the integrated data to strengthen understanding of the relationship between the human genome and lung cancer. There is a risk the datasets cannot be fully integrated if required information is unavailable.

Copyright:

© All Rights Reserved

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

28 views2 pages

Project Progress Report

Uploaded by

John Louis Reyes Aguila

The report summarizes a project using two lung cancer gene expression datasets from The Cancer Genome Atlas and bioconductor to identify genetic features related to lung cancer. The datasets have been preprocessed but integrating them poses challenges. The team aims to unify the datasets, compare findings, and use edgeR to analyze differential expression in the integrated data to strengthen understanding of the relationship between the human genome and lung cancer. There is a risk the datasets cannot be fully integrated if required information is unavailable.

Copyright:

© All Rights Reserved

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 2

Search inside document

Project Progress Report

Declan Levine, Tong Liu, and Vincent Wu

We will use 2 datasets. The first one is from The Cancer Genome Atlas(TGCA). The goal

of this analysis is to find out what genetic features are related to lung cancer (FireBrowse,

firebrowse.org/?cohort=LUSC#). The other dataset we will use is from bioconductor. It is a

study on lung cancer gene expression. The data was originally published on bioconductor in

2004 (Scharpf R, Zhong S, Parmigiani G (2019). lungExpression: ExpressionSets for Parmigiani

et al., 2004 Clinical Cancer Research paper. R package version 0.24.0).

The first dataset consists of Lung adenocarcinoma gene expressions. The mRNAseq

preprocessor picks the “scaled_estimate” value from Illumina HiSeq/GA2 mRNAseq level_3(v2)

dataset and makes the mRNAseq matrix with log2 transformed for the downstream analysis.

Preprocessing is already done, but the raw data is available if necessary. The other dataset

“lungExpression” is represented as an ExpressionSet and is already preprocessed.

So far we have been able to look at the two sets of data and have begun to look into the

best ways we might be able to integrate the two platforms in a useful way. The challenge lies in

finding the most effective means to this end, which is something we are currently researching.

The goal will be to unify the two studies and report on how their respective findings compare

post-integration.

We hope to strengthen the findings made with the two platforms which will ultimately

help in understanding the relationship between the human genome and lung cancer. We still need

to find the best way to get the dataset from Broad Institute into a workable format in R. Further,

we need to translate these data in an integratable way for further analysis.

We intend to use edgeR in order to complete a differential expression analysis of the

integrated data set in order to compare the prior results.

There is a chance that we will not be able to integrate the two platforms because we lack

information needed in order to do so. First we will attempt to get what is required from the

publishers of the data, or from another publication. If this fails, we can perform an in-depth

analysis on both data sets independently and compare the results.

You might also like

Applied Survival Analysis R
Document245 pages
Applied Survival Analysis R
Mamadou Watt
100% (2)
Quantitative Decisions in Drug Development
From Everand
Quantitative Decisions in Drug Development
Christy Chuang-Stein
No ratings yet
SAS Interview QA
Document12 pages
SAS Interview QA
hpradeep
100% (2)
MAQ - Heng Li
Document9 pages
MAQ - Heng Li
Shantanu Kumar
No ratings yet
Simple R Tools For Genetic Markers Research
Document3 pages
Simple R Tools For Genetic Markers Research
IJAR JOURNAL
No ratings yet
Ijccn02322014 1
Document8 pages
Ijccn02322014 1
Anonymous VM7yct
No ratings yet
Project Plan-Example1
Document3 pages
Project Plan-Example1
d2cc8hbcjc
No ratings yet
Background:: A Modeling and Simulation Web Tool For Plant Biologists
Document3 pages
Background:: A Modeling and Simulation Web Tool For Plant Biologists
sk3 khan
No ratings yet
Current Scenario On Application of Computational Tools in Biological Systems
Document12 pages
Current Scenario On Application of Computational Tools in Biological Systems
Ankit Agrawal
No ratings yet
Topology Based Data Analysis Identifies A Subgroup of Breast Cancer With A Unique Mutational Profile and Excellent Survival
Document6 pages
Topology Based Data Analysis Identifies A Subgroup of Breast Cancer With A Unique Mutational Profile and Excellent Survival
J Luis Mls
No ratings yet
Deepside Reff
Document69 pages
Deepside Reff
shravya v
No ratings yet
LEA: An R Package For Landscape and Ecological Association Studies
Document14 pages
LEA: An R Package For Landscape and Ecological Association Studies
Suany Quesada Calderon
No ratings yet
AS A F C D S P D L: Ystematic Pproach To Eaturization For Ancer RUG Ensitivity Redictions With EEP Earning
Document16 pages
AS A F C D S P D L: Ystematic Pproach To Eaturization For Ancer RUG Ensitivity Redictions With EEP Earning
Austin Clyde
No ratings yet
Straf
Document12 pages
Straf
tauhidj
No ratings yet
Mathematical Model of Classification of Human Genome Data For Breast Cancer
Document12 pages
Mathematical Model of Classification of Human Genome Data For Breast Cancer
sukanya samanta
No ratings yet
Supervised Learning Approach For Human Liver Cancer Diagnosis
Document10 pages
Supervised Learning Approach For Human Liver Cancer Diagnosis
Rakeshconclave
No ratings yet
New04 Thefuture Sequence To Expression Modells
Document12 pages
New04 Thefuture Sequence To Expression Modells
sznistvan
No ratings yet
A Disease Prediction by Machine Learning Over Bigdata From Healthcare Communities
Document3 pages
A Disease Prediction by Machine Learning Over Bigdata From Healthcare Communities
Harikrishnan Shunmugam
No ratings yet
Logic Synthesis for Genetic Diseases: Modeling Disease Behavior Using Boolean Networks
From Everand
Logic Synthesis for Genetic Diseases: Modeling Disease Behavior Using Boolean Networks
Pey-Chang Kent Lin
No ratings yet
Pacific Symposium On Biocomputing 2014
Document12 pages
Pacific Symposium On Biocomputing 2014
quimiza
No ratings yet
An In-Silico Approach Leads To Explore Six Genes As A Molecular Signatures of Lung Adenocarcinoma
Document32 pages
An In-Silico Approach Leads To Explore Six Genes As A Molecular Signatures of Lung Adenocarcinoma
mostafa elharrany
No ratings yet
Jds 1019
Document14 pages
Jds 1019
POOJA SINGH
No ratings yet
Mirdeep2 y Otros
Document10 pages
Mirdeep2 y Otros
Jorge Hantar Touma Lazo
No ratings yet
Quantotative MDR Method
Document1 page
Quantotative MDR Method
yassermb68
No ratings yet
Research Paper On Pca
Document8 pages
Research Paper On Pca
afmcdeafl
100% (1)
Cancer Info
Document11 pages
Cancer Info
143davbec
No ratings yet
Parkinson
Document7 pages
Parkinson
jaideep
No ratings yet
Pages From 'Coffalyser - Net-Theory - Objasnjenja'
Document2 pages
Pages From 'Coffalyser - Net-Theory - Objasnjenja'
Sanja Cirkovic
No ratings yet
Identification of Alternative Splice Variants in Aspergillus Flavus Through Comparison of Multiple Tandem Ms Search Algorithms
Document10 pages
Identification of Alternative Splice Variants in Aspergillus Flavus Through Comparison of Multiple Tandem Ms Search Algorithms
aahhhiiiittttt
No ratings yet
A Benchmark of Batch-Effect Correction Methods For Single-Cell RNA Sequencing Data
Document32 pages
A Benchmark of Batch-Effect Correction Methods For Single-Cell RNA Sequencing Data
JK
No ratings yet
Biochemical Systematic by Using Dna Fingerprint Data
Document17 pages
Biochemical Systematic by Using Dna Fingerprint Data
pratiwi kusuma
No ratings yet
Hybrid Intelligent Systems: Fuzzy Artmap Neural Networks For Computer Aided Diagnosis Anatoli Nachev
Document8 pages
Hybrid Intelligent Systems: Fuzzy Artmap Neural Networks For Computer Aided Diagnosis Anatoli Nachev
Khalifa Bakkar
No ratings yet
WritingSample 1 PDF
Document2 pages
WritingSample 1 PDF
Akinbolajo Olumide
100% (2)
Writing Sample 101 PDF
Document2 pages
Writing Sample 101 PDF
Akinbolajo Olumide
No ratings yet
WritingSample 1 PDF
Document2 pages
WritingSample 1 PDF
Akinbolajo Olumide
No ratings yet
Heart Disease Prediction Using Associative Relational Classification Technique (Acar) With Som Neural Network
Document7 pages
Heart Disease Prediction Using Associative Relational Classification Technique (Acar) With Som Neural Network
IJMER
No ratings yet
How To Perform A Meta-Analysis With R
Document9 pages
How To Perform A Meta-Analysis With R
Chengta Wu
No ratings yet
Comparative Study of Datamining Algorithms For Diagnostic Mammograms Using Principal Component Analysis and J48
Document9 pages
Comparative Study of Datamining Algorithms For Diagnostic Mammograms Using Principal Component Analysis and J48
ENG AIK LIM
No ratings yet
Atm 08 16 982
Document8 pages
Atm 08 16 982
Jorge
No ratings yet
Deep Learning of Mutation-Gene-Drug Relations From The Literature
Document13 pages
Deep Learning of Mutation-Gene-Drug Relations From The Literature
leehongkai
No ratings yet
Bio in For Matic Spca
Document13 pages
Bio in For Matic Spca
Monica Clements
No ratings yet
The Ribosomal Database Project: Improved Alignments and New Tools For rRNA Analysis
Document5 pages
The Ribosomal Database Project: Improved Alignments and New Tools For rRNA Analysis
Gregorio Arone
No ratings yet
Research Papers On Neural Networks Free
Document8 pages
Research Papers On Neural Networks Free
wzsatbcnd
100% (1)
Ngs - Plot: Quick Mining and Visualization of Next-Generation Sequencing Data by Integrating Genomic Databases
Document14 pages
Ngs - Plot: Quick Mining and Visualization of Next-Generation Sequencing Data by Integrating Genomic Databases
Amany Morsey
No ratings yet
1 s2.0 S1470204504014834 Main
Document1 page
1 s2.0 S1470204504014834 Main
mehrdad_k_r
No ratings yet
Comparing Genetic Evolutionary Algorithms On Three Enzymes of HIV-1: Integrase, Protease, and Reverse Transcriptome
Document13 pages
Comparing Genetic Evolutionary Algorithms On Three Enzymes of HIV-1: Integrase, Protease, and Reverse Transcriptome
AI Coordinator - CSC Journals
No ratings yet
Mining Symptom and Disease Web Data With NLP and Open Linked Data
Document4 pages
Mining Symptom and Disease Web Data With NLP and Open Linked Data
Pablo Ernesto Vigneaux Wilton
No ratings yet
Diagnostic Accuracy of Different Machine Learning PDF
Document6 pages
Diagnostic Accuracy of Different Machine Learning PDF
Warren Seow
No ratings yet
Sas Clinical Imp Questions
Document29 pages
Sas Clinical Imp Questions
Siva Nunna
100% (1)
3413ijcsa05 PDF
Document13 pages
3413ijcsa05 PDF
Vanathi Andiran
No ratings yet
Private Computation On Encrypted Genomic Data
Document21 pages
Private Computation On Encrypted Genomic Data
M'tar Sam
No ratings yet
Research Paper On Protein Protein Interaction
Document7 pages
Research Paper On Protein Protein Interaction
fzpabew4
100% (1)
NLP and Pathology
Document4 pages
NLP and Pathology
Su sure
No ratings yet
Epicalc Book
Document328 pages
Epicalc Book
Luke Mceachron
No ratings yet
Classification of Interviews - A Case Study On Cancer Patients
Document10 pages
Classification of Interviews - A Case Study On Cancer Patients
patel_musicmsncom
No ratings yet
Mastering Parallel Programming with R
From Everand
Mastering Parallel Programming with R
Simon R. Chapple
No ratings yet
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Smart Business Problems and Analytical Hints in Cancer Research
From Everand
Smart Business Problems and Analytical Hints in Cancer Research
Zemelak Goraga
No ratings yet
Biostatistics Explored Through R Software: An Overview
From Everand
Biostatistics Explored Through R Software: An Overview
Vinaitheerthan Renganathan
Rating: 3.5 out of 5 stars
3.5/5 (2)
Social Movements in The Three Worlds
Document2 pages
Social Movements in The Three Worlds
John Louis Reyes Aguila
No ratings yet
Exam2 Biol212 W20
Document5 pages
Exam2 Biol212 W20
John Louis Reyes Aguila
No ratings yet
Exam2 Biol212 W20
Document5 pages
Exam2 Biol212 W20
John Louis Reyes Aguila
No ratings yet
Gastroenteritis Outbreak
Document9 pages
Gastroenteritis Outbreak
John Louis Reyes Aguila
No ratings yet
Motomart Instructions
Document13 pages
Motomart Instructions
John Louis Reyes Aguila
0% (1)