Targeted Projection Pursuit

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Targeted projection pursuit

Targeted projection pursuit is a type of


statistical technique used for exploratory data
analysis, information visualization, and
feature selection. It allows the user to
interactively explore very complex data
(typically having tens to hundreds of
attributes) to find features or patterns of
potential interest.

Conventional, or 'blind', projection pursuit,


finds the most "interesting" possible
projections in multidimensional data, using a
search algorithm that optimizes some fixed
criterion of "interestingness" – such as
deviation from a normal distribution. In
contrast, targeted projection pursuit allows the
user to explore the space of projections by
In this example targeted projection pursuit is being used to
manipulating data points directly in an
explore projections of a gene expression data set. Each of
interactive scatter plot. the 122 points corresponds to a sample taken from a cancer
tumor of four diagnostic classes (represented by color). For
Targeted projection pursuit has found
each sample, the expression level of 100 genes was
applications in DNA microarray data
recorded (represented by the axes). The animation shows
analysis,[1] protein sequence analysis,[2] that TPP is able to separate two of the classes clearly (red
graph layout[3] and digital signal and purple), but two others could not be distinguished (blue
[4]
processing. It is available as a package for and green). The position of the axes then indicates the
the WEKA machine learning toolkit. activation of which genes are most associated with each
class.

References
1. Faith, Joseph; Robert Mintram; Maia Angelova (2006). "Targeted Projection Pursuit for
Visualising Gene Expression Data Classifications" (http://bioinformatics.oxfordjournals.org/c
ontent/22/21/2667.full.pdf?keytype=ref&ijkey=wTnlzDOhmjsF0j3) (PDF). Bioinformatics. 22
(21): 2667–267. doi:10.1093/bioinformatics/btl463 (https://doi.org/10.1093%2Fbioinformatic
s%2Fbtl463). PMID 16954139 (https://pubmed.ncbi.nlm.nih.gov/16954139).
2. Haddow, Chris; Marcus Durrant; Justin Perry; Joe Faith (2011). "Predicting Functional
Residues of Protein Sequence Alignments as a Feature Selection Task". International
Journal of Data Mining and Bioinformatics. 5 (6): 691–705. doi:10.1504/IJDMB.2011.045417
(https://doi.org/10.1504%2FIJDMB.2011.045417). PMID 22295751 (https://pubmed.ncbi.nlm.
nih.gov/22295751).
3. Gibson, Helen; Joe Faith (2011). "Node-Attribute Graph Layout for Small-World Networks"
(https://independent.academia.edu/JoeFaith/Papers/776056/Node-Attribute_Graph_Layout_
for_Small-World_Networks). Proceedings of 15th International Conference on Information
Visualisation.
4. Sujan, Rajbhandari; Joe Faith (2010). "The Use of Linear Projections in the Visual Analysis
of Signals in an Indoor Optical Wireless Link". 2010 7th International Symposium on
Communication Systems, Networks & Digital Signal Processing (CSNDSP 2010). IEEE.
pp. 576–581. doi:10.1109/CSNDSP16145.2010.5580367 (https://doi.org/10.1109%2FCSN
DSP16145.2010.5580367). ISBN 978-1-4244-8858-2.

Further reading
Joe Faith (2007) "Targeted Projection Pursuit for Interactive Exploration of High-
Dimensional Data Sets" (https://independent.academia.edu/JoeFaith/Papers/320274/Target
ed_Projection_Pursuit_for_Interactive_Exploration_of_High-Dimensional_Data_Sets),
Proceedings of 11th International Conference on Information Visualisation

External links
imDEV (https://sourceforge.net/projects/imdev/) free Excel add-in for targeted projection
pursuits using feature selection coupled with PLS and PLS-DA
Targeted Projection Pursuit project page (https://code.google.com/p/targeted-projection-purs
uit/)

Retrieved from "https://en.wikipedia.org/w/index.php?title=Targeted_projection_pursuit&oldid=1021682168"

You might also like