1 4 Characters

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

06/12/2016

Character data in
BioNumerics...

Character data = anything that can be


represented by a set of numbers

Spectrophotometric and
chromatographic profiles
with known peaks

Phenotypic test panels

MLVA/VNTR, MLST,
Spa repeats

Gene chips and


2
DNA arrays

1
06/12/2016

There are different classifications of


character data
• Data type:
Binary: 10100111010011001110…
Numerical: 12.31, 1.36, 6.50, 9.78, 1.21, ...
Categorical: S-I-R
yellow, brown, green ...
allele 1, allele 2, allele 3, ...
3 repeats, 5 repeats, 12 repeats, ...

• Data set:
Closed: number of characters is fixed.
Example: API tests.

Open: number of characters may


grow by adding new entries.
Example: Fatty acid analysis
3

Character data can be imported in various


formats
Char 1 … Char n
Entry 1 x11 … x1n
• By hand
… … … …
• Excel files
• Access files Entry m xm1 … xmn

• Delimited text files: tab, pipe, csv, …


• All other ODBC compatible sources
(Open DataBase Connectivity)
• Pictures (bmp,tif) via BNIMA
• Other files via dedicated scripts
•…

2
06/12/2016

Character visualisations: not just a pretty face

Each type of character data has dedicated


analysis methods
• Pearson correlation
• Cosine correlation
• Euclidian distance
• Gower
• Canberra metric
• ...

• Jaccard
• Dice
• Simple matching

• Categorical

• Numerical values can be converted to binary data!


• Absent values are allowed.
6

You might also like