Professional Documents
Culture Documents
LAB Assignment#1
LAB Assignment#1
LAB Assignment#1
. . . .
Sahiwal Campus
. .
. . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . LAB Assignment # 01
Submitted By:
Samina Naz
FA20-BCS-127
Subject:
Introduction to Bioinformatics
Section:
C
Q:1:
Gene Name: keratin 18 [Homo sapiens (human)] ODD Roll Numbers
a) Describe the official name and symbol
Official Gene Symbol: KRT18
Official Gene Name: Keratin 18
KRT18 is the official gene symbol used to represent keratin 18 in scientific
literature and databases. Keratin 18 is a type I keratin protein, which is a
structural protein found in epithelial cells, including those of the liver, intestine,
and pancreas.
b) Download its FASTA Sequence and complete dataset
FASTA Sequence:
>NC_000012.12:52948855-52952906 Homo sapiens chromosome 12, GRCh38.p14 Primary
Assembly
ACCTGTCTTCTCCACTGCCTGTACCAGCCCACCTCAGGTGCCTTCTCGCCGGCCTTCCTCACCCACCATG
TCTCGGCAGTCCTCCATCACCTTCCAGTCTGGCAGCCGCAGGGGCTTCAGCACCACCTCGGCCATCACCC
CGGCAGCTGGCCGCTCCCGCTTCAGCTCTGTCTCTGTGGCCCGCTCTGCAGCAGGGAGTGGGGGCCTGGG
AAGGATCAGCAGTGCTGGGGCCAGCTTTGGAAGCCGCAGCCTCTACAACCTGGGGGGTGCCAAGCGGGTC
TCCATCAATGGGTGTGGCAGCAGCTGCCGAAGTGGCTTTGGTGGCAGGGCCAGCAACAGGTTTGGAGTCA
ACAGTGGATTTGGCTATGGGGGTGGAGTTGGAGGAGGCTTCAGTGGCCCCAGCTTCCCCGTGTGTCCCCC
TGGAGGCATCCAAGAGGTCACTGTCAACCAGAGTCTCCTGACTCCTCTTCACCTGCAAATCGACCCCACC
ATCCAGCGGGTGCGGGCCGAGGAGCGCGAGCAGATCAAGACCCTCAACAATAAGTTCGCCTCCTTCATCG
ACAAGGTAAGCAGGGGCTTCATCCACCCCCTTGGGTTTGGGATCAAATAAACTCTTGGAAGGGCCATCCC
ATGGGGGGAGAGCAATAATGCAATGACCCCACTGTGGGAATGAGCACTGTTCAGCACGGGCTCCCAGGGG
CTGAGACCCTTCCAAGTCAGGCCAGCTTGCCCCACAGGACCTGGTAGAAATTTCCTCTCTTTCGGAGCCA
CATGGGCTGGTTCAGTCAACACCAAGGGAAGAGTTTTGTTGATTCTCTACAGGAGAGTTGCTGCTCAGCA
AACTACCCTAACCCAGAGTAGGTGGTGTTGAGAAACTTAACCCAAGAGCAGCTCCCCAACAGAAGCCTCT
AGGCCCCACCACCCGAATCCTATGCAAGCCCTAGGGAACTTTGCGGTAGCTCCATGGACTGCCTCCTTTT
GGGTTGGAGTTGTCAGTTACATTATTGTCAATGGGTGCCAAGTGAAAAAAATATCTTTTTCTTCTCTCCC
TTATTAATAGGAGTGTCTAACTCTGCCTCTCCAACTTCTCAAGGTTTCATTCTCTCTTCTTCCCTCCAGG
TGAGGTTCTTGGAGCAGCAGAACAAGGTCCTGGAGACCAAGTGGGCCCTCCTGCAGGAGCAGGGCTCCAG
GACTGTGAGGCAGAACCTAGAGCCCCTCTTTGATTCCTATACCAGTGAGCTCCGACGGCAGCTGGAAAGC
ATCACCACCGAGAGGGGCAGGCTTGAAGCTGAACTGAGGAACATGCAGGATGTTGTGGAAGATTTCAAAG
TCAGGTAAGTGGGAGACTGGCTTCTGGCCACACACAGCCATCTGAAGGCTCCTTTGTGTGAGGACCAGAG
AGGTGCAAAGGAGCAAATGCCGATATCAGCCGGGAGCTTTGGAACTGCAGCCTTTATCCTGCAAGGTGGA
GACAGCACTGTGTGGGGTAGCACAAGGACCTTCATCTTGTGTATTGCTATGAAGATCCTATTCCTATTTG
TTCACATGACTGCAAAGGGATATCAACATCATCCGACAAAATATTGCTACTCACTCTAGAAATCATAATG
TAATTTCACAGTCAGCATATTGTTTAATTCCATTGGACATGGGCTCAATTATTGAGATGGTTTGCATTTC
CAGGATGGCATTCACTGTAGAGTGAAGAGAGGTAACAAGGAAGAGTTTAAAGGAGGGCAATCTGACTTTT
CTTGTGGGGGGAAACTTTTGACTGCACATCATCCAGGCTGCAGTAGGTGAGGTATCCAGTGGAAAGGGAT
TTGGTCCAGACTAACCCATTAGCCAGCTTGCCCTTATTTCTAAGCTTGAGCTACCCCTAGTTACAAAAAG
CATATTTCTCAGGGGCCACCCTGAGGTTCAGTGAAAATATATCAAAATCTACCCGAAACAGCCTGCTGAA
CAGATAAAGTTTACTCCACGATTTCGGTAGACACTTGAGGTGGAAAAACATTTTAGATTAGCATTGCATA
TCTATCAAAAGATGGAGTTGCATAAGTATATTTTTGTATTCCTTTAAGTTCCATGCTCACCTTCTACTCT
TAGAAAAGTTTATTAAAAACTAGCAGAAGAAATTATCCCCCTTTTATGGAGGGAGAAATAGAGGTTAAAT
GAATCACCCAAAGTCATGCCCAGGTCAGCAGAAGAGCTGGAAATGATTTGAGGGCTCTTAACTCTTGCTA
CACCAGCCTGGAAAGAGTTCAATGCAGACTTGCTCAACAGCACACCCCCCGCTCTGTCCTTATAGGTACG
AAGATGAAATTAACAAGCGCACAGCTGCTGAGAATGAATTTGTAGCCCTGAAAAAGGTGAGTGGGGATGT
TTCTCTCAAAGGAGAAGGTTTAAAATGGAATCTGGAGTGTGGGGTAACCTGACCTCTGACCCTTGGGCCA
CCCAAGAAATGTCACATCAACCTTGAGAATATCTGCAGAGTTCAAACCTCCCAACATGTTCCCTCTACGT
TGATGGCCCCATTAGCCCTACTTGGGTTTCTGTGAGCTCAGGGAATTCAAGCCCCCAGTTCTCCGTAATT
ACCCATCCCCAACCCCAAATCACCCAGACCCAGAGTTTTCTAAAATCCAAACTAGATGGGCTGGGGAGAA
ATCTGCTCAGCTTCTTTTGGACTAGATACTTGGGGCTGCAGACTCAAAGGAGCATCCTGCGCTGTCATTC
CAGGACGTAGATGCTGCCTATATGAACAAGGTGGAGCTGGAAGCCAAGGTCAAATCTCTGCCCGAGGAGA
TCAACTTCATCCACTCAGTCTTTGATGCAGTAAGAGTCTGCAAGTATTTCTGTCTCTCCTAGGTCTGGAG
CCTGGAAAGAAAAGGATGATGCATTGTGCCATTCATTCATTCAGTGCCTCTGCCCAGCATCTTGCTTGGA
TACTTCAAGCTGGGGCTTGGGTGGTAGGGGGACCAGGGAGAACCACTTGGAGCCTTGTCATACTAAACTA
CCCCAGCTCTGATGCTTCTCCCCCGAACTCCCTTATCCCATGGCAGGACAACATCTAACAAGGAGAGAGT
ACTGATTCCCAAATTCTAGGACACTGGGTAATTTCCCAAGGAATCAACACATACCTGCTCCCTTCCCCTT
CCCTGAAAAGTATTAAAAAAAAAAAAAGGCAAGCTGTCCCTCAGCTCATTGGTCAGGGGGCTTCCTACCC
TTTGAAGATGCCTCATTCTGGGCGCTACCCCTCCAAAGGCAGAGACCTGGGTCTGTGGAAAGGGAGAGAG
AGGTAGAAAGTGATGGAGTCACACTGTGCAGGGGGAAGCAGTGTCCCAGATGTCCCCAATGCTCTTAAAG
AAGCCGTTTATGTTGGCATACTGAACAGACACGTGTCACATTCAAATCTATGTAATTCCAATTCAGAGGA
TGCTTACCCACCCTCATGCCTGGGGAAAGCACACAGATGGGAGCCTTGAGAAGTAAGGCTGGGAAAGATT
TTCACCACACTGATCTTTTGGGATGAATGGGAATACCTAGGGAATAGCTGGGAGAATCTGCCAGGAATAC
CAAGGAAAGGATTCCTGACCTAGATGGGTAGTTAACCACGAAGATTTAAGATTCTTCATCTTATGCCTTG
GTGATGCTGAGTTTACTGCCCTGCAGGAGCTGTCCCAGTTGCAGACCCAGGTCGGTGACACATCCGTGGT
GCTGTCCATGGACAACAACCGCAACCTGGACCTGGATAGTATCATCGCCGAGGTCAAAGCACAATACGAG
GACATTGCCAACCGCAGCCGGGCCGAGGCTGAGTCCTGGTACCAGACCAAGGTGAGCATGGACACCTCCA
TGAGAGGTTCCAGGGTTAGTGTTCTCTGAGGCTCCACATTATCACTTAACTCAGCCTCAGGAAACGTGTG
AGCACATTCGTTTATTTCAACTTAGCAGGCATGTCTTTGATGCTATGACAACTTAGCTTGAAATGCATGT
GGAAACCGAACCAGACACACTAATACATGGTCAGCCCAATGCTGGGAGCTCAGGACATCCACTGGCCCCA
CATTCCTCAAGATCTGGGTGGGAGCAGGGTGAGACACCAGGACAACCGAGACACAGTCATGAAGCAGTTT
CTAAAAGGCTTATTTATTCTCTATATATTTTCTGAGCTCCTGCTGTATGCCAATCAGGGTTACAGGGTTG
CAAATAAATAAACTGCAAACAGAGAACCCAAGCTCTGGGAGGCCATGAAGTGAATGGACAATCATGGAAG
GGAAAAGATAGCATGAATAAAAAGCTTCCAGGAAGACATGGGGGCTTTGTACAGTTGGGAAGCCATGAGG
GACAAAAGATTGCTGAGGAGTGGGGAGAGGTTTAAGGCTGAACAAGGAGCTGGCAGGCAAGAACAAGCAA
GGGAGTTTATGTCAGGAAGAGGAAGGCTGGGATAAACACAAACAGCTACTGCCCAGAGCTCAGACAGCCG
CAAAGAAGTTTGGCTTTGCGGGGTACAATTGACCCATGATACCAGCTCCCTGTCAAATCCAGACCCCTCT
TGGGGCAGCTTCTCACCTACGAGCAGGTTCCAACTCTTTCCCTGCTCCATACGTTGCCTCATCCCTTCTG
GTCAGGAGTGTGGTGGAAAGGAAGGTGGGTAGCAGGGACCAGGGTTCCACAGGGCAGAGGCAGCGCCTTG
ACGGTGAAAGGAAACATGATGCACTTAACCCCAAGGTGAAGTGGTTGAAATCGATAGCAAACGATTCCTC
ATGTTGTTGGGTTGTTGCTCCATTTAATCATGTATACCTAGAAGCGGGAACCTGAGCTATTCAGCACTTT
CAAGAACCCCACAGATCTTGACTCTGGCAGGGGATCTCCTTTTGTCAGGGAAAGGTGTAGGTTTCACTTC
AGTCTGCTGGAGGAGACAGGGTGTAATTATTGCTCTTAAATTCACATGTCCTGGATATGCACCATTAGAT
TGAGAACTACCTGAGATTGGGAATACTTTTACAAAGTCTTCAAAATTGTGCCTTCCACAGCCTCTGCACC
ATCCCACACCATTCCCCCATATCTCCTCCCTGTTTCCCCAGCACTGGTGTTTGGAGGGCTACCAAAATTC
ATGGGCACAGTTGGTCTGGATGCACGCTCTGTGACCAGGAACTACCCAGGGACCTTGATCAAATCACCGT
CTCACTCCTAGAACTCACCATGTTCCTTCCCTGACCCAGAGTCTTCATGCAGGCTGTTTCCTTTGTCTGG
AATGTTCTCCCCCACAACTGGTCACTTAGGTCCCTCCTTCTCATCCTTCAGACCACAGTTCAAGCATCTC
CATCTCTGGAGAGACTTCTCTGACCACCACTTCCCACTTCCAAATCTAGGTCAGATTCCTTCATTACACT
CTCCCAGGACCCTGTTCATTTCCTAAGGGCACTTATCTTAGTGTGGAACTATACATTTGATAATATGCTA
ATTCAGTTAATGTCTATGTCCCCCAATAAACTGTAAGCTTCAGGGGGAATGAGTGAATGACCAGGAATGA
ATGAGCCTGCTTGTGGCACCCAGGGTGGGTCTGTGTGCACAGCGAGTGCCTGGGCCAGGCATTTGACTCA
GTGACTGGGTTTGCTCTGGTTCTCTCAGTACGAGGAGCTGCAGGTCACCGCAGGCAGACATGGGGATGAC
CTTCGAAACACCAAACAAGAGATCTCTGAAATGAACCGCATGATCCAGAGGCTGAGAGCTGAGATTGACA
Dataset:
a) Explain gene type
Gene type:
protein coding.
The gene type of keratin 18 (KRT18) is a structural gene. Keratin 18 encodes a
protein that is a structural component of epithelial cells in various tissues,
including the liver, intestine, and pancreas. As a structural gene, KRT18's primary
function is to produce the protein necessary for maintaining the structural integrity
of these epithelial cells.
Aliases:
Some other names or aliases for Keratin 18 may include:
CK18: CK18 stands for Cytokeratin 18, which is another name commonly used to
refer to Keratin 18. Cytokeratins are a subgroup of keratins that are specifically
found in epithelial cells.
KRT18E: This is an abbreviation for Keratin 18 Endo, which indicates the
endogenous form of Keratin 18.
c) Summaries its function and expression in your own words
Function:
Involved in the uptake of thrombin-antithrombin complexes by hepatic cells (By
similarity).
When phosphorylated, plays a role in filament reorganization. Involved in the
delivery of mutated CFTR to the plasma membrane. Together with KRT8, is
involved in interleukin-6 (IL-6)-mediated barrier protection
Expression:
Tissue specificity:
Expressed in colon, placenta, liver and very weakly in exocervix. Increased expression observed in
lymph nodes of breast carcinoma.
RefSeq status
REVIEWED
The reference sequences usually include the DNA sequence (nucleotide sequence)
encoding KRT18 gene.
RNA Information:
RNA information for Keratin 18 includes mRNA sequences transcribed from the
KRT18 gene. These mRNA sequences can be found in databases such as
GenBank, RefSeq, and Ensembl.
RNA expression data, such as tissue-specific expression patterns, can be obtained
from databases like GTEx (Genotype-Tissue Expression) and TCGA (The Cancer
Genome Atlas).
Protein Information:
Protein information for Keratin 18 includes the amino acid sequence of the KRT18
protein. This sequence can be retrieved from protein databases like UniProt.
Additional information about the protein structure, domains, post-translational
modifications, and functional annotations may also be available in protein
databases and literature.
To access specific information about reference sequences, RNA, and protein for
Keratin 18, you can search these databases using the gene symbol (KRT18) or its
associated identifiers.
Q:2:
a) Protein name of your corresponding gene and its annotation status
and score
Protein names
Recommended name