Tools for the biologist enabling optimized use of gene trap clones

  Homepage | Blast Search | GO Search | Advanced Search | About

Gene ENSMUSG00000045103 (Dmd)
Chromosomal location
Chr X: 80194242 - 82451480 (+)
Description
dystrophin, muscular dystrophy Gene [Source:MGI (curated);Acc:Dmd-001]
RefSeq_dna
NM_007868 
RefSeq_peptide
NP_031894.1 
UniGene
Mm.407754 Mm.275608 Mm.368403 Mm.416750 
MGI
MGI:94909 
Uniprot/SWISSPROT
P11531 
Uniprot/SPTREMBL
A2A9Z2 Q3TWL4 Q8BHM1 Q3UF47 A2A9Z1 Q5U476 Q9R0A2 A2A9Z0 
Human Ortholog
ENSG00000198947 (DMD)
Omim 300376 - MUSCULAR DYSTROPHY, BECKER TYPE  302045 - CARDIOMYOPATHY, DILATED, 3B  310200 - MUSCULAR DYSTROPHY, DUCHENNE TYPE  
UniTrap UNI30637
Vector Insertion
Chr X: 80194487 - 80381966
Public Clones IST12367D6BBF1 (tigm) IST12618E10 (tigm)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI33489
Vector Insertion
Chr X: 81677332 - 81719472
Public Clones IST12890E5 (tigm) IST10969F1 (tigm) IST10090F10 (tigm) IST12322G5 (tigm)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI32672
Vector Insertion
Chr X: 81719706 - 81777212
Public Clones IST10430A3 (tigm) IST11524H9 (tigm)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI38647
Vector Insertion
Chr X: 81844590 - 81892375
Public Clones (sanger)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI37724
Vector Insertion
Chr X: 81892566 - 82018049
Public Clones (ggtc)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI35867
Vector Insertion
Chr X: 82094134 - 82189483
Public Clones IST12646G9 (tigm)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI31850
Vector Insertion
Chr X: 82221047 - 82293751
Public Clones IST10068B10 (tigm)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI22807
Vector Insertion
Chr X: 82293799 - 82299309
Public Clones not available
Private Clones OST284316 (lexicon) OST168668 (lexicon)
Severity of mutation (?) Insertion after 1% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI6101
Vector Insertion
Chr X: 82299372 - 82335155
Public Clones (sanger) YHD235 (baygenomics) HMA421 (baygenomics) E093H06 (ggtc)
D081F07 (ggtc) E045F10 (ggtc) E093H07 (ggtc) D177G07 (ggtc) E048E11 (ggtc)
D005A07 (ggtc) E043E12 (ggtc) D110F05 (ggtc) E125H09 (ggtc) D177H08 (ggtc)
CMHD-GT_341A4-3 (cmhd) CMHD-GT_537F2-5S (cmhd) CMHD-GT_329G9-3 (cmhd) CMHD-GT_537F2-3 (cmhd)
CMHD-GT_351C12-3 (cmhd) IST15097G1 (tigm) IST11807H5 (tigm) IST12539C12 (tigm)
IST10118H7 (tigm) IST11785F11 (tigm) IST14706B6 (tigm) IST10982A2 (tigm)
IST13031A3 (tigm) IST11969B11 (tigm) IST11608B5 (tigm) IST14783D11 (tigm)
IST10190D12 (tigm) IST14802F4 (tigm) IST12301C11 (tigm) IST15065G4 (tigm)
IST12022G1 (tigm) IST12409A6 (tigm)
Private Clones OST472718 (lexicon) OST285335 (lexicon) OST283399 (lexicon) OST261937 (lexicon)
OST253426 (lexicon) OST104421 (lexicon) OST77156 (lexicon) OST68336 (lexicon)
OST63600 (lexicon) OST37154 (lexicon)
Severity of mutation (?) Insertion after 5% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI22934
Vector Insertion
Chr X: 82335231 - 82350482
Public Clones CMHD-GT_402E3-3 (cmhd)
Private Clones OST280872 (lexicon) OST67888 (lexicon)
Severity of mutation (?) Insertion after 9% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI23949
Vector Insertion
Chr X: 82350685 - 82352918
Public Clones not available
Private Clones OST239694 (lexicon)
Severity of mutation (?) Insertion after 20% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI4390
Vector Insertion
Chr X: 82355440 - 82384666
Public Clones AE0481 (sanger) XS0600 (sanger) RRP045 (baygenomics)
Private Clones not available
Severity of mutation (?) Insertion after 33% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI5399
Vector Insertion
Chr X: 82424162 - 82435948
Public Clones CSI883 (baygenomics) CSI560 (baygenomics) W007D09 (ggtc) W007E09 (ggtc)
W056C05 (ggtc)
Private Clones not available
Severity of mutation (?) Insertion after 93% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI9581
Vector Insertion
Chr X: 82436042 - 82442842
Public Clones XC730 (baygenomics) XC368 (baygenomics)
Private Clones not available
Severity of mutation (?) Insertion after 98% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

Show all transcripts and translations:
Transcript ENSMUST00000113991
Sequence
CTCACTGCCTGTGAAACCCTTACAACCATGAGGGAACACCTCAAAGG - UNI22807 - CCACGAGACCCAAACCACTTGTTGGGACCACCCCAAAAT

GACAGAGCTCTACCAGTCTTTAG - UNI6101 - CTGACCTGAATAATGTCAGGTTCTCCGCGTATAGGACTGCCATGAAGCTCAGAAGGCTCCAGAA

GGCCCTTTGCT - UNI22934 - TGGATCTCTTGAGCCTGTCAGCTGCATGTGATGCCCTGGACCAGCACAACCTCAAGCAAAATGACCAGCCCATGG

ATATCCTGCAGATAATTAACTGTTTGACTACAATTTATGATCGTCTGGAGCAAGAGCACAACAATCTGGTCAATGTCCCTCTCTGTGTGGATATGTGTCT

CAACTGGCTTCTCAATGTTTATGATAC - UNI23949 - GGGACGAACAGGGAGGATCCGTGTCCTGTCTTTTAAAACTGGCATCATTTCTCTGTGTA

AAGCACACTTGGAAGACAAGTACAGAT ACCTTTTCAAGCAAGTGGCAAGTTCAACTGGCTTTTGTGACCAGCGTAGGCTGGGTCTTCTTCTGCATGATT

CTATTCAAATCCCAAGACAGTTGGGTGAAGTTGCTTCCTTTGGGGGCAGTAACATTGAGCCGAGTGTCAGGAGCTGCTTCCAATTT - UNI4390 - G

CCAATAATAAACCTGAGATTGAAGCTGCTCTCTTCCTTGACTGGATGCGCCTGGAACCCCAGTCTATGGTGTGGCTGCCCGTCTTGCACAGAGTGGCTGC

TGCTGAAACTGCCAAGCATCAAGCCAAGTGTAACATCTGTAAGGAGTGTCCAATCATTGGATTCAG GTACAGAAGCCTAAAGCATTTTAATTATGACAT

CTGCCAAAGTTGCTTTTTTTCTGGCCGAGTTGCAAAGGGCCATAAAATGCACTACCCCATGGTAGAGTATTGCACTCCG ACTACATCCGGAGAAGATGT

TCGCGACTTCGCCAAGGTACTAAAAAACAAATTTCGAACCAAAAGGTATTTTGCGAAGCATCCCCGAATGGGCTACCTGCCAGTGCAGACTGTGTTAGAG

GGGGACAACATGGAAAC TCCCGTTACTCTGATCAACTTCTGGCCAGTAGATTCTGC GCCTGCCTCGTCCCCCCAGCTTTCACACGATGATACTCATTC

ACGCATTGAACATTATGCTAGCAG GCTAGCAGAAATGGAAAACAGCAATGGATCTTATCTAAATGATAGCATCTCTCCTAATGAGAGCAT AGATGATG

AACATTTGTTAATCCAGCATTACTGCCAAAGTTTGAACCAGGACTCCCCCCTGAGCCAGCCTCGTAGTCCTGCCCAGATCTTGATTTCCTTAGAGAGTGA

GGAAAGAGGGGAGCTAGAGAGAATCCTAGCAGATCTTGAGGAAGAAAACAG GAATCTGCAAGCAGAATATGATCGCCTGAAGCAGCAGCATGAGCATAA

AGGCCTGTCTCCACTGCCATCTCCTCCTGAGATGATGCCCACCTCTCCTCAGAGTCCCAGGGATGCTGAGCTCATTGCTGAGGCTAAGCTACTGCGCCAA

CACAAAGGACGCCTGGAAGCCAGGATGCAAATCCTGGAAGACCACAATAAACAGCTGGAGTCTCAGTTACATAGACTGAGACAGCTCCTGGAGCAG CCC

CAGGCTGAAGCTAAGGTGAATGGCACCACGGTGTCCTCTCCTTCCACCTCTCTGCAGAGGTCAGATAGCAGTCAGCCTATGCTGCTCCGAGTGGTTGGCA

GTCAAACTTCAGAATCTATGG - UNI5399 - GTGAGGAAGATCTTCTGAGTCCTCCCCAGGACACAAGCACAGGGTTAGAAGAAGTGATGGAGCAAC

TCAACAACTCCTTCCCTAGTTCAAGAG - UNI9581 - GAAGAAATGCCCCCGGAAAGCCAATGAGAGAG GACACAATGTAGGAAGCCTTTTCCACA

TGGCAGATGATTTGGGCAGAGCGATGGAGTCCTTAGTTTCAGTCATGACAGATGAAGAAGGAGCAGAATAAATGTTTTACAACTCCTGATTCCCGCATGG

TTTTTATAATATTCGTACAACAAAGAGGATTAGACAGTAAGAGTTTACAAGAAATAAAATCTATATTTTTGTGAAGGGTAGTGGTACTATACTGTAGATT

TCAGTAGTTTCTAAGTCTGTTATTGTTTTGTTAACAATGGCAGGTTTTACACGTCTATGCAATTGTACAAAAAAGTTAAAAGAAAACATGTAAAATCTTG

ATAGCTAAATAACTTGCCATTTCTTTATATGGAACGCATTTTGGGTTGTTTAAAAATTTATAACAGTTATAAAGAAAGATTGTAAACTAAAGTGTGCTTT

ATAAAAAAAGTTGTTTATAAAAACCCCTAAACAAACACACACGCACACACACACACACACACACACACACACACACACGCACACATACATGCACGAACCC

ACCACACACACACACACACACACACACACTGAGGCAGCACATTGTTTTGCATTACTTTAGCGTGGTATTCATATGGAATTCATGACGTTTTTTTATTTTC

TTGCATACGAACCCCACCAAATGACTGCTTCATATTGCTCTTTTGAGAATTGTTGACTGAGTGGGGCTGGCTATGGGCTTTCATTTTATACATCTATATG

TCTACAAGTATATAAATACTATAGGTATATAGATAAATAGATATGAAGTTACTTCTTCAAATGTTCTTGCCACTTCCTAATGGAAATTGCTTCTAGTCAT

CTGGGCTTATCTGCTTGGGCAAGAGTGAATTTTCCCTGGAGCCCAAAGCCAGGAGACTACCGCCACACTAAAATATTGTCTAGGGCTCCAGATGTTTCTA

GTTTTAAACTTTCCACTGAGAGCTAGAGGATTCATTTTTTTCAAGGAACATGCGAATGAATACACAGGACTTACTATCATAGTAATTTGTTGGCTGATAT

ATTCAACTTCCTACTGTTGGGTTATATTTAATGATGTTTCTGCAATAGAACATCAGATGACATTTTTAACTCCCAGACAGTAGGAGGAAGATGGTAGGAG

CTAAAGGTTGCGGCTCCTCAGTCAATTTATATGAGGGGAGCAACAACTCTGTAAAAGAATGGATGAATATTTACAACTATACATATAAACATCTCTATAA

TTACAACTAAATTGTTCTGCCCTCTTCATAAACTCAACCTGAAGTGGGTGGTTTTGTTGTTGTTGTTGTTGTTGTTGTTGATGATGATGATGAATTTTAG

ATTTTAGATTTTTTGGGTTTTTTTTTCTTCATTGTGATGATTTTTTTTTTTAATGCTGCAAGACTTAGGATTACTGTTAAGAAAGTAACCCAATCACATT

GTGACCCTGGTGAATATCAGTCCAGAAGCCCATGAACTGCATTTGTCTCCTTTGCATTGGTTTCCCTGCAAGTAACTCCACACAGGATTGTGGGTGAGAA

GGCACAGTGGTTGGAAAGTTTTGAGAGCAAAAGCGTCTCCAAACTCTCTGGTCTAGTTGACGGGCTGAAATGTCTAAACAAATGCAAGTCATTGAACCAG

GAGAAAAAGTGCAACAGAAAGCTAAGGACTGCTAGGAAGAGCTTTACTCCTCTCATGCCAGTTTCTTCTTCTTAGCATTTAAAGAGCATTCTCTCAATAG

AAATCACTGTCCTATCATTTTGCAAATCTGTTACCTCTAACGTCAAGTGTAATTAACTTCTAGCGAGTGGGTTTTGTCCATTATTAATTGTAATTAACAT

CAAACACAGCTTCTCATGCTATTTCTACCTCACTTTGGTTTTGGGGTGTTTCTAGTAATTGTGCACACCTAATTTCACAACTTCACCACTTGTCTGTTGT

GTGGACACCAGTTTCCTTTTTTCATTTATAATTTCCAAAAGAAAACCCAAAGCTCTAAGATAACAAATTGAAATTTGGTTCTGGTCTTGCTTTCTCTCTC

TCTCTCTCCTTTATGTGGCACTGGGCATTTTCTTTATCCAAGGATTTGTTTTCACCAAGATTTAAAACAAGGGGTTCCTTTCCTACTAAGAAGTTTTAAG

TTTCATTCTAAAATCCAAGGTAGATAGAGTGCATAGTTTTGTTTTAATCTTTTCGTTTTATCTTTTAGATATTAGTTCTGGAGTGAATCTATCAAAATAT

TTGAATAAAAACTGAGAGCTTTATTGCTGATTTTAAGCATAATTTGGACATCATTTCATGTTCTTTATAACCATCAAGTATTAAAGTGTAAATCATAATC

AGTGTAACTGAAGCATAATCATCACATGGCATGTATCATCATTGTCTCCAGGTACTGGACTCTTACTTGAGTATCATAATAGATTGTGTTTTAACACCAA

CACTGTAACATTTACTAATTATTTTTTTAAACTTCAGTTTTACTGCATTTTCACAACATATCAGATTTCACCAAATATATGCCTTACTATTGTATTATAT

TACTGCTTTACTGTGTATCTCAATAAAGCACGCAGTTATGTT
Translation
MREHLKG - UNI22807 - GHETQTTCWDHPKMTELYQSLA - UNI6101 - ADLNNVRFSAYRTAMKLRRLQKALCL - UNI22934 - LDLL

SLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHNNLVNVPLCVDMCLNWLLNVYDT - UNI23949 - TGRTGRIRVLSFKTGIISLCKA

HLEDKYRY YLFKQVASSTGFCDQRRLGLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQF - UNI4390 - ANNKPEIEAALFLDWMRLEPQSMVW

LPVLHRVAAAETAKHQAKCNICKECPIIGFR RYRSLKHFNYDICQSCFFSGRVAKGHKMHYPMVEYCTP TTSGEDVRDFAKVLKNKFRTKRYFAKHPR

MGYLPVQTVLEGDNMET TPVTLINFWPVDSA APASSPQLSHDDTHSRIEHYASR RLAEMENSNGSYLNDSISPNESI IDDEHLLIQHYCQSLNQDS

PLSQPRSPAQILISLESEERGELERILADLEEENR RNLQAEYDRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKGRLEARMQILED

HNKQLESQLHRLRQLLEQ PQAEAKVNGTTVSSPSTSLQRSDSSQPMLLRVVGSQTSESMG - UNI5399 - GEEDLLSPPQDTSTGLEEVMEQLNNS

FPSSRG - UNI9581 - GRNAPGKPMRE DTM
Transcript ENSMUST00000101404
Sequence
ATGATTATTGACTGTCCAGGCTATGCTCTTAAACTCATTCTGTGAACTCATTCTTACAGAATGTAAGCAAAGTTAGCATTTTAAAGCAGGGCCCATTCGG

TTTCTGGGTTTTCTTAGGTTTGCTATGCAACAGGATCAGTGCTGTAGTCCCCGGTTCAAGCTGAAAATGTTGCACAGGAAGACATATCATGTAAAG GAT

CTCCAAGGAGAAATTGAAACTCACACAGATATCTATCACAATCTTGATGAAAATGGCCAAAAAATCCTGAGATCCCTGGAAGGTTCGGATGAAGCACCCC

TGTTACAAAGACGTTTGGATAACATGAATTTCAAGTGGAGTGAACTTCAGAAAAAGTCTCTCAACATTAG GTCCCATTTGGAAGCAAGTTCTGACCAGT

GGAAGCGTTTGCATCTTTCTCTTCAGGAACTTCTTGTTTGGCTACAGCTGAAAGATGATGAACTGAGCCGTCAGGCACCCATCGGTGGTGATTTCCCAGC

AGTTCAGAAGCAGAATGATATACATAGG GCCTTCAAGAGGGAATTGAAAACTAAAGAACCTGTAATCATGAGTACTCTGGAGACTGTGAGAATATTTCT

GACAGAGCAGCCTTTGGAAGGACTAGAGAAACTCTACCAGGAGCCCAGAG AACTGCCTCCTGAAGAAAGAGCTCAGAATGTCACTCGGCTCCTACGAAA

GCAGGCTGAAGAGGTCAACGCTGAATGGGACAAATTGAACCTGCGCTCAGCTGATTGGCAGAGAAAAATAGATGAAGCTCTTGAAAGACTCCAGGAACTT

CAGGAAGCTGCCGATGAACTGGACCTCAAGTTGCGCCAAGCTGAGGTGATCAAGGGATCCTGGCAGCCAGTGGGGGATCTCCTCATTGACTCTCTGCAAG

ATCACCTTGAAAAAGTCAAG GCACTTCGGGGAGAAATTGCACCTCTTAAAGAGAATGTCAATCGTGTCAATGACCTTGCACATCAGCTGACCACACTGG

GCATTCAGCTCTCACCTTATAACCTCAGCACTTTGGAAGATCTGAATACCAGATGGAGGCTTCTACAG - UNI35867 - GTGGCTGTGGAGGACCGT

GTCAGACAGCTGCATGAAGCCCACAGGGACTTTGGTCCTGCATCCCAGCACTTCCTTTCCA CTTCAGTTCAGGGTCCCTGGGAGAGAGCCATCTCACCA

AACAAAGTGCCCTACTATATCAAGTAAGTCAAAAGCATTTATGTACCTGATCTGTATGTTCATGAATGAATTTCTTTTTAAAAGTCATTATTCTCATCTA

CAATATCTATTTCTTTTTACGCCTAAGGAAGCATAGATAATAAATTCATACTGTAAATTTGTTGATGCAATATGAAATCACTTTTATGAACAATACATTT

TAGGCAGCATATATGGTGTGTTTTGGAGGGTGAGCAGAATTTTTACAGCAGGGCAAAAATATCCAGTAAAATCATTGATAGTGGGAGTGAATAATTGACA

GGAAAATATTATTTTTGTAGGTGTTTCTGCATCTCTTATTCTTCAGAGGTAAAAGGTCTATCAAGTGATACTTACACAAACGGTTGAAATGAATTACGTG

TCATATAACCCATAGGAGGCAGACAGTCCAGAATTTATGGAGCTGAGTTGCTCATTCTCAACATATAGCTCTTTGACTTAGTTACCTAATAATAGCTTCA

GAGTGGAACTCTTTTTTTCTGGGACCATTTGTGACAATCAAAGGAAAAGAGTCAAAGGATAGATGTTCTATGAAACATAAATTATGATATATTGCCTTGA

CACGGAGAAGAAAGGATATTGAGTTATTATGATGCTTTAACAGTCTCTCATCAATCATGCAAGAATCTAACTTAAGTCTAACTTTGAATTGGCTCACCCA

ACAACACTTAGAAGGAACGGTGTTTATGAAATAGTCTTTAGTAGCAACTAATGTTTTTGAACCTTTTTTTTTTTTGGCATGAATCTGGAAACAAATATTA

ACTGTTTCCCCAGTTATTCAGCCTGCCATCTCTCATCAAGTCTGTTTTTATTCTTCCAACCCATCAATTCTTCATAATTCAGCTCCAAAGCAGTCACTAC

AGTCCTAGATCTTCATTACTTCCGTTCTCAAGAAACGTAATTTCAGGCCTCTATTTTGTTAACTC
Translation
MQQDQCCSPRFKLKMLHRKTYHVK DLQGEIETHTDIYHNLDENGQKILRSLEGSDEAPLLQRRLDNMNFKWSELQKKSLNIR RSHLEASSDQWKRLHL

SLQELLVWLQLKDDELSRQAPIGGDFPAVQKQNDIHR AFKRELKTKEPVIMSTLETVRIFLTEQPLEGLEKLYQEPRE ELPPEERAQNVTRLLRKQAE

EVNAEWDKLNLRSADWQRKIDEALERLQELQEAADELDLKLRQAEVIKGSWQPVGDLLIDSLQDHLEKVK ALRGEIAPLKENVNRVNDLAHQLTTLGIQ

LSPYNLSTLEDLNTRWRLLQ - UNI35867 - VAVEDRVRQLHEAHRDFGPASQHFLST TSVQGPWERAISPNKVPYYIK
Transcript ENSMUST00000113992
Sequence
CTCACTGCCTGTGAAACCCTTACAACCATGAGGGAACACCTCAAAGG - UNI22807 - CCACGAGACCCAAACCACTTGTTGGGACCACCCCAAAAT

GACAGAGCTCTACCAGTCTTTAG - UNI6101 - CTGACCTGAATAATGTCAGGTTCTCCGCGTATAGGACTGCCATGAAGCTCAGAAGGCTCCAGAA

GGCCCTTTGCT - UNI22934 - TGGATCTCTTGAGCCTGTCAGCTGCATGTGATGCCCTGGACCAGCACAACCTCAAGCAAAATGACCAGCCCATGG

ATATCCTGCAGATAATTAACTGTTTGACTACAATTTATGATCGTCTGGAGCAAGAGCACAACAATCTGGTCAATGTCCCTCTCTGTGTGGATATGTGTCT

CAACTGGCTTCTCAATGTTTATGATAC - UNI23949 - GGGACGAACAGGGAGGATCCGTGTCCTGTCTTTTAAAACTGGCATCATTTCTCTGTGTA

AAGCACACTTGGAAGACAAGTACAGAT ACCTTTTCAAGCAAGTGGCAAGTTCAACTGGCTTTTGTGACCAGCGTAGGCTGGGTCTTCTTCTGCATGATT

CTATTCAAATCCCAAGACAGTTGGGTGAAGTTGCTTCCTTTGGGGGCAGTAACATTGAGCCGAGTGTCAGGAGCTGCTTCCAATTT - UNI4390 - G

CCAATAATAAACCTGAGATTGAAGCTGCTCTCTTCCTTGACTGGATGCGCCTGGAACCCCAGTCTATGGTGTGGCTGCCCGTCTTGCACAGAGTGGCTGC

TGCTGAAACTGCCAAGCATCAAGCCAAGTGTAACATCTGTAAGGAGTGTCCAATCATTGGATTCAG GTACAGAAGCCTAAAGCATTTTAATTATGACAT

CTGCCAAAGTTGCTTTTTTTCTGGCCGAGTTGCAAAGGGCCATAAAATGCACTACCCCATGGTAGAGTATTGCACTCCG ACTACATCCGGAGAAGATGT

TCGCGACTTCGCCAAGGTACTAAAAAACAAATTTCGAACCAAAAGGTATTTTGCGAAGCATCCCCGAATGGGCTACCTGCCAGTGCAGACTGTGTTAGAG

GGGGACAACATGGAAAC GCCTGCCTCGTCCCCCCAGCTTTCACACGATGATACTCATTCACGCATTGAACATTATGCTAGCAG GCTAGCAGAAATGGA

AAACAGCAATGGATCTTATCTAAATGATAGCATCTCTCCTAATGAGAGCAT AGATGATGAACATTTGTTAATCCAGCATTACTGCCAAAGTTTGAACCA

GGACTCCCCCCTGAGCCAGCCTCGTAGTCCTGCCCAGATCTTGATTTCCTTAGAGAGTGAGGAAAGAGGGGAGCTAGAGAGAATCCTAGCAGATCTTGAG

GAAGAAAACAG GAATCTGCAAGCAGAATATGATCGCCTGAAGCAGCAGCATGAGCATAAAGGCCTGTCTCCACTGCCATCTCCTCCTGAGATGATGCCC

ACCTCTCCTCAGAGTCCCAGGGATGCTGAGCTCATTGCTGAGGCTAAGCTACTGCGCCAACACAAAGGACGCCTGGAAGCCAGGATGCAAATCCTGGAAG

ACCACAATAAACAGCTGGAGTCTCAGTTACATAGACTGAGACAGCTCCTGGAGCAG CCCCAGGCTGAAGCTAAGGTGAATGGCACCACGGTGTCCTCTC

CTTCCACCTCTCTGCAGAGGTCAGATAGCAGTCAGCCTATGCTGCTCCGAGTGGTTGGCAGTCAAACTTCAGAATCTATGG - UNI5399 - GTGAGG

AAGATCTTCTGAGTCCTCCCCAGGACACAAGCACAGGGTTAGAAGAAGTGATGGAGCAACTCAACAACTCCTTCCCTAGTTCAAGAG - UNI9581 - 

GAAGAAATGCCCCCGGAAAGCCAATGAGAGAG GACACAATGTAGGAAGCCTTTTCCACATGGCAGATGATTTGGGCAGAGCGATGGAGTCCTTAGTTTC

AGTCATGACAGATGAAGAAGGAGCAGAATAAATGTTTTACAACTCCTGATTCCCGCATGGTTTTTATAATATTCGTACAACAAAGAGGATTAGACAGTAA

GAGTTTACAAGAAATAAAATCTATATTTTTGTGAAGGGTAGTGGTACTATACTGTAGATTTCAGTAGTTTCTAAGTCTGTTATTGTTTTGTTAACAATGG

CAGGTTTTACACGTCTATGCAATTGTACAAAAAAGTTAAAAGAAAACATGTAAAATCTTGATAGCTAAATAACTTGCCATTTCTTTATATGGAACGCATT

TTGGGTTGTTTAAAAATTTATAACAGTTATAAAGAAAGATTGTAAACTAAAGTGTGCTTTATAAAAAAAGTTGTTTATAAAAACCCCTAAACAAACACAC

ACGCACACACACACACACACACACACACACACACACACGCACACATACATGCACGAACCCACCACACACACACACACACACACACACACTGAGGCAGCAC

ATTGTTTTGCATTACTTTAGCGTGGTATTCATATGGAATTCATGACGTTTTTTTATTTTCTTGCATACGAACCCCACCAAATGACTGCTTCATATTGCTC

TTTTGAGAATTGTTGACTGAGTGGGGCTGGCTATGGGCTTTCATTTTATACATCTATATGTCTACAAGTATATAAATACTATAGGTATATAGATAAATAG

ATATGAAGTTACTTCTTCAAATGTTCTTGCCACTTCCTAATGGAAATTGCTTCTAGTCATCTGGGCTTATCTGCTTGGGCAAGAGTGAATTTTCCCTGGA

GCCCAAAGCCAGGAGACTACCGCCACACTAAAATATTGTCTAGGGCTCCAGATGTTTCTAGTTTTAAACTTTCCACTGAGAGCTAGAGGATTCATTTTTT

TCAAGGAACATGCGAATGAATACACAGGACTTACTATCATAGTAATTTGTTGGCTGATATATTCAACTTCCTACTGTTGGGTTATATTTAATGATGTTTC

TGCAATAGAACATCAGATGACATTTTTAACTCCCAGACAGTAGGAGGAAGATGGTAGGAGCTAAAGGTTGCGGCTCCTCAGTCAATTTATATGAGGGGAG

CAACAACTCTGTAAAAGAATGGATGAATATTTACAACTATACATATAAACATCTCTATAATTACAACTAAATTGTTCTGCCCTCTTCATAAACTCAACCT

GAAGTGGGTGGTTTTGTTGTTGTTGTTGTTGTTGTTGTTGATGATGATGATGAATTTTAGATTTTAGATTTTTTGGGTTTTTTTTTCTTCATTGTGATGA

TTTTTTTTTTTAATGCTGCAAGACTTAGGATTACTGTTAAGAAAGTAACCCAATCACATTGTGACCCTGGTGAATATCAGTCCAGAAGCCCATGAACTGC

ATTTGTCTCCTTTGCATTGGTTTCCCTGCAAGTAACTCCACACAGGATTGTGGGTGAGAAGGCACAGTGGTTGGAAAGTTTTGAGAGCAAAAGCGTCTCC

AAACTCTCTGGTCTAGTTGACGGGCTGAAATGTCTAAACAAATGCAAGTCATTGAACCAGGAGAAAAAGTGCAACAGAAAGCTAAGGACTGCTAGGAAGA

GCTTTACTCCTCTCATGCCAGTTTCTTCTTCTTAGCATTTAAAGAGCATTCTCTCAATAGAAATCACTGTCCTATCATTTTGCAAATCTGTTACCTCTAA

CGTCAAGTGTAATTAACTTCTAGCGAGTGGGTTTTGTCCATTATTAATTGTAATTAACATCAAACACAGCTTCTCATGCTATTTCTACCTCACTTTGGTT

TTGGGGTGTTTCTAGTAATTGTGCACACCTAATTTCACAACTTCACCACTTGTCTGTTGTGTGGACACCAGTTTCCTTTTTTCATTTATAATTTCCAAAA

GAAAACCCAAAGCTCTAAGATAACAAATTGAAATTTGGTTCTGGTCTTGCTTTCTCTCTCTCTCTCTCCTTTATGTGGCACTGGGCATTTTCTTTATCCA

AGGATTTGTTTTCACCAAGATTTAAAACAAGGGGTTCCTTTCCTACTAAGAAGTTTTAAGTTTCATTCTAAAATCCAAGGTAGATAGAGTGCATAGTTTT

GTTTTAATCTTTTCGTTTTATCTTTTAGATATTAGTTCTGGAGTGAATCTATCAAAATATTTGAATAAAAACTGAGAGCTTTATTGCTGATTTTAAGCAT

AATTTGGACATCATTTCATGTTCTTTATAACCATCAAGTATTAAAGTGTAAATCATAATCAGTGTAACTGAAGCATAATCATCACATGGCATGTATCATC

ATTGTCTCCAGGTACTGGACTCTTACTTGAGTATCATAATAGATTGTGTTTTAACACCAACACTGTAACATTTACTAATTATTTTTTTAAACTTCAGTTT

TACTGCATTTTCACAACATATCAGATTTCACCAAATATATGCCTTACTATTGTATTATATTACTGCTTTACTGTGTATCTCAATAAAGCACGCAGTTATG

TT
Translation
MREHLKG - UNI22807 - GHETQTTCWDHPKMTELYQSLA - UNI6101 - ADLNNVRFSAYRTAMKLRRLQKALCL - UNI22934 - LDLL

SLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHNNLVNVPLCVDMCLNWLLNVYDT - UNI23949 - TGRTGRIRVLSFKTGIISLCKA

HLEDKYRY YLFKQVASSTGFCDQRRLGLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQF - UNI4390 - ANNKPEIEAALFLDWMRLEPQSMVW

LPVLHRVAAAETAKHQAKCNICKECPIIGFR RYRSLKHFNYDICQSCFFSGRVAKGHKMHYPMVEYCTP TTSGEDVRDFAKVLKNKFRTKRYFAKHPR

MGYLPVQTVLEGDNMET TPASSPQLSHDDTHSRIEHYASR RLAEMENSNGSYLNDSISPNESI IDDEHLLIQHYCQSLNQDSPLSQPRSPAQILISL

ESEERGELERILADLEEENR RNLQAEYDRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKGRLEARMQILEDHNKQLESQLHRLRQL

LEQ PQAEAKVNGTTVSSPSTSLQRSDSSQPMLLRVVGSQTSESMG - UNI5399 - GEEDLLSPPQDTSTGLEEVMEQLNNSFPSSRG - UNI958

1 - GRNAPGKPMRE DTM
Transcript ENSMUST00000113994
Sequence
AGATTGCAACAACTGAGATGATTTTGCTAATGTTTTCTTCCAGCTTCTTCCTTTGATGTGGATGATACTGTGACTTTTTCGTGCAGATACCTCTGTTCAG

TGTACCAAGTGTTTTTTTTTCTTTTCCTTTTATTTATATAAGCAAAGCTGAATGAGTGCTCGAAAGCTACGCAATCTGTCTTACAAAAAG GCTGTGAGG

AAACAAAAGTTGCTTGAACAGAGTATCCAGTCTGCCCAGGAAATTGAAAAGTCCTTGCACTTAATTCAGGAGTCGCTTGAATTCATTGACAAGCAGTTGG

CAGCTTATATCACTGACAAGGTGGATGCAGCTCAAATGCCTCAGGAAGCCCAG AAAATCCAATCAGATTTGACAAGTCATGAGATAAGTTTAGAAGAAA

TGAAGAAACATAACCAGGGGAAGGATGCCAACCAAAGGGTTCTTTCACAAATTGATGTTGCACAG AAAAAATTACAAGATGTCTCCATGAAATTTCGAT

TATTCCAAAAACCAGCCAATTTTGAACAACGTCTAGAGGAAAGTAAGATGATTTTAGATGAAGTCAAGATGCATTTGCCTGCATTGGAAACCAAGAGTGT

TGAACAGGAAGTAATTCAGTCACAACTAAGTCATTGTGTG AACTTGTATAAAAGCCTGAGTGAAGTCAAGTCTGAAGTGGAAATGGTGATTAAAACCGG

ACGTCAAATTGTACAGAAAAAGCAGACAGAAAATCCCAAAGAGCTTGATGAACGAGTAACAGCTTTGAAATTGCATTACAATGAGTTGGGTGCGAAG GT

AACAGAGAGAAAGCAACAGTTGGAGAAATGCTTGAAGTTGTCCCGTAAGATGAGAAAGGAAATGAATGTCTTA
Translation
MSARKLRNLSYKK AVRKQKLLEQSIQSAQEIEKSLHLIQESLEFIDKQLAAYITDKVDAAQMPQEAQ KIQSDLTSHEISLEEMKKHNQGKDANQRVLS

QIDVAQ KKLQDVSMKFRLFQKPANFEQRLEESKMILDEVKMHLPALETKSVEQEVIQSQLSHCV NLYKSLSEVKSEVEMVIKTGRQIVQKKQTENPKE

LDERVTALKLHYNELGAK VTERKQQLEKCLKLSRKMRKEMNVL
Transcript ENSMUST00000057711
Sequence
GATGTTGCTACCACTTATCCAGACAAGAAGTCCATCTTAATGTACATCACATCACTCTTTCAAGTTTTGCCACAACAAGTGAGCATTGAAGCCATTCAAG

AAGTGGAAATGTTGCCCAGGACATCTTCAAAAGTAACTAGAGAAGAACATTTTCAATTACATCACCAGATGCATTACTCTCAACAG ATCACAGTCAGTC

TAGCACAGGGCTATGAACAAACTTCTTCATCTCCTAAGCCTCGATTCAAGAGTTATGCCTTCACACAGGCTGCTTATGTTGCCACCTCTGATTCCACACA

GAGCCCCTATCCTTCACAG CATTTGGAAGCTCCCAGAGACAAGTCACTTGACAGTTCATTGATGGAGACGGAAGTAAATCTGGATAGTTACCAAACTGC

TTTAGAAGAAGTACTTTCATGGCTTCTTTCTGCCGAGGATACATTGCGAGCACAAGGAGAGATTTCAAATGATGTTGAAGAAGTGAAAGAACAGTTTCAT

GCTCATGAG GGATTCATGATGGATCTGACATCTCATCAAGGACTTGTTGGTAATGTTCTACAGTTAGGAAGTCAACTAGTTGGAAAAGGGAAATTATCA

GAAGATGAAGAAGCTGAAGTGCAAGAACAAATGAATCTCCTAAATTCAAGATGGGAATGTCTCAGGGTAGCTAGCATGGAAAAACAAAGCAA ATTACAC

AAAGTTCTAATGGATCTCCAGAATCAGAAATTAAAAGAACTAGATGACTGGTTAACAAAAACTGAAGAGAGAACTAAGAAAATGGAGGAAGAGCCCTTTG

GACCTGATCTTGAAGATCTAAAATGCCAAGTACAACAACATAAG GTGCTTCAAGAAGATCTAGAACAGGAGCAGGTCAGGGTCAACTCGCTCACTCACA

TGGTAGTAGTGGTTGATGAATCCAGCGGTGATCATGCAACAGCTGCTTTGGAAGAACAACTTAAG GTACTGGGAGATCGATGGGCAAATATCTGCAGAT

GGACTGAAGACCGCTGGATTGTTTTACAAGATATTCTTCTAAAATGGCAGCATTTTACTGAAGAACAG TGCCTTTTTAGTACATGGCTTTCAGAAAAAG

AAGATGCAATGAAGAACATTCAGACAAGTGGCTTTAAAGATCAAAATGAAATGATGTCAAGTCTTCACAAAATATCT ACTTTAAAAATAGATCTAGAAA

AGAAAAAGCCAACCATGGAAAAACTAAGTTCACTCAATCAAGATCTACTTTCGGCACTGAAAAATAAGTCAGTGACTCAAAAGATGGAAATCTGGATGGA

AAACTTTGCACAACGTTGGGACAATTTAACCCAAAAACTTGAAAAGAGTTCAGCACAA ATTTCACAGGCTGTCACCACCACTCAACCATCCCTAACACA

GACAACTGTAATGGAAACGGTAACTATGGTGACCACAAGGGAACAAATCATGGTAAAACATGCCCAAGAGGAACTTCCACCACCACCTCCTCAAAAGAAG

AGGCAGATAACTGTGGATTCTGAACTCAGGAAAAG GTTGGATGTCGATATAACTGAACTTCACAGTTGGATTACTCGTTCAGAAGCTGTATTACAGAGT

TCTGAATTTGCAGTCTATCGAAAAGAAGGCAACATCTCAGACTTGCAAGAAAAAGTCAAT GCCATAGCACGAGAAAAAGCAGAGAAGTTCAGAAAACTG

CAAGATGCCAGCAGATCAGCTCAGGCCCTGGTGGAACAGATGGCAAATG AGGGTGTTAATGCTGAAAGTATCAGACAAGCTTCAGAACAACTGAACAGC

CGGTGGACAGAATTCTGCCAATTGCTGAGTGAGAGAGTTAACTGGCTAGAGTATCAAACCAACATCATTACCTTTTATAATCAGCTACAACAATTGGAAC

AGATGACAACTACTGCCGAAAACTTGTTGAAAACCCAGTCTACCACCCTATCAGAGCCAACAGCAATTAAAAGCCAGTTAAAAATTTGTAAG GATGAAG

TCAACAGATTGTCAGCTCTTCAGCCTCAAATTGAGCAATTAAAAATTCAGAGTCTACAACTGAAAGAAAAGGGACAGGGGCCAATGTTTCTGGATGCAGA

CTTTGTGGCCTTTACTAATCATTTTAACCACATCTTTGATGGTGTGAGGGCCAAAGAGAAAGAGCTACAGACAA TTTTTGACACTTTACCACCAATGCG

CTATCAGGAGACAATGAGTAGCATCAGGACGTGGATCCAGCAGTCAGAAAGCAAACTCTCTGTACCTTATCTTAGTGTTACTGAATATGAAATAATGGAG

GAGAGACTCGGGAAATTACAG GCTCTGCAAAGTTCTTTGAAAGAGCAACAAAATGGCTTCAACTATCTGAGTGACACTGTGAAGGAGATGGCCAAGAAA

GCACCTTCAGAAATATGCCAGAAATATCTGTCAGAATTTGAAGAGATTGAGGGGCACTGGAAGAAACTTTCCTCCCAGTTGGTGGAAAGCTGCCAAAAGC

TAGAAGAACATATGAATAAACTTCGAAAATTTCAG AATCACATAAAAACCTTACAGAAATGGATGGCTGAAGTTGATGTTTTCCTGAAAGAGGAATGGC

CTGCCCTGGGGGATGCTGAAATCCTGAAAAAACAGCTCAAACAATGCAGA CTTTTAGTTGGTGATATTCAAACAATTCAGCCCAGTTTAAATAGTGTTA

ATGAAGGTGGGCAGAAGATAAAGAGTGAAGCTGAACTTGAGTTTGCATCCAGACTGGAGACAGAACTTAGAGAGCTTAACACTCAGTGGGATCACATATG

CCGCCAG GTCTACACCAGAAAGGAAGCCTTAAAGGCAGGTTTGGATAAAACCGTAAGCCTCCAAAAAGATCTATCAGAGATGCATGAGTGGATGACACA

AGCTGAAGAAGAATATCTAGAGAGAGATTTTGAATATAAAACTCCAGATGAATTACAGACTGCTGTTGAAGAAATGAAG AGAGCTAAAGAAGAGGCACT

ACAAAAAGAAACTAAAGTGAAACTCCTTACTGAGACTGTAAATAGTGTAATAGCTCACGCTCCACCCTCAGCACAAGAGGCCTTAAAAAAGGAACTTGAA

ACTCTGACCACCAACTACCAATGGCTGTGCACCAGGCTGAATGGAAAATGCAAAACTTTGGAA GAAGTTTGGGCATGTTGGCATGAGTTATTGTCATAT

TTAGAGAAAGCAAACAAGTGGCTCAATGAAGTAGAATTGAAACTTAAAACCATGGAAAATGTTCCTGCAGGACCTGAGGAAATCACTGAAGTGCTAGAA 

TCTCTTGAAAATCTGATGCATCATTCAGAGGAGAACCCAAATCAGATTCGTCTATTGGCACAGACTCTTACAGATGGAGGAGTCATGGATGAACTGATCA

ATGAGGAGCTTGAGACGTTTAATTCTCGTTGGAGGGAACTACATGAAGAG GCTGTGAGGAAACAAAAGTTGCTTGAACAGAGTATCCAGTCTGCCCAGG

AAATTGAAAAGTCCTTGCACTTAATTCAGGAGTCGCTTGAATTCATTGACAAGCAGTTGGCAGCTTATATCACTGACAAGGTGGATGCAGCTCAAATGCC

TCAGGAAGCCCAG AAAATCCAATCAGATTTGACAAGTCATGAGATAAGTTTAGAAGAAATGAAGAAACATAACCAGGGGAAGGATGCCAACCAAAGGGT

TCTTTCACAAATTGATGTTGCACAG AAAAAATTACAAGATGTCTCCATGAAATTTCGATTATTCCAAAAACCAGCCAATTTTGAACAACGTCTAGAGGA

AAGTAAGATGATTTTAGATGAAGTCAAGATGCATTTGCCTGCATTGGAAACCAAGAGTGTTGAACAGGAAGTAATTCAGTCACAACTAAGTCATTGTGTG

 AACTTGTATAAAAGCCTGAGTGAAGTCAAGTCTGAAGTGGAAATGGTGATTAAAACCGGACGTCAAATTGTACAGAAAAAGCAGACAGAAAATCCCAAA

GAGCTTGATGAACGAGTAACAGCTTTGAAATTGCATTACAATGAGTTGGGTGCGAAG GTAACAGAGAGAAAGCAACAGTTGGAGAAATGCTTGAAGTTG

TCCCGTAAGATGAGAAAGGAAATGAATGTCTTAACAGAATGGCTGGCAGCAACAGATACAGAATTGACGAAGAGATCAGCAGTTGAAGGAATGCCAAGTA

ATTTGGATTCTGAAGTTGCCTGGGGAAAG GCTACTCAAAAAGAGATTGAGAAACAGAAGGCTCACTTGAAGAGTGTTACAGAATTAGGAGAGTCTTTGA

AAATGGTGTTGGGCAAGAAAGAAACCTTGGTAGAAGATAAACTGAGTCTTCTGAACAGTAACTGGATAGCTGTCACCTCCAGAGTAGAAGAATGGCTAAA

TCTTTTGTTG GAATACCAGAAACACATGGAAACCTTTGATCAGAACATAGAACAAATCACAAAGTGGATCATTCATGCAGATGAACTTTTAGATGAGTC

TGAAAAGAAGAAACCACAACAAAAGGAAGACATTCTTAAG CGTTTAAAGGCTGAAATGAATGACATGCGCCCAAAGGTGGACTCCACACGTGACCAAGC

AGCAAAATTGATGGCAAACCGCGGTGACCACTGCAGGAAAGTAGTAGAGCCCCAAATCTCTGAGCTCAACCGTCGATTTGCAGCTATTTCTCACAGAATT

AAGACTGGAAAG GCCTCCATTCCTTTGAAGGAATTGGAGCAGTTTAACTCAGATATACAAAAATTGCTTGAACCACTGGAGGCTGAAATTCAGCAGGGG

GTGAATCTGAAAGAGGAAGACTTCAATAAAGATATG AGTGAAGACAATGAGGGTACTGTAAATGAATTGTTGCAAAGAGGAGACAACTTACAACAAAGA

ATCACAGATGAGAGAAAGCGAGAGGAAATAAAGATAAAACAGCAGCTGTTACAGACAAAACATAATGCTCTCAAG GATTTGAGGTCTCAAAGAAGAAAA

AAGGCCCTAGAAATTTCTCACCAGTGGTATCAGTACAAGAGGCAGGCTGATGATCTCCTGAAATGCTTGGATGAAATTGAAAAAAAATTAGCCAGCCTAC

CTGAACCCAGAGATGAAAGAAAATTAAAG GAAATTGATCGTGAATTGCAGAAGAAGAAAGAGGAGCTGAATGCAGTGCGCAGGCAAGCTGAGGGCTTGT

CTGAGAATGGGGCCGCAATGGCAGTGGAGCCAACTCAGATCCAGCTCAGCAAGCGCTGGCGGCAAATTGAGAGCAATTTTGCTCAGTTTCGAAGACTCAA

CTTTGCACAAATT CACACTCTCCATGAAGAAACTATGGTAGTGACGACTGAAGATATGCCTTTGGATGTTTCTTATGTGCCTTCTACTTATTTGACCGA

GATCAGTCATATCTTACAAGCTCTTTCAGAAGTTGATCATCTTCTAAATACTCCTGAACTCTGTGCTAAAGATTTTGAAGATCTTTTTAAGCAAGAGGAG

TCTCTTAAG AATATAAAAGACAATTTGCAACAAATCTCAGGTCGGATTGATATTATTCACAAGAAGAAGACAGCAGCCTTGCAAAGTGCCACCTCCATG

GAAAAGGTGAAAGTACAGGAAGCCGTGGCACAGATGGATTTCCAGGGGGAAAAACTTCATAGAATGTACAAGGAACGACAAGG GCGATTCGACAGATCA

GTTGAAAAATGGCGACACTTTCATTATGATATGAAGGTATTTAATCAATGGCTGAATGAAGTTGAACAGTTTTTCAAAAAGACACAAAATCCTGAAAACT

GGGAACATGCTAAATACAAATGGTATCTTAAG GAACTCCAGGATGGCATTGGGCAGCGTCAAGCTGTTGTCAGAACACTGAATGCAACTGGGGAAGAAA

TAATTCAACAGTCTTCAAAAACAGATGTCAATATTCTACAAGAAAAATTAGGAAGCTTGAGTCTGCGGTGGCACGACATCTGCAAAGAGCTGGCAGAAAG

GAGAAAGAG GATTGAAGAACAAAAGAATGTCTTGTCAGAATTTCAAAGAGATTTAAATGAATTTGTTTTGTGGCTGGAAGAAGCAGATAACATTGCTAT

TACTCCACTTGGAGATGAGCAGCAGCTAAAAGAACAACTTGAACAAGTCAAG TTACTGGCAGAAGAGTTGCCCCTGCGCCAGGGAATTCTAAAACAATT

AAATGAAACAGGAGGAGCAGTACTTGTAAGTGCTCCCATAAGGCCAGAAGAGCAAGATAAACTTGAAAAGAAGCTCAAACAGACAAATCTCCAGTGGATA

AAG GTCTCCAGAGCTTTACCTGAGAAACAAGGAGAGCTTGAGGTTCACTTAAAAGATTTTAGGCAGCTTGAAGAGCAGCTGGATCACCTGCTTCTGTGG

CTCTCTCCTATTAGAAACCAGTTGGAAATTTATAACCAACCAAGTCAGGCAGGACCGTTTGACATAAAG GAGATTGAAGTAACAGTTCACGGTAAACAA

GCGGATGTGGAAAGGCTTTTGTCGAAAGGGCAGCATTTGTATAAGGAAAAACCAAGCACTCAGCCAGTGAAG AGGAAGTTAGAAGATCTGAGGTCTGAG

TGGGAGGCTGTAAACCATTTACTTCGGGAGCTGAGGACAAAGCAGCCTGACCGTGCCCCTGGACTGAGCACTACTGGAGCCT - UNI33489 - CTGC

CAGTCAGACTGTTACTCTAGTGACACAATCTGTGGTTACTAAGGAAACTGTCATCTCCAAACTAGAAATGCCATCTTCTTTGCTGTTGGAGGTACCTGCA

CTGGCAGACTTCAACCGAGCTTGGACAGAACTTACAGACTGGCTGTCTCTGCTTGATCGAGTTATAAAATCACAGAGAGTGATGGTGGGTGATCTGGAAG

ACATCAATGAAATGATCATCAAACAGAAG - UNI32672 - GCAACACTGCAAGATTTGGAACAGAGACGCCCCCAATTGGAAGAACTCATTACTGCT

GCCCAGAATTTGAAAAACAAAACCAGCAATCAAGAAGCTAGAACAATCATTACTGATCGAA TTGAAAGAATTCAGATTCAGTGGGATGAGGTTCAAGAA

CAGCTGCAGAACAGGAGACAACAGTTGAATGAAATGTTAAAGGATTCAACACAATGGCTGGAAGCTAAGGAAGAAGCCGAACAGGTCATAGGACAGGTCA

GAGGCAAGCTTGACTCATGGAAAGAAGGTCCTCACACAGTAGATGCAATCCAAAAGAAGATCACAGAAACCAAG CAGTTGGCCAAAGACCTCCGTCAAC

GGCAGATAAGTGTAGACGTGGCAAATGATTTGGCACTGAAACTTCTTCGGGACTATTCTGCTGATGATACCAGAAAAGTACACATGATAACAGAGAATAT

CAATACTTCTTGGGGAAACATTCATAAAAG AGTAAGTGAGCAAGAGGCTGCTTTGGAAGAAACTCATAGATTACTGCAGCAGTTCCCTCTGGACCTGGA

GAAGTTTCTTTCCTGGATTACGGAAGCAGAAACAACTGCCAATGTCCTACAGGACGCTTCCCGTAAGGAGAAGCTCCTAGAAGACTCCAGGGGAGTCAGA

GAGCTGATGAAACCATGGCAA GATCTCCAAGGAGAAATTGAAACTCACACAGATATCTATCACAATCTTGATGAAAATGGCCAAAAAATCCTGAGATCC

CTGGAAGGTTCGGATGAAGCACCCCTGTTACAAAGACGTTTGGATAACATGAATTTCAAGTGGAGTGAACTTCAGAAAAAGTCTCTCAACATTAG GTCC

CATTTGGAAGCAAGTTCTGACCAGTGGAAGCGTTTGCATCTTTCTCTTCAGGAACTTCTTGTTTGGCTACAGCTGAAAGATGATGAACTGAGCCGTCAGG

CACCCATCGGTGGTGATTTCCCAGCAGTTCAGAAGCAGAATGATATACATAGG GCCTTCAAGAGGGAATTGAAAACTAAAGAACCTGTAATCATGAGTA

CTCTGGAGACTGTGAGAATATTTCTGACAGAGCAGCCTTTGGAAGGACTAGAGAAACTCTACCAGGAGCCCAGAG AACTGCCTCCTGAAGAAAGAGCTC

AGAATGTCACTCGGCTCCTACGAAAGCAGGCTGAAGAGGTCAACGCTGAATGGGACAAATTGAACCTGCGCTCAGCTGATTGGCAGAGAAAAATAGATGA

AGCTCTTGAAAGACTCCAGGAACTTCAGGAAGCTGCCGATGAACTGGACCTCAAGTTGCGCCAAGCTGAGGTGATCAAGGGATCCTGGCAGCCAGTGGGG

GATCTCCTCATTGACTCTCTGCAAGATCACCTTGAAAAAGTCAAG GCACTTCGGGGAGAAATTGCACCTCTTAAAGAGAATGTCAATCGTGTCAATGAC

CTTGCACATCAGCTGACCACACTGGGCATTCAGCTCTCACCTTATAACCTCAGCACTTTGGAAGATCTGAATACCAGATGGAGGCTTCTACAG - UNI3

5867 - GTGGCTGTGGAGGACCGTGTCAGACAGCTGCATGAAGCCCACAGGGACTTTGGTCCTGCATCCCAGCACTTCCTTTCCA CTTCAGTTCAGGG

TCCCTGGGAGAGAGCCATCTCACCAAACAAAGTGCCCTACTATATCAA - UNI22807 - - UNI31850 - CCACGAGACCCAAACCACTTGTTGG

GACCACCCCAAAATGACAGAGCTCTACCAGTCTTTAG - UNI6101 - CTGACCTGAATAATGTCAGGTTCTCCGCGTATAGGACTGCCATGAAGCTC

AGAAGGCTCCAGAAGGCCCTTTGCT - UNI22934 - TGGATCTCTTGAGCCTGTCAGCTGCATGTGATGCCCTGGACCAGCACAACCTCAAGCAAAA

TGACCAGCCCATGGATATCCTGCAGATAATTAACTGTTTGACTACAATTTATGATCGTCTGGAGCAAGAGCACAACAATCTGGTCAATGTCCCTCTCTGT

GTGGATATGTGTCTCAACTGGCTTCTCAATGTTTATGATAC - UNI23949 - GGGACGAACAGGGAGGATCCGTGTCCTGTCTTTTAAAACTGGCAT

CATTTCTCTGTGTAAAGCACACTTGGAAGACAAGTACAGAT ACCTTTTCAAGCAAGTGGCAAGTTCAACTGGCTTTTGTGACCAGCGTAGGCTGGGTCT

TCTTCTGCATGATTCTATTCAAATCCCAAGACAGTTGGGTGAAGTTGCTTCCTTTGGGGGCAGTAACATTGAGCCGAGTGTCAGGAGCTGCTTCCAATTT

 - UNI4390 - GCCAATAATAAACCTGAGATTGAAGCTGCTCTCTTCCTTGACTGGATGCGCCTGGAACCCCAGTCTATGGTGTGGCTGCCCGTCTTG

CACAGAGTGGCTGCTGCTGAAACTGCCAAGCATCAAGCCAAGTGTAACATCTGTAAGGAGTGTCCAATCATTGGATTCAG GTACAGAAGCCTAAAGCAT

TTTAATTATGACATCTGCCAAAGTTGCTTTTTTTCTGGCCGAGTTGCAAAGGGCCATAAAATGCACTACCCCATGGTAGAGTATTGCACTCCG ACTACA

TCCGGAGAAGATGTTCGCGACTTCGCCAAGGTACTAAAAAACAAATTTCGAACCAAAAGGTATTTTGCGAAGCATCCCCGAATGGGCTACCTGCCAGTGC

AGACTGTGTTAGAGGGGGACAACATGGAAAC TCCCGTTACTCTGATCAACTTCTGGCCAGTAGATTCTGC GCCTGCCTCGTCCCCCCAGCTTTCACAC

GATGATACTCATTCACGCATTGAACATTATGCTAGCAG GCTAGCAGAAATGGAAAACAGCAATGGATCTTATCTAAATGATAGCATCTCTCCTAATGAG

AGCAT AGATGATGAACATTTGTTAATCCAGCATTACTGCCAAAGTTTGAACCAGGACTCCCCCCTGAGCCAGCCTCGTAGTCCTGCCCAGATCTTGATT

TCCTTAGAGAGTGAGGAAAGAGGGGAGCTAGAGAGAATCCTAGCAGATCTTGAGGAAGAAAACAG GAATCTGCAAGCAGAATATGATCGCCTGAAGCAG

CAGCATGAGCATAAAGGCCTGTCTCCACTGCCATCTCCTCCTGAGATGATGCCCACCTCTCCTCAGAGTCCCAGGGATGCTGAGCTCATTGCTGAGGCTA

AGCTACTGCGCCAACACAAAGGACGCCTGGAAGCCAGGATGCAAATCCTGGAAGACCACAATAAACAGCTGGAGTCTCAGTTACATAGACTGAGACAGCT

CCTGGAGCAG CCCCAGGCTGAAGCTAAGGTGAATGGCACCACGGTGTCCTCTCCTTCCACCTCTCTGCAGAGGTCAGATAGCAGTCAGCCTATGCTGCT

CCGAGTGGTTGGCAGTCAAACTTCAGAATCTATGG - UNI5399 - GTGAGGAAGATCTTCTGAGTCCTCCCCAGGACACAAGCACAGGGTTAGAAGA

AGTGATGGAGCAACTCAACAACTCCTTCCCTAGTTCAAGAG - UNI9581 - GAAGAAATGCCCCCGGAAAGCCAATGAGAGAGGTTAGTGAG
Translation
DVATTYPDKKSILMYITSLFQVLPQQVSIEAIQEVEMLPRTSSKVTREEHFQLHHQMHYSQQ ITVSLAQGYEQTSSSPKPRFKSYAFTQAAYVATSDST

QSPYPSQ HLEAPRDKSLDSSLMETEVNLDSYQTALEEVLSWLLSAEDTLRAQGEISNDVEEVKEQFHAHE GFMMDLTSHQGLVGNVLQLGSQLVGKGK

LSEDEEAEVQEQMNLLNSRWECLRVASMEKQSK KLHKVLMDLQNQKLKELDDWLTKTEERTKKMEEEPFGPDLEDLKCQVQQHK VLQEDLEQEQVRVN

SLTHMVVVVDESSGDHATAALEEQLK VLGDRWANICRWTEDRWIVLQDILLKWQHFTEEQ CLFSTWLSEKEDAMKNIQTSGFKDQNEMMSSLHKIS T

LKIDLEKKKPTMEKLSSLNQDLLSALKNKSVTQKMEIWMENFAQRWDNLTQKLEKSSAQ ISQAVTTTQPSLTQTTVMETVTMVTTREQIMVKHAQEELP

PPPPQKKRQITVDSELRKR RLDVDITELHSWITRSEAVLQSSEFAVYRKEGNISDLQEKVN AIAREKAEKFRKLQDASRSAQALVEQMANE EGVNAE

SIRQASEQLNSRWTEFCQLLSERVNWLEYQTNIITFYNQLQQLEQMTTTAENLLKTQSTTLSEPTAIKSQLKICK DEVNRLSALQPQIEQLKIQSLQLK

EKGQGPMFLDADFVAFTNHFNHIFDGVRAKEKELQTI IFDTLPPMRYQETMSSIRTWIQQSESKLSVPYLSVTEYEIMEERLGKLQ ALQSSLKEQQNG

FNYLSDTVKEMAKKAPSEICQKYLSEFEEIEGHWKKLSSQLVESCQKLEEHMNKLRKFQ NHIKTLQKWMAEVDVFLKEEWPALGDAEILKKQLKQCR L

LVGDIQTIQPSLNSVNEGGQKIKSEAELEFASRLETELRELNTQWDHICRQ VYTRKEALKAGLDKTVSLQKDLSEMHEWMTQAEEEYLERDFEYKTPDE

LQTAVEEMK RAKEEALQKETKVKLLTETVNSVIAHAPPSAQEALKKELETLTTNYQWLCTRLNGKCKTLE EVWACWHELLSYLEKANKWLNEVELKLK

TMENVPAGPEEITEVLE SLENLMHHSEENPNQIRLLAQTLTDGGVMDELINEELETFNSRWRELHEE AVRKQKLLEQSIQSAQEIEKSLHLIQESLEF

IDKQLAAYITDKVDAAQMPQEAQ KIQSDLTSHEISLEEMKKHNQGKDANQRVLSQIDVAQ KKLQDVSMKFRLFQKPANFEQRLEESKMILDEVKMHLP

ALETKSVEQEVIQSQLSHCV NLYKSLSEVKSEVEMVIKTGRQIVQKKQTENPKELDERVTALKLHYNELGAK VTERKQQLEKCLKLSRKMRKEMNVLT

EWLAATDTELTKRSAVEGMPSNLDSEVAWGK ATQKEIEKQKAHLKSVTELGESLKMVLGKKETLVEDKLSLLNSNWIAVTSRVEEWLNLLL EYQKHME

TFDQNIEQITKWIIHADELLDESEKKKPQQKEDILK RLKAEMNDMRPKVDSTRDQAAKLMANRGDHCRKVVEPQISELNRRFAAISHRIKTGK ASIPL

KELEQFNSDIQKLLEPLEAEIQQGVNLKEEDFNKDM SEDNEGTVNELLQRGDNLQQRITDERKREEIKIKQQLLQTKHNALK DLRSQRRKKALEISHQ

WYQYKRQADDLLKCLDEIEKKLASLPEPRDERKLK EIDRELQKKKEELNAVRRQAEGLSENGAAMAVEPTQIQLSKRWRQIESNFAQFRRLNFAQI HT

LHEETMVVTTEDMPLDVSYVPSTYLTEISHILQALSEVDHLLNTPELCAKDFEDLFKQEESLK NIKDNLQQISGRIDIIHKKKTAALQSATSMEKVKVQ

EAVAQMDFQGEKLHRMYKERQG GRFDRSVEKWRHFHYDMKVFNQWLNEVEQFFKKTQNPENWEHAKYKWYLK ELQDGIGQRQAVVRTLNATGEEIIQQ

SSKTDVNILQEKLGSLSLRWHDICKELAERRKR RIEEQKNVLSEFQRDLNEFVLWLEEADNIAITPLGDEQQLKEQLEQVK LLAEELPLRQGILKQLN

ETGGAVLVSAPIRPEEQDKLEKKLKQTNLQWIK VSRALPEKQGELEVHLKDFRQLEEQLDHLLLWLSPIRNQLEIYNQPSQAGPFDIK EIEVTVHGKQ

ADVERLLSKGQHLYKEKPSTQPVK RKLEDLRSEWEAVNHLLRELRTKQPDRAPGLSTTGAS - UNI33489 - SASQTVTLVTQSVVTKETVISKLE

MPSSLLLEVPALADFNRAWTELTDWLSLLDRVIKSQRVMVGDLEDINEMIIKQK - UNI32672 - ATLQDLEQRRPQLEELITAAQNLKNKTSNQEA

RTIITDRI IERIQIQWDEVQEQLQNRRQQLNEMLKDSTQWLEAKEEAEQVIGQVRGKLDSWKEGPHTVDAIQKKITETK QLAKDLRQRQISVDVANDL

ALKLLRDYSADDTRKVHMITENINTSWGNIHKR RVSEQEAALEETHRLLQQFPLDLEKFLSWITEAETTANVLQDASRKEKLLEDSRGVRELMKPWQ D

LQGEIETHTDIYHNLDENGQKILRSLEGSDEAPLLQRRLDNMNFKWSELQKKSLNIR RSHLEASSDQWKRLHLSLQELLVWLQLKDDELSRQAPIGGDF

PAVQKQNDIHR AFKRELKTKEPVIMSTLETVRIFLTEQPLEGLEKLYQEPRE ELPPEERAQNVTRLLRKQAEEVNAEWDKLNLRSADWQRKIDEALER

LQELQEAADELDLKLRQAEVIKGSWQPVGDLLIDSLQDHLEKVK ALRGEIAPLKENVNRVNDLAHQLTTLGIQLSPYNLSTLEDLNTRWRLLQ - UNI

35867 - VAVEDRVRQLHEAHRDFGPASQHFLST TSVQGPWERAISPNKVPYYIN - UNI22807 - - UNI31850 - NHETQTTCWDHPKMTE

LYQSLA - UNI6101 - ADLNNVRFSAYRTAMKLRRLQKALCL - UNI22934 - LDLLSLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRL

EQEHNNLVNVPLCVDMCLNWLLNVYDT - UNI23949 - TGRTGRIRVLSFKTGIISLCKAHLEDKYRY YLFKQVASSTGFCDQRRLGLLLHDSIQI

PRQLGEVASFGGSNIEPSVRSCFQF - UNI4390 - ANNKPEIEAALFLDWMRLEPQSMVWLPVLHRVAAAETAKHQAKCNICKECPIIGFR RYRSL

KHFNYDICQSCFFSGRVAKGHKMHYPMVEYCTP TTSGEDVRDFAKVLKNKFRTKRYFAKHPRMGYLPVQTVLEGDNMET TPVTLINFWPVDSA APAS

SPQLSHDDTHSRIEHYASR RLAEMENSNGSYLNDSISPNESI IDDEHLLIQHYCQSLNQDSPLSQPRSPAQILISLESEERGELERILADLEEENR R

NLQAEYDRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKGRLEARMQILEDHNKQLESQLHRLRQLLEQ PQAEAKVNGTTVSSPSTS

LQRSDSSQPMLLRVVGSQTSESMG - UNI5399 - GEEDLLSPPQDTSTGLEEVMEQLNNSFPSSRG - UNI9581 - GRNAPGKPMREVSE
Transcript ENSMUST00000113998
Sequence
TTTGGAAAGCAACACATAGACAACCTCTTCAGTGACCTGCAGGATGGAAAACGCCTCCTAGACCTCTTGGAAGGCCTTACAGGGCAAAAACTG CCAAAA

GAAAAGGGATCTACAAGAGTTCATGCCCTGAACAATGTCAACAAGGCACTGCGGGTCTTACAGAAAAATAAT GTTGATTTAGTGAATATAGGAAGCACT

GACATAGTGGATGGAAATCATAAACTCACT
Translation
FGKQHIDNLFSDLQDGKRLLDLLEGLTGQKL PKEKGSTRVHALNNVNKALRVLQKNN VDLVNIGSTDIVDGNHKLT
Transcript ENSMUST00000114000
Sequence
CTCACTCACTTGCCCCTTACAGGACTCAGCTCTTGAAGGCAATAGCCTTTATAGAAAAAACGAATAGGAAGACTTGAAGTGCTATTTTTTTTTTTGTCAA

GGCTGCTGAAGTTTATTGGCTTCTCATCGTACCTAAGCCTCCTGGAGCAATAAAACTGGGAGAAACTTTTACCAAGATTTTTATCCCTGCCTTGATATAT

ACTTTTTCTTCCAAATGCTTTGGTGGGAAGAAGTAGAGGACTGTT - UNI30637 - ATGAAAGAGAAGATGTTCAAAAGAAAACATTCACAAAATGG

ATAAATGCACAATTTTCTAAG TTTGGAAAGCAACACATAGACAACCTCTTCAGTGACCTGCAGGATGGAAAACGCCTCCTAGACCTCTTGGAAGGCCTT

ACAGGGCAAAAACTG CCAAAAGAAAAGGGATCTACAAGAGTTCATGCCCTGAACAATGTCAACAAGGCACTGCGGGTCTTACAGAAAAATAAT GTTGA

TTTAGTGAATATAGGAAGCACTGACATAGTGGATGGAAATCATAAACTCACTCTTGGTTTGATTTGGAATATAATCCTCCACTGGCAG GTCAAAAATGT

GATGAAAACTATCATGGCTGGATTGCAGCAAACCAACAGTGAAAAGATTCTTCTGAGCTGGGTTCGACAGTCAACACGTAATTATCCACAGGTTAACGTC

ATCAACTTCACCTCTAGCTGGTCCGACGGGTTGGCTTTGAATGCTCTTATCCATAGTCACAG GCCCGACCTGTTTGATTGGAATAGTGTGGTTTCACAG

CACTCAGCCACCCAAAGACTGGAACATGCCTTCAACATTGCAAAATGCCAGTTAGGCATAGAAAAACTTCTTGATCCTGAAG ATGTTGCTACCACTTAT

CCAGACAAGAAGTCCATCTTAATGTACATCACATCACTCTTTCAAGTTTTGCCACAACAAGTGAGCATTGAAGCCATTCAAGAAGTGGAAATGTTGCCCA

GGACATCTTCAAAAGTAACTAGAGAAGAACATTTTCAATTACATCACCAGATGCATTACTCTCAACAG ATCACAGTCAGTCTAGCACAGGGCTATGAAC

AAACTTCTTCATCTCCTAAGCCTCGATTCAAGAGTTATGCCTTCACACAGGCTGCTTATGTTGCCACCTCTGATTCCACACAGAGCCCCTATCCTTCACA

G CATTTGGAAGCTCCCAGAGACAAGTCACTTGACAGTTCATTGATGGAGACGGAAGTAAATCTGGATAGTTACCAAACTGCTTTAGAAGAAGTACTTTC

ATGGCTTCTTTCTGCCGAGGATACATTGCGAGCACAAGGAGAGATTTCAAATGATGTTGAAGAAGTGAAAGAACAGTTTCATGCTCATGAG GGATTCAT

GATGGATCTGACATCTCATCAAGGACTTGTTGGTAATGTTCTACAGTTAGGAAGTCAACTAGTTGGAAAAGGGAAATTATCAGAAGATGAAGAAGCTGAA

GTGCAAGAACAAATGAATCTCCTAAATTCAAGATGGGAATGTCTCAGGGTAGCTAGCATGGAAAAACAAAGCAA ATTACACAAAGTTCTAATGGATCTC

CAGAATCAGAAATTAAAAGAACTAGATGACTGGTTAACAAAAACTGAAGAGAGAACTAAGAAAATGGAGGAAGAGCCCTTTGGACCTGATCTTGAAGATC

TAAAATGCCAAGTACAACAACATAAG GTGCTTCAAGAAGATCTAGAACAGGAGCAGGTCAGGGTCAACTCGCTCACTCACATGGTAGTAGTGGTTGATG

AATCCAGCGGTGATCATGCAACAGCTGCTTTGGAAGAACAACTTAAG GTACTGGGAGATCGATGGGCAAATATCTGCAGATGGACTGAAGACCGCTGGA

TTGTTTTACAAGATATTCTTCTAAAATGGCAGCATTTTACTGAAGAACAG TGCCTTTTTAGTACATGGCTTTCAGAAAAAGAAGATGCAATGAAGAACA

TTCAGACAAGTGGCTTTAAAGATCAAAATGAAATGATGTCAAGTCTTCACAAAATATCT ACTTTAAAAATAGATCTAGAAAAGAAAAAGCCAACCATGG

AAAAACTAAGTTCACTCAATCAAGATCTACTTTCGGCACTGAAAAATAAGTCAGTGACTCAAAAGATGGAAATCTGGATGGAAAACTTTGCACAACGTTG

GGACAATTTAACCCAAAAACTTGAAAAGAGTTCAGCACAA ATTTCACAGGCTGTCACCACCACTCAACCATCCCTAACACAGACAACTGTAATGGAAAC

GGTAACTATGGTGACCACAAGGGAACAAATCATGGTAAAACATGCCCAAGAGGAACTTCCACCACCACCTCCTCAAAAGAAGAGGCAGATAACTGTGGAT

TCTGAACTCAGGAAAAG GTTGGATGTCGATATAACTGAACTTCACAGTTGGATTACTCGTTCAGAAGCTGTATTACAGAGTTCTGAATTTGCAGTCTAT

CGAAAAGAAGGCAACATCTCAGACTTGCAAGAAAAAGTCAAT GCCATAGCACGAGAAAAAGCAGAGAAGTTCAGAAAACTGCAAGATGCCAGCAGATCA

GCTCAGGCCCTGGTGGAACAGATGGCAAATG AGGGTGTTAATGCTGAAAGTATCAGACAAGCTTCAGAACAACTGAACAGCCGGTGGACAGAATTCTGC

CAATTGCTGAGTGAGAGAGTTAACTGGCTAGAGTATCAAACCAACATCATTACCTTTTATAATCAGCTACAACAATTGGAACAGATGACAACTACTGCCG

AAAACTTGTTGAAAACCCAGTCTACCACCCTATCAGAGCCAACAGCAATTAAAAGCCAGTTAAAAATTTGTAAG GATGAAGTCAACAGATTGTCAGCTC

TTCAGCCTCAAATTGAGCAATTAAAAATTCAGAGTCTACAACTGAAAGAAAAGGGACAGGGGCCAATGTTTCTGGATGCAGACTTTGTGGCCTTTACTAA

TCATTTTAACCACATCTTTGATGGTGTGAGGGCCAAAGAGAAAGAGCTACAGACAA TTTTTGACACTTTACCACCAATGCGCTATCAGGAGACAATGAG

TAGCATCAGGACGTGGATCCAGCAGTCAGAAAGCAAACTCTCTGTACCTTATCTTAGTGTTACTGAATATGAAATAATGGAGGAGAGACTCGGGAAATTA

CAG GCTCTGCAAAGTTCTTTGAAAGAGCAACAAAATGGCTTCAACTATCTGAGTGACACTGTGAAGGAGATGGCCAAGAAAGCACCTTCAGAAATATGC

CAGAAATATCTGTCAGAATTTGAAGAGATTGAGGGGCACTGGAAGAAACTTTCCTCCCAGTTGGTGGAAAGCTGCCAAAAGCTAGAAGAACATATGAATA

AACTTCGAAAATTTCAG AATCACATAAAAACCTTACAGAAATGGATGGCTGAAGTTGATGTTTTCCTGAAAGAGGAATGGCCTGCCCTGGGGGATGCTG

AAATCCTGAAAAAACAGCTCAAACAATGCAGA CTTTTAGTTGGTGATATTCAAACAATTCAGCCCAGTTTAAATAGTGTTAATGAAGGTGGGCAGAAGA

TAAAGAGTGAAGCTGAACTTGAGTTTGCATCCAGACTGGAGACAGAACTTAGAGAGCTTAACACTCAGTGGGATCACATATGCCGCCAG GTCTACACCA

GAAAGGAAGCCTTAAAGGCAGGTTTGGATAAAACCGTAAGCCTCCAAAAAGATCTATCAGAGATGCATGAGTGGATGACACAAGCTGAAGAAGAATATCT

AGAGAGAGATTTTGAATATAAAACTCCAGATGAATTACAGACTGCTGTTGAAGAAATGAAG AGAGCTAAAGAAGAGGCACTACAAAAAGAAACTAAAGT

GAAACTCCTTACTGAGACTGTAAATAGTGTAATAGCTCACGCTCCACCCTCAGCACAAGAGGCCTTAAAAAAGGAACTTGAAACTCTGACCACCAACTAC

CAATGGCTGTGCACCAGGCTGAATGGAAAATGCAAAACTTTGGAA GAAGTTTGGGCATGTTGGCATGAGTTATTGTCATATTTAGAGAAAGCAAACAAG

TGGCTCAATGAAGTAGAATTGAAACTTAAAACCATGGAAAATGTTCCTGCAGGACCTGAGGAAATCACTGAAGTGCTAGAA TCTCTTGAAAATCTGATG

CATCATTCAGAGGAGAACCCAAATCAGATTCGTCTATTGGCACAGACTCTTACAGATGGAGGAGTCATGGATGAACTGATCAATGAGGAGCTTGAGACGT

TTAATTCTCGTTGGAGGGAACTACATGAAGAG GCTGTGAGGAAACAAAAGTTGCTTGAACAGAGTATCCAGTCTGCCCAGGAAATTGAAAAGTCCTTGC

ACTTAATTCAGGAGTCGCTTGAATTCATTGACAAGCAGTTGGCAGCTTATATCACTGACAAGGTGGATGCAGCTCAAATGCCTCAGGAAGCCCAG AAAA

TCCAATCAGATTTGACAAGTCATGAGATAAGTTTAGAAGAAATGAAGAAACATAACCAGGGGAAGGATGCCAACCAAAGGGTTCTTTCACAAATTGATGT

TGCACAG AAAAAATTACAAGATGTCTCCATGAAATTTCGATTATTCCAAAAACCAGCCAATTTTGAACAACGTCTAGAGGAAAGTAAGATGATTTTAGA

TGAAGTCAAGATGCATTTGCCTGCATTGGAAACCAAGAGTGTTGAACAGGAAGTAATTCAGTCACAACTAAGTCATTGTGTG AACTTGTATAAAAGCCT

GAGTGAAGTCAAGTCTGAAGTGGAAATGGTGATTAAAACCGGACGTCAAATTGTACAGAAAAAGCAGACAGAAAATCCCAAAGAGCTTGATGAACGAGTA

ACAGCTTTGAAATTGCATTACAATGAGTTGGGTGCGAAG GTAACAGAGAGAAAGCAACAGTTGGAGAAATGCTTGAAGTTGTCCCGTAAGATGAGAAAG

GAAATGAATGTCTTAACAGAATGGCTGGCAGCAACAGATACAGAATTGACGAAGAGATCAGCAGTTGAAGGAATGCCAAGTAATTTGGATTCTGAAGTTG

CCTGGGGAAAG GCTACTCAAAAAGAGATTGAGAAACAGAAGGCTCACTTGAAGAGTGTTACAGAATTAGGAGAGTCTTTGAAAATGGTGTTGGGCAAGA

AAGAAACCTTGGTAGAAGATAAACTGAGTCTTCTGAACAGTAACTGGATAGCTGTCACCTCCAGAGTAGAAGAATGGCTAAATCTTTTGTTG GAATACC

AGAAACACATGGAAACCTTTGATCAGAACATAGAACAAATCACAAAGTGGATCATTCATGCAGATGAACTTTTAGATGAGTCTGAAAAGAAGAAACCACA

ACAAAAGGAAGACATTCTTAAG CGTTTAAAGGCTGAAATGAATGACATGCGCCCAAAGGTGGACTCCACACGTGACCAAGCAGCAAAATTGATGGCAAA

CCGCGGTGACCACTGCAGGAAAGTAGTAGAGCCCCAAATCTCTGAGCTCAACCGTCGATTTGCAGCTATTTCTCACAGAATTAAGACTGGAAAG GCCTC

CATTCCTTTGAAGGAATTGGAGCAGTTTAACTCAGATATACAAAAATTGCTTGAACCACTGGAGGCTGAAATTCAGCAGGGGGTGAATCTGAAAGAGGAA

GACTTCAATAAAGATATG AGTGAAGACAATGAGGGTACTGTAAATGAATTGTTGCAAAGAGGAGACAACTTACAACAAAGAATCACAGATGAGAGAAAG

CGAGAGGAAATAAAGATAAAACAGCAGCTGTTACAGACAAAACATAATGCTCTCAAG GATTTGAGGTCTCAAAGAAGAAAAAAGGCCCTAGAAATTTCT

CACCAGTGGTATCAGTACAAGAGGCAGGCTGATGATCTCCTGAAATGCTTGGATGAAATTGAAAAAAAATTAGCCAGCCTACCTGAACCCAGAGATGAAA

GAAAATTAAAG GAAATTGATCGTGAATTGCAGAAGAAGAAAGAGGAGCTGAATGCAGTGCGCAGGCAAGCTGAGGGCTTGTCTGAGAATGGGGCCGCAA

TGGCAGTGGAGCCAACTCAGATCCAGCTCAGCAAGCGCTGGCGGCAAATTGAGAGCAATTTTGCTCAGTTTCGAAGACTCAACTTTGCACAAATT CACA

CTCTCCATGAAGAAACTATGGTAGTGACGACTGAAGATATGCCTTTGGATGTTTCTTATGTGCCTTCTACTTATTTGACCGAGATCAGTCATATCTTACA

AGCTCTTTCAGAAGTTGATCATCTTCTAAATACTCCTGAACTCTGTGCTAAAGATTTTGAAGATCTTTTTAAGCAAGAGGAGTCTCTTAAG AATATAAA

AGACAATTTGCAACAAATCTCAGGTCGGATTGATATTATTCACAAGAAGAAGACAGCAGCCTTGCAAAGTGCCACCTCCATGGAAAAGGTGAAAGTACAG

GAAGCCGTGGCACAGATGGATTTCCAGGGGGAAAAACTTCATAGAATGTACAAGGAACGACAAGG GCGATTCGACAGATCAGTTGAAAAATGGCGACAC

TTTCATTATGATATGAAGGTATTTAATCAATGGCTGAATGAAGTTGAACAGTTTTTCAAAAAGACACAAAATCCTGAAAACTGGGAACATGCTAAATACA

AATGGTATCTTAAG GAACTCCAGGATGGCATTGGGCAGCGTCAAGCTGTTGTCAGAACACTGAATGCAACTGGGGAAGAAATAATTCAACAGTCTTCAA

AAACAGATGTCAATATTCTACAAGAAAAATTAGGAAGCTTGAGTCTGCGGTGGCACGACATCTGCAAAGAGCTGGCAGAAAGGAGAAAGAG GATTGAAG

AACAAAAGAATGTCTTGTCAGAATTTCAAAGAGATTTAAATGAATTTGTTTTGTGGCTGGAAGAAGCAGATAACATTGCTATTACTCCACTTGGAGATGA

GCAGCAGCTAAAAGAACAACTTGAACAAGTCAAG TTACTGGCAGAAGAGTTGCCCCTGCGCCAGGGAATTCTAAAACAATTAAATGAAACAGGAGGAGC

AGTACTTGTAAGTGCTCCCATAAGGCCAGAAGAGCAAGATAAACTTGAAAAGAAGCTCAAACAGACAAATCTCCAGTGGATAAAG GTCTCCAGAGCTTT

ACCTGAGAAACAAGGAGAGCTTGAGGTTCACTTAAAAGATTTTAGGCAGCTTGAAGAGCAGCTGGATCACCTGCTTCTGTGGCTCTCTCCTATTAGAAAC

CAGTTGGAAATTTATAACCAACCAAGTCAGGCAGGACCGTTTGACATAAAG GAGATTGAAGTAACAGTTCACGGTAAACAAGCGGATGTGGAAAGGCTT

TTGTCGAAAGGGCAGCATTTGTATAAGGAAAAACCAAGCACTCAGCCAGTGAAG AGGAAGTTAGAAGATCTGAGGTCTGAGTGGGAGGCTGTAAACCAT

TTACTTCGGGAGCTGAGGACAAAGCAGCCTGACCGTGCCCCTGGACTGAGCACTACTGGAGCCT - UNI33489 - CTGCCAGTCAGACTGTTACTCT

AGTGACACAATCTGTGGTTACTAAGGAAACTGTCATCTCCAAACTAGAAATGCCATCTTCTTTGCTGTTGGAGGTACCTGCACTGGCAGACTTCAACCGA

GCTTGGACAGAACTTACAGACTGGCTGTCTCTGCTTGATCGAGTTATAAAATCACAGAGAGTGATGGTGGGTGATCTGGAAGACATCAATGAAATGATCA

TCAAACAGAAG - UNI32672 - GCAACACTGCAAGATTTGGAACAGAGACGCCCCCAATTGGAAGAACTCATTACTGCTGCCCAGAATTTGAAAAAC

AAAACCAGCAATCAAGAAGCTAGAACAATCATTACTGATCGAA TTGAAAGAATTCAGATTCAGTGGGATGAGGTTCAAGAACAGCTGCAGAACAGGAGA

CAACAGTTGAATGAAATGTTAAAGGATTCAACACAATGGCTGGAAGCTAAGGAAGAAGCCGAACAGGTCATAGGACAGGTCAGAGGCAAGCTTGACTCAT

GGAAAGAAGGTCCTCACACAGTAGATGCAATCCAAAAGAAGATCACAGAAACCAAG CAGTTGGCCAAAGACCTCCGTCAACGGCAGATAAGTGTAGACG

TGGCAAATGATTTGGCACTGAAACTTCTTCGGGACTATTCTGCTGATGATACCAGAAAAGTACACATGATAACAGAGAATATCAATACTTCTTGGGGAAA

CATTCATAAAAG AGTAAGTGAGCAAGAGGCTGCTTTGGAAGAAACTCATAGATTACTGCAGCAGTTCCCTCTGGACCTGGAGAAGTTTCTTTCCTGGAT

TACGGAAGCAGAAACAACTGCCAATGTCCTACAGGACGCTTCCCGTAAGGAGAAGCTCCTAGAAGACTCCAGGGGAGTCAGAGAGCTGATGAAACCATGG

CAA GATCTCCAAGGAGAAATTGAAACTCACACAGATATCTATCACAATCTTGATGAAAATGGCCAAAAAATCCTGAGATCCCTGGAAGGTTCGGATGAA

GCACCCCTGTTACAAAGACGTTTGGATAACATGAATTTCAAGTGGAGTGAACTTCAGAAAAAGTCTCTCAACATTAG GTCCCATTTGGAAGCAAGTTCT

GACCAGTGGAAGCGTTTGCATCTTTCTCTTCAGGAACTTCTTGTTTGGCTACAGCTGAAAGATGATGAACTGAGCCGTCAGGCACCCATCGGTGGTGATT

TCCCAGCAGTTCAGAAGCAGAATGATATACATAGG GCCTTCAAGAGGGAATTGAAAACTAAAGAACCTGTAATCATGAGTACTCTGGAGACTGTGAGAA

TATTTCTGACAGAGCAGCCTTTGGAAGGACTAGAGAAACTCTACCAGGAGCCCAGAG AACTGCCTCCTGAAGAAAGAGCTCAGAATGTCACTCGGCTCC

TACGAAAGCAGGCTGAAGAGGTCAACGCTGAATGGGACAAATTGAACCTGCGCTCAGCTGATTGGCAGAGAAAAATAGATGAAGCTCTTGAAAGACTCCA

GGAACTTCAGGAAGCTGCCGATGAACTGGACCTCAAGTTGCGCCAAGCTGAGGTGATCAAGGGATCCTGGCAGCCAGTGGGGGATCTCCTCATTGACTCT

CTGCAAGATCACCTTGAAAAAGTCAAG GCACTTCGGGGAGAAATTGCACCTCTTAAAGAGAATGTCAATCGTGTCAATGACCTTGCACATCAGCTGACC

ACACTGGGCATTCAGCTCTCACCTTATAACCTCAGCACTTTGGAAGATCTGAATACCAGATGGAGGCTTCTACAG - UNI35867 - GTGGCTGTGGA

GGACCGTGTCAGACAGCTGCATGAAGCCCACAGGGACTTTGGTCCTGCATCCCAGCACTTCCTTTCCA CTTCAGTTCAGGGTCCCTGGGAGAGAGCCAT

CTCACCAAACAAAGTGCCCTACTATATCAA - UNI22807 - - UNI31850 - CCACGAGACCCAAACCACTTGTTGGGACCACCCCAAAATGACA

GAGCTCTACCAGTCTTTAG - UNI6101 - CTGACCTGAATAATGTCAGGTTCTCCGCGTATAGGACTGCCATGAAGCTCAGAAGGCTCCAGAAGGCC

CTTTGCT - UNI22934 - TGGATCTCTTGAGCCTGTCAGCTGCATGTGATGCCCTGGACCAGCACAACCTCAAGCAAAATGACCAGCCCATGGATAT

CCTGCAGATAATTAACTGTTTGACTACAATTTATGATCGTCTGGAGCAAGAGCACAACAATCTGGTCAATGTCCCTCTCTGTGTGGATATGTGTCTCAAC

TGGCTTCTCAATGTTTATGATAC - UNI23949 - GGGACGAACAGGGAGGATCCGTGTCCTGTCTTTTAAAACTGGCATCATTTCTCTGTGTAAAGC

ACACTTGGAAGACAAGTACAGAT ACCTTTTCAAGCAAGTGGCAAGTTCAACTGGCTTTTGTGACCAGCGTAGGCTGGGTCTTCTTCTGCATGATTCTAT

TCAAATCCCAAGACAGTTGGGTGAAGTTGCTTCCTTTGGGGGCAGTAACATTGAGCCGAGTGTCAGGAGCTGCTTCCAATTT - UNI4390 - GCCAA

TAATAAACCTGAGATTGAAGCTGCTCTCTTCCTTGACTGGATGCGCCTGGAACCCCAGTCTATGGTGTGGCTGCCCGTCTTGCACAGAGTGGCTGCTGCT

GAAACTGCCAAGCATCAAGCCAAGTGTAACATCTGTAAGGAGTGTCCAATCATTGGATTCAG GTACAGAAGCCTAAAGCATTTTAATTATGACATCTGC

CAAAGTTGCTTTTTTTCTGGCCGAGTTGCAAAGGGCCATAAAATGCACTACCCCATGGTAGAGTATTGCACTCCG ACTACATCCGGAGAAGATGTTCGC

GACTTCGCCAAGGTACTAAAAAACAAATTTCGAACCAAAAGGTATTTTGCGAAGCATCCCCGAATGGGCTACCTGCCAGTGCAGACTGTGTTAGAGGGGG

ACAACATGGAAAC TCCCGTTACTCTGATCAACTTCTGGCCAGTAGATTCTGC GCCTGCCTCGTCCCCCCAGCTTTCACACGATGATACTCATTCACGC

ATTGAACATTATGCTAGCAG GCTAGCAGAAATGGAAAACAGCAATGGATCTTATCTAAATGATAGCATCTCTCCTAATGAGAGCAT AGATGATGAACA

TTTGTTAATCCAGCATTACTGCCAAAGTTTGAACCAGGACTCCCCCCTGAGCCAGCCTCGTAGTCCTGCCCAGATCTTGATTTCCTTAGAGAGTGAGGAA

AGAGGGGAGCTAGAGAGAATCCTAGCAGATCTTGAGGAAGAAAACAG GAATCTGCAAGCAGAATATGATCGCCTGAAGCAGCAGCATGAGCATAAAGGC

CTGTCTCCACTGCCATCTCCTCCTGAGATGATGCCCACCTCTCCTCAGAGTCCCAGGGATGCTGAGCTCATTGCTGAGGCTAAGCTACTGCGCCAACACA

AAGGACGCCTGGAAGCCAGGATGCAAATCCTGGAAGACCACAATAAACAGCTGGAGTCTCAGTTACATAGACTGAGACAGCTCCTGGAGCAG CCCCAGG

CTGAAGCTAAGGTGAATGGCACCACGGTGTCCTCTCCTTCCACCTCTCTGCAGAGGTCAGATAGCAGTCAGCCTATGCTGCTCCGAGTGGTTGGCAGTCA

AACTTCAGAATCTATGG - UNI5399 - GTGAGGAAGATCTTCTGAGTCCTCCCCAGGACACAAGCACAGGGTTAGAAGAAGTGATGGAGCAACTCAA

CAACTCCTTCCCTAGTTCAAGAG - UNI9581 - GAAGAAATGCCCCCGGAAAGCCAATGAGAGAG GACACAATGTAGGAAGCCTTTTCCACATGGC

AGATGATTTGGGCAGAGCGATGGAGTCCTTAGTTTCAGTCATGACAGATGAAGAAGGAGCAGAATAAATGTTTTACAACTCCTGATTCCCGCATGGTTTT

TATAATATTCGTACAACAAAGAGGATTAGACAGTAAGAGTTTACAAGAAATAAAATCTATATTTTTGTGAAGGGTAGTGGTACTATACTGTAGATTTCAG

TAGTTTCTAAGTCTGTTATTGTTTTGTTAACAATGGCAGGTTTTACACGTCTATGCAATTGTACAAAAAAGTTAAAAGAAAACATGTAAAATCTTGATAG

CTAAATAACTTGCCATTTCTTTATATGGAACGCATTTTGGGTTGTTTAAAAATTTATAACAGTTATAAAGAAAGATTGTAAACTAAAGTGTGCTTTATAA

AAAAAGTTGTTTATAAAAACCCCTAAACAAACACACACGCACACACACACACACACACACACACACACACACACGCACACATACATGCACGAACCCACCA

CACACACACACACACACACACACACTGAGGCAGCACATTGTTTTGCATTACTTTAGCGTGGTATTCATATGGAATTCATGACGTTTTTTTATTTTCTTGC

ATACGAACCCCACCAAATGACTGCTTCATATTGCTCTTTTGAGAATTGTTGACTGAGTGGGGCTGGCTATGGGCTTTCATTTTATACATCTATATGTCTA

CAAGTATATAAATACTATAGGTATATAGATAAATAGATATGAAGTTACTTCTTCAAATGTTCTTGCCACTTCCTAATGGAAATTGCTTCTAGTCATCTGG

GCTTATCTGCTTGGGCAAGAGTGAATTTTCCCTGGAGCCCAAAGCCAGGAGACTACCGCCACACTAAAATATTGTCTAGGGCTCCAGATGTTTCTAGTTT

TAAACTTTCCACTGAGAGCTAGAGGATTCATTTTTTTCAAGGAACATGCGAATGAATACACAGGACTTACTATCATAGTAATTTGTTGGCTGATATATTC

AACTTCCTACTGTTGGGTTATATTTAATGATGTTTCTGCAATAGAACATCAGATGACATTTTTAACTCCCAGACAGTAGGAGGAAGATGGTAGGAGCTAA

AGGTTGCGGCTCCTCAGTCAATTTATATGAGGGGAGCAACAACTCTGTAAAAGAATGGATGAATATTTACAACTATACATATAAACATCTCTATAATTAC

AACTAAATTGTTCTGCCCTCTTCATAAACTCAACCTGAAGTGGGTGGTTTTGTTGTTGTTGTTGTTGTTGTTGTTGATGATGATGATGAATTTTAGATTT

TAGATTTTTTGGGTTTTTTTTTCTTCATTGTGATGATTTTTTTTTTTAATGCTGCAAGACTTAGGATTACTGTTAAGAAAGTAACCCAATCACATTGTGA

CCCTGGTGAATATCAGTCCAGAAGCCCATGAACTGCATTTGTCTCCTTTGCATTGGTTTCCCTGCAAGTAACTCCACACAGGATTGTGGGTGAGAAGGCA

CAGTGGTTGGAAAGTTTTGAGAGCAAAAGCGTCTCCAAACTCTCTGGTCTAGTTGACGGGCTGAAATGTCTAAACAAATGCAAGTCATTGAACCAGGAGA

AAAAGTGCAACAGAAAGCTAAGGACTGCTAGGAAGAGCTTTACTCCTCTCATGCCAGTTTCTTCTTCTTAGCATTTAAAGAGCATTCTCTCAATAGAAAT

CACTGTCCTATCATTTTGCAAATCTGTTACCTCTAACGTCAAGTGTAATTAACTTCTAGCGAGTGGGTTTTGTCCATTATTAATTGTAATTAACATCAAA

CACAGCTTCTCATGCTATTTCTACCTCACTTTGGTTTTGGGGTGTTTCTAGTAATTGTGCACACCTAATTTCACAACTTCACCACTTGTCTGTTGTGTGG

ACACCAGTTTCCTTTTTTCATTTATAATTTCCAAAAGAAAACCCAAAGCTCTAAGATAACAAATTGAAATTTGGTTCTGGTCTTGCTTTCTCTCTCTCTC

TCTCCTTTATGTGGCACTGGGCATTTTCTTTATCCAAGGATTTGTTTTCACCAAGATTTAAAACAAGGGGTTCCTTTCCTACTAAGAAGTTTTAAGTTTC

ATTCTAAAATCCAAGGTAGATAGAGTGCATAGTTTTGTTTTAATCTTTTCGTTTTATCTTTTAGATATTAGTTCTGGAGTGAATCTATCAAAATATTTGA

ATAAAAACTGAGAGCTTTATTGCTGATTTTAAGCATAATTTGGACATCATTTCATGTTCTTTATAACCATCAAGTATTAAAGTGTAAATCATAATCAGTG

TAACTGAAGCATAATCATCACATGGCATGTATCATCATTGTCTCCAGGTACTGGACTCTTACTTGAGTATCATAATAGATTGTGTTTTAACACCAACACT

GTAACATTTACTAATTATTTTTTTAAACTTCAGTTTTACTGCATTTTCACAACATATCAGATTTCACCAAATATATGCCTTACTATTGTATTATATTACT

GCTTTACTGTGTATCTCAATAAAGCACGCAGTTATGTTACAAAAAAATACCTATTGGTTGGACTGTAGTAGTTTATTTTTTTCTTTTAAGTTACTTCGTT

TGTCTTTCTATGCATTTTTGTTTTGGGGTGGAGGCTGTCACTGTGTTACATGGGTACCCTTGAACTCCTAGATTTAAGTGTTCCATCCTGCCTTAGCCTC

TGAAATAGCTGGGACTGCAAGGTCATGCCCTGCTTTATGTATTCAAGAACTCTTAAGAATATACAGATCATATATATGTATATATATACACACATATAAT

GATATAAATATGCATTTAACTACACATGTGTGTCAATGACATAAAAGCAGAAAGGGGACTATTTGAGGGAGCAAGGGGACTGTTGAGGGGAGAGGATAGA

TGAATGATCATAGTGAGAAGGAATTTGAGCAAGGCACAATGATATGTATGCATGACAGTGTCATAATGAAAGCCATTATTTCTGACACTAAAACAAATGA

TTTATATTTTAAAAATGCATGACAGTGCAGACCCTGTAATATCGACACTCAAAAGACAAAGGCGGGAGGGTCACCTTAAAGTCAAGACCAGCCAAGTCTA

TGTAAGGAGTCTTGGGCCAGCCACAGGGAGATCCTGTCCCAAGAAAACAAAACCCAGCACGAGTATATGTACTTGATGCATATGCTGTGTGCTAATTCCA

GTGTTCAGGTGCCATGGGGCATTTATGATTGTCCAAAGACAACAATTGTGCCATTCCTCATCGTTCGCCTTATGTGAGACAGGGTCTCTGCTTTTCACCA

CTTTGGATGCCAGGTGGGGTGTCCTGCAAGCTTTTAGGGATTCTTCTGCCTTTCATCTCACCATAGGAGAACTGTAATTGCAGTCATGGAGTACCTACAC

CAGGCTTTCCATGTATCTTGGGGATTTAAACTCAGGGAATCATGCTTGTATGGCAAGCACTTTACCCACTGAGCCATCTCATCCGTGGAGATATTTCCCT

TTCAATTTACAAAATGTTTTCTCATTCTCTCCTTCCAAATCATACATATTCTAATATGTATAAGCTATACTATTACAGGGCATTCTTGTGGCATGTGTTG

CTCTTCAGTCTTTGCCATAAAAAGTAGCAAACAATATTAC
Translation
MLWWEEVEDCY - UNI30637 - YEREDVQKKTFTKWINAQFSK FGKQHIDNLFSDLQDGKRLLDLLEGLTGQKL PKEKGSTRVHALNNVNKALRV

LQKNN VDLVNIGSTDIVDGNHKLTLGLIWNIILHWQ VKNVMKTIMAGLQQTNSEKILLSWVRQSTRNYPQVNVINFTSSWSDGLALNALIHSHR RPD

LFDWNSVVSQHSATQRLEHAFNIAKCQLGIEKLLDPED DVATTYPDKKSILMYITSLFQVLPQQVSIEAIQEVEMLPRTSSKVTREEHFQLHHQMHYSQ

Q ITVSLAQGYEQTSSSPKPRFKSYAFTQAAYVATSDSTQSPYPSQ HLEAPRDKSLDSSLMETEVNLDSYQTALEEVLSWLLSAEDTLRAQGEISNDVE

EVKEQFHAHE GFMMDLTSHQGLVGNVLQLGSQLVGKGKLSEDEEAEVQEQMNLLNSRWECLRVASMEKQSK KLHKVLMDLQNQKLKELDDWLTKTEER

TKKMEEEPFGPDLEDLKCQVQQHK VLQEDLEQEQVRVNSLTHMVVVVDESSGDHATAALEEQLK VLGDRWANICRWTEDRWIVLQDILLKWQHFTEEQ

 CLFSTWLSEKEDAMKNIQTSGFKDQNEMMSSLHKIS TLKIDLEKKKPTMEKLSSLNQDLLSALKNKSVTQKMEIWMENFAQRWDNLTQKLEKSSAQ I

SQAVTTTQPSLTQTTVMETVTMVTTREQIMVKHAQEELPPPPPQKKRQITVDSELRKR RLDVDITELHSWITRSEAVLQSSEFAVYRKEGNISDLQEKV

N AIAREKAEKFRKLQDASRSAQALVEQMANE EGVNAESIRQASEQLNSRWTEFCQLLSERVNWLEYQTNIITFYNQLQQLEQMTTTAENLLKTQSTTL

SEPTAIKSQLKICK DEVNRLSALQPQIEQLKIQSLQLKEKGQGPMFLDADFVAFTNHFNHIFDGVRAKEKELQTI IFDTLPPMRYQETMSSIRTWIQQ

SESKLSVPYLSVTEYEIMEERLGKLQ ALQSSLKEQQNGFNYLSDTVKEMAKKAPSEICQKYLSEFEEIEGHWKKLSSQLVESCQKLEEHMNKLRKFQ N

HIKTLQKWMAEVDVFLKEEWPALGDAEILKKQLKQCR LLVGDIQTIQPSLNSVNEGGQKIKSEAELEFASRLETELRELNTQWDHICRQ VYTRKEALK

AGLDKTVSLQKDLSEMHEWMTQAEEEYLERDFEYKTPDELQTAVEEMK RAKEEALQKETKVKLLTETVNSVIAHAPPSAQEALKKELETLTTNYQWLCT

RLNGKCKTLE EVWACWHELLSYLEKANKWLNEVELKLKTMENVPAGPEEITEVLE SLENLMHHSEENPNQIRLLAQTLTDGGVMDELINEELETFNSR

WRELHEE AVRKQKLLEQSIQSAQEIEKSLHLIQESLEFIDKQLAAYITDKVDAAQMPQEAQ KIQSDLTSHEISLEEMKKHNQGKDANQRVLSQIDVAQ

 KKLQDVSMKFRLFQKPANFEQRLEESKMILDEVKMHLPALETKSVEQEVIQSQLSHCV NLYKSLSEVKSEVEMVIKTGRQIVQKKQTENPKELDERVT

ALKLHYNELGAK VTERKQQLEKCLKLSRKMRKEMNVLTEWLAATDTELTKRSAVEGMPSNLDSEVAWGK ATQKEIEKQKAHLKSVTELGESLKMVLGK

KETLVEDKLSLLNSNWIAVTSRVEEWLNLLL EYQKHMETFDQNIEQITKWIIHADELLDESEKKKPQQKEDILK RLKAEMNDMRPKVDSTRDQAAKLM

ANRGDHCRKVVEPQISELNRRFAAISHRIKTGK ASIPLKELEQFNSDIQKLLEPLEAEIQQGVNLKEEDFNKDM SEDNEGTVNELLQRGDNLQQRITD

ERKREEIKIKQQLLQTKHNALK DLRSQRRKKALEISHQWYQYKRQADDLLKCLDEIEKKLASLPEPRDERKLK EIDRELQKKKEELNAVRRQAEGLSE

NGAAMAVEPTQIQLSKRWRQIESNFAQFRRLNFAQI HTLHEETMVVTTEDMPLDVSYVPSTYLTEISHILQALSEVDHLLNTPELCAKDFEDLFKQEES

LK NIKDNLQQISGRIDIIHKKKTAALQSATSMEKVKVQEAVAQMDFQGEKLHRMYKERQG GRFDRSVEKWRHFHYDMKVFNQWLNEVEQFFKKTQNPE

NWEHAKYKWYLK ELQDGIGQRQAVVRTLNATGEEIIQQSSKTDVNILQEKLGSLSLRWHDICKELAERRKR RIEEQKNVLSEFQRDLNEFVLWLEEAD

NIAITPLGDEQQLKEQLEQVK LLAEELPLRQGILKQLNETGGAVLVSAPIRPEEQDKLEKKLKQTNLQWIK VSRALPEKQGELEVHLKDFRQLEEQLD

HLLLWLSPIRNQLEIYNQPSQAGPFDIK EIEVTVHGKQADVERLLSKGQHLYKEKPSTQPVK RKLEDLRSEWEAVNHLLRELRTKQPDRAPGLSTTGA

S - UNI33489 - SASQTVTLVTQSVVTKETVISKLEMPSSLLLEVPALADFNRAWTELTDWLSLLDRVIKSQRVMVGDLEDINEMIIKQK - UNI3

2672 - ATLQDLEQRRPQLEELITAAQNLKNKTSNQEARTIITDRI IERIQIQWDEVQEQLQNRRQQLNEMLKDSTQWLEAKEEAEQVIGQVRGKLDS

WKEGPHTVDAIQKKITETK QLAKDLRQRQISVDVANDLALKLLRDYSADDTRKVHMITENINTSWGNIHKR RVSEQEAALEETHRLLQQFPLDLEKFL

SWITEAETTANVLQDASRKEKLLEDSRGVRELMKPWQ DLQGEIETHTDIYHNLDENGQKILRSLEGSDEAPLLQRRLDNMNFKWSELQKKSLNIR RSH

LEASSDQWKRLHLSLQELLVWLQLKDDELSRQAPIGGDFPAVQKQNDIHR AFKRELKTKEPVIMSTLETVRIFLTEQPLEGLEKLYQEPRE ELPPEER

AQNVTRLLRKQAEEVNAEWDKLNLRSADWQRKIDEALERLQELQEAADELDLKLRQAEVIKGSWQPVGDLLIDSLQDHLEKVK ALRGEIAPLKENVNRV

NDLAHQLTTLGIQLSPYNLSTLEDLNTRWRLLQ - UNI35867 - VAVEDRVRQLHEAHRDFGPASQHFLST TSVQGPWERAISPNKVPYYIN - U

NI22807 - - UNI31850 - NHETQTTCWDHPKMTELYQSLA - UNI6101 - ADLNNVRFSAYRTAMKLRRLQKALCL - UNI22934 - LD

LLSLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHNNLVNVPLCVDMCLNWLLNVYDT - UNI23949 - TGRTGRIRVLSFKTGIISLC

KAHLEDKYRY YLFKQVASSTGFCDQRRLGLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQF - UNI4390 - ANNKPEIEAALFLDWMRLEPQSM

VWLPVLHRVAAAETAKHQAKCNICKECPIIGFR RYRSLKHFNYDICQSCFFSGRVAKGHKMHYPMVEYCTP TTSGEDVRDFAKVLKNKFRTKRYFAKH

PRMGYLPVQTVLEGDNMET TPVTLINFWPVDSA APASSPQLSHDDTHSRIEHYASR RLAEMENSNGSYLNDSISPNESI IDDEHLLIQHYCQSLNQ

DSPLSQPRSPAQILISLESEERGELERILADLEEENR RNLQAEYDRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKGRLEARMQIL

EDHNKQLESQLHRLRQLLEQ PQAEAKVNGTTVSSPSTSLQRSDSSQPMLLRVVGSQTSESMG - UNI5399 - GEEDLLSPPQDTSTGLEEVMEQLN

NSFPSSRG - UNI9581 - GRNAPGKPMRE DTM

For any suggestions or comments, please send an email to unitrap@crg.es