Tools for the biologist enabling optimized use of gene trap clones

  Homepage | Blast Search | GO Search | Advanced Search | About

Gene ENSMUSG00000038664 (Herc1)
Chromosomal location
Chr 9: 66198331 - 66356582 (+)
Description
hect (homologous to the E6-AP (UBE3A) carboxyl terminus) domain and RCC1 (CHC1)-like domain (RLD) 1 Gene [Source:MGI (curated);Acc:Herc1-001]
UniGene
Mm.244179 
MGI
MGI:2384589 
Uniprot/SPTREMBL
Q99KS8 Q8BNF7 Q3UF38 Q4VBD0 Q8BNK4 Q8BR49 Q8CBQ0 Q9CS23 
Human Ortholog
ENSG00000103657 (HERC1)
Omim not available
UniTrap UNI16525
Vector Insertion
Chr 9: 66198429 - 66219618
Public Clones (sanger) IST11408F8 (tigm) IST14147B11 (tigm) IST10469E11 (tigm)
Private Clones OST459170 (lexicon) OST443596 (lexicon) OST325604 (lexicon) OST223193 (lexicon)
OST63320 (lexicon)
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI4010
Vector Insertion
Chr 9: 66219592 - 66220462
Public Clones AR0038 (sanger) AQ0503 (sanger)
Private Clones not available
Severity of mutation (?) Insertion after 0% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI35616
Vector Insertion
Chr 9: 66224220 - 66232799
Public Clones IST11795A3 (tigm)
Private Clones not available
Severity of mutation (?) Insertion after 28% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI6246
Vector Insertion
Chr 9: 66258997 - 66261881
Public Clones CSD016 (baygenomics)
Private Clones not available
Severity of mutation (?) Insertion after 26% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI7800
Vector Insertion
Chr 9: 66309728 - 66310635
Public Clones RRN270 (baygenomics)
Private Clones not available
Severity of mutation (?) Insertion after 61% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI4352
Vector Insertion
Chr 9: 66332845 - 66333873
Public Clones AF0417 (sanger)
Private Clones not available
Severity of mutation (?) Insertion after 84% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI30919
Vector Insertion
Chr 9: 66346564 - 66348982
Public Clones IST13645D6 (tigm) IST10855F12 (tigm) IST14818B7 (tigm) IST14517A2 (tigm)
IST10197H4 (tigm) IST14577C5 (tigm) IST12425D11 (tigm) IST11427F2 (tigm)
IST14577D5 (tigm) IST14941B6 (tigm) IST12804G8 (tigm) IST14588G8 (tigm)
IST10315B5 (tigm)
Private Clones not available
Severity of mutation (?) Insertion after 94% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI29253
Vector Insertion
Chr 9: 66349890 - 66352394
Public Clones not available
Private Clones OST1829 (lexicon)
Severity of mutation (?) Insertion after 97% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

UniTrap UNI29195
Vector Insertion
Chr 9: 66352701 - 66355945
Public Clones not available
Private Clones OST6156 (lexicon)
Severity of mutation (?) Insertion after 99% of polypeptide chain
Proposed experimental design for vector insertion validation (?)

Show all transcripts and translations:
Transcript ENSMUST00000096525
Sequence
ATGGCGACTATGGTTCCACCAGTGAAACTGAAATGGCTTGAACATCTGAATAGCTCCTGGATCACAGAGGACAGTGAATCTATTGCTACAAGAGAGGGAG

TTACCGTTTTGTATTCTAAACTGATCAGCAATAAGGAAGTAGTACCTTTGCCTCAACAGGTTTTATGCCTCAAAGGACCACAGTTGCCTGATTTTGAACG

TGAGTCTCTTTCAAGTGATGAGCAGGACCATTATTTGGATGCCCTTCTTAGCAGCCAGCTAGCACTAGCAAAGATGGTATGTTCAGATTCTCCATTTGCT

GGGGCGCTAAGAAAACGACTGCTTGTACTCCAACGTGTCTTTTATGCACTTTCTAATAAGTACCATGACAAAGGCAAAGTGAAACAGCAGCAGCATTCTC

CGGAGAGCAGCTCTGGTTCAGCAGATGTCCATTCTGTCAGTGAACGCCCCCGGTCAAGCACTGATGCACTTATAGAAATGGGTGTCCGAACTGGTCTAAG

TTTGTTATTTGCACTTCTGAGACAGAGTTGGATGATGCCTGTGTCGGGACCTGGCCTCAGTCTCTGCAACGACGTTATTCATACTGCAATTGAAGTTGTG

AGCTCTTTGCCTCCATTGTCTTTAGCAAATGAAAGCAAGATTCCTCCTATGGGCTTGGACTGCTTATCACAAGTCACAACATTTCTTAAAGGAGTAACTA

TTCCCAATTCTGGGGCAGACACTTTAGGTCGTCGATTAGCTTCTGAGTTGCTGCTTGGCTTAGCAGCTCAGAGAGGCTCTTTGAGATACCTTCTTGAATG

GATAGAAATGGCTTTGGGGGCTTCAGCAGTTGTATATACTATG - UNI4010 - GAGACTTACAACCATCTGTAA
Translation
MATMVPPVKLKWLEHLNSSWITEDSESIATREGVTVLYSKLISNKEVVPLPQQVLCLKGPQLPDFERESLSSDEQDHYLDALLSSQLALAKMVCSDSPFA

GALRKRLLVLQRVFYALSNKYHDKGKVKQQQHSPESSSGSADVHSVSERPRSSTDALIEMGVRTGLSLLFALLRQSWMMPVSGPGLSLCNDVIHTAIEVV

SSLPPLSLANESKIPPMGLDCLSQVTTFLKGVTIPNSGADTLGRRLASELLLGLAAQRGSLRYLLEWIEMALGASAVVYTM - UNI4010 - ETYNHL

Transcript ENSMUST00000043618
Sequence
GCAAAAATATAAAGGACGCGGCTAGCGCGGCCGCTGGCCAGCCGGGTGCCGGGTGCTGGAGCCGCACCCCATCGCCGCCAGCTCCCGACCCACGTACG -

 UNI4010 - - UNI16525 - AACTAATGGCTGAAGAGTAAATCAACATGGCGACTATGGTTCCACCAGTGAAACTGAAATGGCTTGAACATCTGAA

TAGCTCCTGGATCACAGAGGACAGTGAATCTATTGCTACAAGAGAGGGAGTTACCGTTTTGTATTCTAAACTGATCAGCAATAAGGAAGTAGTACCTTTG

CCTCAACAGGTTTTATGCCTCAAAGGACCACAGTTGCCTGATTTTGAACGTGAGTCTCTTTCAAGTGATGAGCAGGACCATTATTTGGATGCCCTTCTTA

GCAGCCAGCTAGCACTAGCAAAGATGGTATGTTCAGATTCTCCATTTGCTGGGGCGCTAAGAAAACGACTGCTTGTACTCCAACGTGTCTTTTATGCACT

TTCTAATAAGTACCATGACAAAGGCAAAGTGAAACAGCAGCAGCATTCTCCGGAGAGCAGCTCTGGTTCAGCAGATGTCCATTCTGTCAGTGAACGCCCC

CGGTCAAGCACTGATGCACTTATAGAAATGGGTGTCCGAACTGGTCTAAGTTTGTTATTTGCACTTCTGAGACAGAGTTGGATGATGCCTGTGTCGGGAC

CTGGCCTCAGTCTCTGCAACGACGTTATTCATACTGCAATTGAAGTTGTGAGCTCTTTGCCTCCATTGTCTTTAGCAAATGAAAGCAAGATTCCTCCTAT

GGGCTTGGACTGCTTATCACAAGTCACAACATTTCTTAAAGGAGTAACTATTCCCAATTCTGGGGCAGACACTTTAGGTCGTCGATTAGCTTCTGAGTTG

CTGCTTGGCTTAGCAGCTCAGAGAGGCTCTTTGAGATACCTTCTTGAATGGATAGAAATGGCTTTGGGGGCTTCAGCAGTTGTATATACTATGGAGAAAA

ACAAACTACTGTCAAGCCAGGAAGGAATGATCAGCTTTGACTGCTTTATGGCTATATTAATGCAGATGAGGCGATCTTTG GGTTCATCTGCTGATCGGA

GTCAGTGGAGAGAACCAACTAGAACATCTGAAGGCTTATGTTCACTCTATGAGGCAGCATTATGTCTTTTTGAAGAG - UNI35616 - GTTTGCAGA

ATGGCTTCTGACTATTCAAGAACATGTGCTAGCCCAGATAGCATTCAGACTGGTGATGCTCCCATTGTTTCTGAAACCTGTGAGGTATATGTTTGGGGCA

GCAACAGCAGCCATCAGTTGGTAGAAGGCACACAGGAGAAAATACTACAACCCAAACTGGCTCCTAGTTTCTCTGATGCTCAGACC ATTGAAGCTGGAC

AGTATTGCACTTTTGTCATTTCTACAGATGGCTCTGTCAGAGCTTGTGGTAAAGGCAGCTATGGGAGACTGGGCCTCGGAGATTCCAATAATCAGTCTAC

CTTAAAAAAGTTAACTTTTGAGCCTCACCGGTCTATTAAGAAAGTTTCATCATCTAAAGGCTCTGATGGTCACACTTTAGCTTTTACTACAGAAGGTGAA

GTCTTCAGCTGGGGAGATGGTGACTATGGGAAACTAGGACATGGGAATAGTTCAACCCAGAAATACCCCAAGCTTATCCAGGGCCCACTACAGGGAAAG 

GTAGTAGTTTGTGTATCAGCTGGATACAGACATAGTGCTGCTGTCACAGAGGATGGTGAATTATATACTTGGGGTGAAGGAGACTTCGGAAGATTAG GT

CATGGTGACAGTAACAGCCGTAACATTCCAACATTAGTAAAAGACATTAGCAATGTAGGAGAGGTTTCTTGTGGAAGTTCACATACGATTGCTTTGTCCA

AAGATGGAAGAACTGTGTGGTCTTTTGGAGGAGGTGACAATG GCAAACTTGGCCATGGTGATACCAACAGAGTATATAAACCTAAAGTTATTGAAGCTT

TACAAGGAATGTTTATTCGCAAAGTCTGTGCTGGGAGCCAATCTTCACTTGCTTTGACATCAACAGGGCAG GTTTATGCTTGGGGCTGTGGGGCTTGTC

TAGGTTGTGGTTCTTCAGAAGCTACTGCTTTGAGACCCAAGCTCATTGAAGAATTGGCTGCCACAAGAATAGTTGATATTTCAATTGGAGATAGTCATTG

TTTGTCTCTTTCTCATG ATAATGAAGTTTATGCCTGGGGCAATAACTCCATGGGACAATGTGGCCAGGGAAATTCAACAGGTCCTATTACTAAACCAAA

GAAAGTGAGTGGCTTAGATGGCATAGCTATTCAGCAGATCTCAGCTGGAACATCACATAGTTTGGCATGGACTGCTCTCCCTAGGGACAG ACAAGTTGT

TGCATGGCACAGGCCTTATTGTGTAGATCTTGAAGAGAGTACCTTCTCACATCTGCGATCTTTTCTTGAGAGATACTGTGATAAAATAAACAGTGAGATT

CCCCCACTGCCTTTCCCTTCATCAAG GGAACACCATAATTTTCTTAAGCTGTGCCTGAAGTTGCTTTCAAATCACCTTGCTCTGGCCCTGGCGGGAGGG

GTAGCTACCAGTATTCTTGGGAGACAGGCAGGTCCACTTCGAAATTTGCTCTTCAGGCTGATGGATTCAACTGTCCCAGATGAAATCCAAGAG GTGGTA

ATTGAAACACTCTCAGTTGGAGCAACCATGCTGTTACCTCCATTAAGAGAACGGATGGAATTACTTCATTCTCTTTTACCCCAAGGACCTGATAGATGGG

AAAGCTTATCCAAAGGACAG AGAATGCAACTGGATATAATTCTGACAAGTTTACAAGATCATACCCATGTAGCCTCCCTTCTTGGCTATAGCTCACCTT

CCGATGCTGCTGACCTTTCGACTGTGTGCATGGGATATGGAAACCTGTCAGACCAACCATATGGTTCTCAGATCTGCCATCCAGACACTCATCTGGCCGA

AATTTTAATGAAGACTCTCTTAAGAAATTTAGGGTTTTATACG GATCAAGCATTTGGAGAGCTGGAAAAGAATAGTGATAAATATCTTCTTGGAACATC

ATCTTCAGAGAACAGTCAGCCTGCTCATCTTCATGAACTACTGTGTTCATTACAGAAACAACTGCTTGCTTTTTGCCACATTAATAATGTTACTGAG AA

CTCAAGCAGTGTGGCATTGCTTCATAAACATCTTCAGCTTTTGCTGCCTCATGCCACAGATATTTATTCACGTTCCGCAAATTTGCTTAAAGAAAGTCCT

TGGAATGGCAGTGTTGGAGAAAAATTGAGAG ATGTGATCTATGTGTCAGCTGCAGGCAGTATGCTCTGCCAGATTGTTAACTCCCTACTGTTACTCCCG

GTGTCAGTGGCTCGTCCTTTATTGAGTTACCTCCTCGACCTCTTGCCACCTCTTGATTGCCTTAATAGACTCCTGCCAGCTGCTGCTCTTTTAGAAGACC

AAGAATTACAGTGGCCTCTTCATG GGGGGCCAGAAGTAATTGACCCAGCTGGTGTGCCATTACCTCAACCAGCTCAATCTTGGGTATGGCTTGTGGATC

TGGAAAGAACAATTGCTCTCCTCATTGGCCGATGTCTTGGTGGCATGCTTCAAGGCTCCCCCGTGTCTCCAGAGGAACAGGATACTGCATATTGGATGAA

AACACCACTTTTCAGTGATGGTGTGGAAATGGACACCCCTCAGTTGG ATAAATGCATGAGCTGCCTACTAGAAGTAGCTCTTTCAGGA
Translation
 - UNI4010 - - UNI16525 - MATMVPPVKLKWLEHLNSSWITEDSESIATREGVTVLYSKLISNKEVVPLPQQVLCLKGPQLPDFERESLSSDE

QDHYLDALLSSQLALAKMVCSDSPFAGALRKRLLVLQRVFYALSNKYHDKGKVKQQQHSPESSSGSADVHSVSERPRSSTDALIEMGVRTGLSLLFALLR

QSWMMPVSGPGLSLCNDVIHTAIEVVSSLPPLSLANESKIPPMGLDCLSQVTTFLKGVTIPNSGADTLGRRLASELLLGLAAQRGSLRYLLEWIEMALGA

SAVVYTMEKNKLLSSQEGMISFDCFMAILMQMRRSL GSSADRSQWREPTRTSEGLCSLYEAALCLFEE - UNI35616 - VCRMASDYSRTCASPDS

IQTGDAPIVSETCEVYVWGSNSSHQLVEGTQEKILQPKLAPSFSDAQT IEAGQYCTFVISTDGSVRACGKGSYGRLGLGDSNNQSTLKKLTFEPHRSIK

KVSSSKGSDGHTLAFTTEGEVFSWGDGDYGKLGHGNSSTQKYPKLIQGPLQGK VVVCVSAGYRHSAAVTEDGELYTWGEGDFGRLG GHGDSNSRNIPT

LVKDISNVGEVSCGSSHTIALSKDGRTVWSFGGGDNG GKLGHGDTNRVYKPKVIEALQGMFIRKVCAGSQSSLALTSTGQ VYAWGCGACLGCGSSEAT

ALRPKLIEELAATRIVDISIGDSHCLSLSHD DNEVYAWGNNSMGQCGQGNSTGPITKPKKVSGLDGIAIQQISAGTSHSLAWTALPRDR RQVVAWHRP

YCVDLEESTFSHLRSFLERYCDKINSEIPPLPFPSSR REHHNFLKLCLKLLSNHLALALAGGVATSILGRQAGPLRNLLFRLMDSTVPDEIQE VVIET

LSVGATMLLPPLRERMELLHSLLPQGPDRWESLSKGQ RMQLDIILTSLQDHTHVASLLGYSSPSDAADLSTVCMGYGNLSDQPYGSQICHPDTHLAEIL

MKTLLRNLGFYT DQAFGELEKNSDKYLLGTSSSENSQPAHLHELLCSLQKQLLAFCHINNVTE NSSSVALLHKHLQLLLPHATDIYSRSANLLKESPW

NGSVGEKLRD DVIYVSAAGSMLCQIVNSLLLLPVSVARPLLSYLLDLLPPLDCLNRLLPAAALLEDQELQWPLHG GGPEVIDPAGVPLPQPAQSWVWL

VDLERTIALLIGRCLGGMLQGSPVSPEEQDTAYWMKTPLFSDGVEMDTPQLD DKCMSCLLEVALSG
Transcript ENSMUST00000042824
Sequence
AAAAATATAAAGGACGCGGCTAGCGCGGCCGCTGGCCAGCCGGGTGCCGGGTGCTGGAGCCGCACCCCATCGCCGCCAGCTCCCGACCCACGTACG - U

NI4010 - - UNI16525 - AACTAATGGCTGAAGAGTAAATCAACATGGCGACTATGGTTCCACCAGTGAAACTGAAATGGCTTGAACATCTGAATA

GCTCCTGGATCACAGAGGACAGTGAATCTATTGCTACAAGAGAGGGAGTTACCGTTTTGTATTCTAAACTGATCAGCAATAAGGAAGTAGTACCTTTGCC

TCAACAGGTTTTATGCCTCAAAGGACCACAGTTGCCTGATTTTGAACGTGAGTCTCTTTCAAGTGATGAGCAGGACCATTATTTGGATGCCCTTCTTAGC

AGCCAGCTAGCACTAGCAAAGATGGTATGTTCAGATTCTCCATTTGCTGGGGCGCTAAGAAAACGACTGCTTGTACTCCAACGTGTCTTTTATGCACTTT

CTAATAAGTACCATGACAAAGGCAAAGTGAAACAGCAGCAGCATTCTCCGGAGAGCAGCTCTGGTTCAGCAGATGTCCATTCTGTCAGTGAACGCCCCCG

GTCAAGCACTGATGCACTTATAGAAATGGGTGTCCGAACTGGTCTAAGTTTGTTATTTGCACTTCTGAGACAGAGTTGGATGATGCCTGTGTCGGGACCT

GGCCTCAGTCTCTGCAACGACGTTATTCATACTGCAATTGAAGTTGTGAGCTCTTTGCCTCCATTGTCTTTAGCAAATGAAAGCAAGATTCCTCCTATGG

GCTTGGACTGCTTATCACAAGTCACAACATTTCTTAAAGGAGTAACTATTCCCAATTCTGGGGCAGACACTTTAGGTCGTCGATTAGCTTCTGAGTTGCT

GCTTGGCTTAGCAGCTCAGAGAGGCTCTTTGAGATACCTTCTTGAATGGATAGAAATGGCTTTGGGGGCTTCAGCAGTTGTATATACTATGGAGAAAAAC

AAACTACTGTCAAGCCAGGAAGGAATGATCAGCTTTGACTGCTTTATGGCTATATTAATGCAGATGAGGCGATCTTTG GGTTCATCTGCTGATCGGAGT

CAGTGGAGAGAACCAACTAGAACATCTGAAGGCTTATGTTCACTCTATGAGGCAGCATTATGTCTTTTTGAAGAG - UNI35616 - GTTTGCAGAAT

GGCTTCTGACTATTCAAGAACATGTGCTAGCCCAGATAGCATTCAGACTGGTGATGCTCCCATTGTTTCTGAAACCTGTGAGGTATATGTTTGGGGCAGC

AACAGCAGCCATCAGTTGGTAGAAGGCACACAGGAGAAAATACTACAACCCAAACTGGCTCCTAGTTTCTCTGATGCTCAGACC ATTGAAGCTGGACAG

TATTGCACTTTTGTCATTTCTACAGATGGCTCTGTCAGAGCTTGTGGTAAAGGCAGCTATGGGAGACTGGGCCTCGGAGATTCCAATAATCAGTCTACCT

TAAAAAAGTTAACTTTTGAGCCTCACCGGTCTATTAAGAAAGTTTCATCATCTAAAGGCTCTGATGGTCACACTTTAGCTTTTACTACAGAAGGTGAAGT

CTTCAGCTGGGGAGATGGTGACTATGGGAAACTAGGACATGGGAATAGTTCAACCCAGAAATACCCCAAGCTTATCCAGGGCCCACTACAGGGAAAG GT

AGTAGTTTGTGTATCAGCTGGATACAGACATAGTGCTGCTGTCACAGAGGATGGTGAATTATATACTTGGGGTGAAGGAGACTTCGGAAGATTAG GTCA

TGGTGACAGTAACAGCCGTAACATTCCAACATTAGTAAAAGACATTAGCAATGTAGGAGAGGTTTCTTGTGGAAGTTCACATACGATTGCTTTGTCCAAA

GATGGAAGAACTGTGTGGTCTTTTGGAGGAGGTGACAATG GCAAACTTGGCCATGGTGATACCAACAGAGTATATAAACCTAAAGTTATTGAAGCTTTA

CAAGGAATGTTTATTCGCAAAGTCTGTGCTGGGAGCCAATCTTCACTTGCTTTGACATCAACAGGGCAG GTTTATGCTTGGGGCTGTGGGGCTTGTCTA

GGTTGTGGTTCTTCAGAAGCTACTGCTTTGAGACCCAAGCTCATTGAAGAATTGGCTGCCACAAGAATAGTTGATATTTCAATTGGAGATAGTCATTGTT

TGTCTCTTTCTCATG ATAATGAAGTTTATGCCTGGGGCAATAACTCCATGGGACAATGTGGCCAGGGAAATTCAACAGGTCCTATTACTAAACCAAAGA

AAGTGAGTGGCTTAGATGGCATAGCTATTCAGCAGATCTCAGCTGGAACATCACATAGTTTGGCATGGACTGCTCTCCCTAGGGACAG ACAAGTTGTTG

CATGGCACAGGCCTTATTGTGTAGATCTTGAAGAGAGTACCTTCTCACATCTGCGATCTTTTCTTGAGAGATACTGTGATAAAATAAACAGTGAGATTCC

CCCACTGCCTTTCCCTTCATCAAG GGAACACCATAATTTTCTTAAGCTGTGCCTGAAGTTGCTTTCAAATCACCTTGCTCTGGCCCTGGCGGGAGGGGT

AGCTACCAGTATTCTTGGGAGACAGGCAGGTCCACTTCGAAATTTGCTCTTCAGGCTGATGGATTCAACTGTCCCAGATGAAATCCAAGAG GTGGTAAT

TGAAACACTCTCAGTTGGAGCAACCATGCTGTTACCTCCATTAAGAGAACGGATGGAATTACTTCATTCTCTTTTACCCCAAGGACCTGATAGATGGGAA

AGCTTATCCAAAGGACAG AGAATGCAACTGGATATAATTCTGACAAGTTTACAAGATCATACCCATGTAGCCTCCCTTCTTGGCTATAGCTCACCTTCC

GATGCTGCTGACCTTTCGACTGTGTGCATGGGATATGGAAACCTGTCAGACCAACCATATGGTTCTCAGATCTGCCATCCAGACACTCATCTGGCCGAAA

TTTTAATGAAGACTCTCTTAAGAAATTTAGGGTTTTATACG GATCAAGCATTTGGAGAGCTGGAAAAGAATAGTGATAAATATCTTCTTGGAACATCAT

CTTCAGAGAACAGTCAGCCTGCTCATCTTCATGAACTACTGTGTTCATTACAGAAACAACTGCTTGCTTTTTGCCACATTAATAATGTTACTGAG AACT

CAAGCAGTGTGGCATTGCTTCATAAACATCTTCAGCTTTTGCTGCCTCATGCCACAGATATTTATTCACGTTCCGCAAATTTGCTTAAAGAAAGTCCTTG

GAATGGCAGTGTTGGAGAAAAATTGAGAG ATGTGATCTATGTGTCAGCTGCAGGCAGTATGCTCTGCCAGATTGTTAACTCCCTACTGTTACTCCCGGT

GTCAGTGGCTCGTCCTTTATTGAGTTACCTCCTCGACCTCTTGCCACCTCTTGATTGCCTTAATAGACTCCTGCCAGCTGCTGCTCTTTTAGAAGACCAA

GAATTACAGTGGCCTCTTCATG GGGGGCCAGAAGTAATTGACCCAGCTGGTGTGCCATTACCTCAACCAGCTCAATCTTGGGTATGGCTTGTGGATCTG

GAAAGAACAATTGCTCTCCTCATTGGCCGATGTCTTGGTGGCATGCTTCAAGGCTCCCCCGTGTCTCCAGAGGAACAGGATACTGCATATTGGATGAAAA

CACCACTTTTCAGTGATGGTGTGGAAATGGACACCCCTCAGTTGG ATAAATGCATGAGCTGCCTACTAGAAGTAGCTCTTTCAGGAAATGAAGAACAGA

AGCCTTTTGATTACAAATTGCGGCCTGAAGTTGCTGTCTATGTAGACTTGGCATTGGGTTGTTCTAAAGAGCCTGCCAGAAGCCTTTGGATCAGCATGCA

GGATTATGCTGTCAGTAAAG - UNI6246 - ATTGGGACAGTGCAACTTTAAGCAATGAGTCACTCTTGGACACTGTGTCTAGATTTGTTCTTGCAGC

ACTCCTCAAACACACAAACTTACTTAGTCAAGCATGTGGAGAAAGCCG ATACCAACCTGGTAAAAGCTTATCAGAAGTGTATCGTTGTGTATACAAAGT

TCGAAGTCGTTTACTTGCTTGCAAGAACCTTGAACTTATTCAAACCAGGTCATCATCACGAGACAGATGG ATCACAGATAACCAAGACTCTGCAGATGT

TGATCCTCAAGAACATTCCTTTACCCGGACCATTGATGAAGAAGCTGAAATGGAAGAGCTGGCTGAGAGAGACAGGGAGGATGGCCACCCAGAGCCAGAA

GATGAGGAAGAAGAGCGGGAACATGAAGTAATGACAGCTGGCA AAATCTTTCAGTGTTTCCTCTCAGCCCGAGAAGTAGCTCGTAGCCGAGACCGAGAT

AGAATGAACAGTGGGGCAGGGTCTGGGGTCCGAGCTGATGACCCACCTCCACAGTCTCAGCAGGAACGACGGGTCAGCACAGACCTTCCTGAGGGGCAGG

ATGTGTACACTGCTGCCTGCAACTCTGTGATCCACCGCTGTGCGCTCCTAATATTAGGAGTAAGTCCTGTGATTGATGAGCTTCAGAAGCGAAGGGAAGA

AGGGCAGTTACAGCAGCCTTCTGTGAGTGCCTCTGAAGGCACCGGACTTATGACCAG GAGTGAAAGTCTCACTGCTGAGAGCCGCCTAGTTCATGCAAG

TCCAAGTTACAGACTGATCAAATCAAGGAGTGAATCTGATCTGTCTCAGCCTGAATCAGATGAAGAGGGTTATGCACTG AGTGGCCGACGAAATGTTGA

CTTTGATTTGGCATCATCTCATAGGAAGAGAG GTCCTATGCATAGTCAATTAGAATCTCTGAGTGACTCTTGGACTCGCCTGAAACATACCAGAGATTG

GTTCTACAACTCCTCTTACTCCTTTGAGTCTGATTTTGACCTTACCAAGTCTTTGGGAGTTCACACATTGATTGAAAATGTTGTAAGTTTTGTGAGTGGA

GATGTGGGGAATGCTCCAGGTTTCAAGGAGCCGGAAGAAAGTATGTCTACAAGTCCCCAGGCCTCCATCATTGCAATGGAGCAGCAACAGCTAAGGGCAG

AG CTTCGTTTGGAGGCACTTCACCAGATCCTCACTCTATTGTCTGGCATGGAAGAAAAGGGTAACGTCTTGCTGACAGGAAGCAGATCAAGTTCAGGCT

TTCAGTCATCCACCCTGCTCACATCTGTGAGATTGCAGTTTCTAGCAGGGTGTTTCGGTTTAGGTACTGTTGGACACACGGGAACCAAAGGAGAGAGTGG

CCGACTGCATCACTATCAA GATGGAATCAGAGCAGCTAAAAGAAATATTCAGGTTGAAATCCAGGTGGCTGTTCATAAGATTTACCAACAATTGTCTGC

TACTCTAGAGAGAGCCCTTCAAGCAAATAAGCATCACATTG AAGCCCAGCAACGTCTTCTTTTGGTTACAGTTTTTGCCCTCAGTGTTCATTATCAGCC

AGTTGATGTTTCTTTGGCAATTTCCACTGGTCTGCTGAATGTATTGTCACAGTTATGTGGAACAGACACCATGTTAGGACAGCCCCTGCAATTGTTGCCA

AAGACTGGTGTTTCTCAGCTTAGCACAGCTTTGAAAGTGGCCAGTACAAGGTTGCTCCAGATTCTAGCCATCACTACTGG AACTTATGCTGATAAACTG

AGCCCCAAAGTAGTTCAGTCCTTGTTGGATCTACTCTGTAGTCAACTGAAGAATTTATTGTCTCAAACTGGTGTGCTGTTTATGGCCTCTTTTGGAGAAG

GAGAAGAGGGTGAAGAAGAAGAAAAAAAAGTTGACTCCAGTGGAGAAGCTGAGAAGAGAGATTTCAGAG CTGCTCTTAGGAAACAACATGCAGCTGAAC

TCCATCTAGGGGATTTTTTAGTTTTTCTTCGCAGAGTTGTGTCTTCAAAAGCAATTCAATCAAAAATGGCTTCCCCCAAATGGACAGAAGTGCTTCTAAA

CATAGCATCTCAGAAGTGTTCTTCAG GTATTCCTCTAGTTGGTAACTTAAGAACGAGGCTTCTTGCTCTTCACGTCCTTGAGGCTGTGCTACCAGCTTG

TGAATCTGGTGTAGAAGATGACCAAATGGCCCAG GTTGTTGAACGCTTATTTTCCCTTCTGTCGGATTGTATGTGGGAGACTCCCATTGCTCAGGCCAA

ACATGCTATTCAAATAAAGGAAAAAGAACAAGAAATAAAATTGCAG AAGCAAGGAGAGTTGGAGGAAGAGGATGAAAATCTCCCCATACAAGAAGTCTC

TTTTGATCCAGAGAAGGCTCAGTGTTGCATTGTGGAAAATGGACAGATTTTAACTCATGGCAGTGGAGGAAAAGGATATGGATTAGCATCTACTGGGGTG

ACTTCTGGGTGCTATCAGTGGAAG TTTTATATTGTGAAGGAAAATAGAGGTAATGAAGGTACATGTGTTGGAGTCTCTCGCTGGCCAGTACATGATTTT

AATCACCGCACTACCTCAGACATGTGGCTCTATAGGGCGTATAGTGGTAACCTCTATCACAATGGAGAACAGACTCTGACGCTGTCCAGCTTTACTCAAG

GGGATTTTATTACCTGTGTGTTAGACATGGAAGCCAGGACCATTTCCTTTGGGAAAAATGGAGAG GAACCCAAATTAGCCTTTGAAGATGTGGATGCAG

CAGAGTTGTACCCATGTGTAATGTTCTATAGCAGCAACCCGGGTGAGAAG GTGAAAATTTGTGATATGCAGATGCGTGGCACGCCACGGGATTTACTTC

CAGGAGACCCTATTTGTAGTCCAGTAGCAGCAGTGCTAGCTGAAGCCACTATTCAGCTTATCCGTATCCTTCACCGGACAGACCGTTGGACTTACTGTAT

TAATAAAAAGATGATGGAAAGACTCCATAAAATTAAGATATGTATTAAAGAGTCAGGTCAGAAGCTAAAGAAAAGCCGCTCGGTTCAAAGCCGAGAGGAA

AATGAAATGAGAGAGGAGAAAGAGAACAAAGAAGAAGAGAAAGGTAAACATAACAGGCATGGTCTTGCTGACCTCTCAGAACCGCAGCTGAGGACTCTTT

GCATAGAGGTGTGGCCCGTGCTAGCAGTAATAGGAGGAGTTGATGCTGGTCTTAGAGTTGGAGGTCGGTGTGTTCACAAGCAAACTGGGCGCCATGCCAC

GCTGCTGGGAGTGGTCAAAGAAGGCAGCACATCTGCCAAGGTCCAGTGGGATGAAGCAGAGATTACCATCAG CTTCCCAACTTTTTGGTCGCCTAGTGA

TACTCCATTATATAACCTGGAACCCTGTGAACCACTGCTGTTTGATGTGGCGCGATTCCGAGGCCTGACAGCTTCTGTGCTGCTGGACCTAACATATCTG

ACAGGCATTCATGAAGATGTGGGGAAACAGAGCATCAAGCGACATGAAAAGAAACACCGTCATGAGTCTGAGGAAAAGGGGGACATTGAGCAGAAACCTG

AGAGTGAATCCATTTTAGATGTGCGAACAGGGTTAATGTCTGATGATGTCAAAAGTCAGGGTACCACAAGCTCCAAATCAGAAAATGAAATAGCTTCATT

TTCTTTAGAACCAACACTGCCAGGTGTGGAATCCCAACATCAAATAACAGAAGGAAAGAGAAAAAATCATGAACACATATCCAAAACCCATGACATAGCT

CAGTCAGAAATCAGAGCAGTCCAGCTTTCCTATCTCTACCTCGGTGCTATGAAGTCACTTAGTGCTCTTCTTGGCTGTAGTAAATATGCTGAGCTCTTGC

TGATCCCAAAAGTTCTCGCTGAAAATGGCCACAACTCAGACTGTGCAAGTTCTCCAGTTGTTCACGAGGATGTGGAGATGCGTGCTGCTCTACAGTTCCT

GATGCGGCACATGGTGAAGCGAGCAGTCATGCGGTCACCCATAAAGCGAGCATTGGGGTTAGCTGATCTGGAACGGGCTCAAGCTATGATTTACAAACTG

GTGGTCCATGGGCTTTTGGAAGACCAGTTTGGGGGCAAAATTAAGCAAG AGATTGATCAACAAGCTGAAGAAAGTGACCAAGCACAGCAGGCACAGACA

CCAGTGACCACCAGCCCGTCAGCATCCAGCACAACTTCCTTTATGAGCAGCTCCCTCGAGGACACCACAACTGCCACCACTCCTGTCACGGACACAGAAA

CGGTGCCTGCATCTGAGTCCCCTGGAGTGATGCCACTTAGTCTTCTCAG GCAAATGTTTTCTAGTTATCCAACTACCACTGTACTTCCTACACGCCGTG

CACAGACTCCTCCAATATCATCATTACCAGCCTCTCCTTCTGATGAAGTAGGAAGAAGGCAGAGTTTAACTTCTCCAGATTCCCAGTCTACAAGGCCAGC

TAATCGCACAG CCTTGTCAGACCCAAGCAGCAGACTTTCAACGTCCCCTCCTCCCCCAGCAATTGCAGTCCCTTTACTGGAAATGGGCTTCTCTCTTCG

ACAGATTGCTAAAGCCATGGAAGCTACAG GTGCTCGAGGAGAGGCTGATGCCCAGAGCATCACTGTTCTTGCTATGTGGATGATAGAGCACCCTGGGCA

TGAGGATGAGGAGGAGCCCCAGCCCAGCAGCACAGCAGACTCCAGACATGGAGCGACAGTTCTGGGAAGTGGTGGGAAGTCAAATGATCCCTGTTATTTA

CAGTCACCAGGAGACATACCATCAGCTGATGCTGCTGAAATGGAGGAAGGCTTTAGTGAAAG CCCTGATAATCTGGATCATACAGAGAACGCAGCTTCT

GGAAGTGGACCACCAACTAGAGGTCGCTCAACAGTAACAAGAAGGCACAAATTTGACTTAGCAGCGCGCACTCTGCTTGCAAGAGCAG CGGGGTTATAC

CGCTCTGTGCAGGCTCACAGGAATCAAAGTCGGAGAGAAGGAATATCTTTGCAGCAAGACCCAGGGGCGTTGTATGACTTTAATTTAGATGAGGAATTGG

AAATTGATCTTGATGATGAAGCAATGGAAGCTATGTTTGGACAAGACCTGACCAGTGACAATGATATTCTGGGAATGTGGATCCCAGAGGTACTGGATTG

GCCTACCTGG - UNI7800 - CATGTTTGTGAGTCTGAAGACAGGGAAGAAGTGGTCGTGTGTGAACTTTGCGAATGCAATGTCGTCAGCTTCAACCA

GCACATGAAGAGAAACCACCCTGGCTGTGGGCGCAGTGCAAACCGCCAGGGGTATCGCAGCAATGGCTCCTATGTGGATGGCTGGTTTGGTGGTGAATGT

GGGAGTGGAAATCCATACTACCTGCTGTGTGGCAGCTGCAGGGAGAAGTACTTAGCCCTGAAGACCAAGACCAAGACTACAAATTCTGAAAG GTACAAG

GGACAAGCCCCAGATCTAATTGGCAAGCAGGACAGTGTGTATGAAG AAGACTGGGACATGTTAGATGTTGATGAAGATGAAAAACTAACAGGTGAAGAA

GAATTTGAATTGCTTGCTGGACCACTTGGTTTAAATGACCGGCGCATTGTGCCAGAACCAGTTCAGTTCCCTGACAGTGACCCCCTGGGAGCATCAGTAG

CAATGGTCACAGCTACCAACAGTATGGAAGAGACTTTGATGCAAATTG GTTGCCATGGGTCTGTGGAAAAGAGTTCCTCTGGGAGAGTAACATTAGGAG

AGCAGGCAGCAGCCCTTGCAAATCCTCATGACCGAGTAGTGGCTTTAAGGAGGGTGACTGCTGCCGCTCAAGTCCTTCTGGCCAGAACCATGGTCATGCG

AGCACTGTCTCTTCTCTCAGTCAG TGGTTCCAGTTGTAGCCTGGCTGCTGGTCTCGAGTCTCTGGGGTTAACAGATATCCGTACACTGGTTCGGTTAAT

GTGCTTAGCTGCAGCAGGGAGAGCTGGCCTTTCCACCAGCCCTTCTGCCATAGCCAGTACCTCAGAACGTTCACGAGGTGGACACAGTAAGGCCAGCAAG

CCCATTTCTTGCCTGGCCTACCTGAGCACAGCAGTGGGATGCCTGGCATCAAATACTCCAAGTGCTGCAAAGCTGCTGGTCCAGCTGTGTACACAG AAC

TTGATTTCTGCTGCAACAGGTGTCAATCTCACCACAGTGGATGATCCCATTCAGCGGAAGTTCCTACCAAGCTTTCTCCGTGGAATTGCTGAAGAGAATA

AGCTTGTAACATCCCCAAACTTTGTTGTAACTCAAGCCCTTGTGGCATTATTGGCAGACAAAGGGGCCAAACTGAGACCTAACTATGATAAGACAGAAAT

AGAAAAAAAAG GCCCTCTGGAGCTGGCTAATGCCCTGGCAGCCTGCTGCCTCTCCTCTAGGCTATCCTCACAGCATAGGCAATGGGCTGCTCAACAACT

TGTGCGCACTCTTGCTGCACATGACCGTGACAACCAAACTGCTCCACAAACACTTGCTGATATGGGAGGAGATCTCAGAAAATGCTCTTTTATCAAGTTG

GAGGCTCACCAAAACAGA GTAATGACATGTGTTTGGTGTAATAAAAAAGGCCTATTGGCTACCAGTGGCAATGATGGCACTATTCGGGTGTGGAATGTT

ACCAAGAAGCAATACTCACTACAGCAAACCTGTGTGTTCAATAGACT GGAAGGGGATGCTGAGGAAAGCCTTGGGTCACCCAGCGACCCAAGCTTCTCA

CCTGTTTCCTGGAGTATCAGTGGCAAATACCTTGCTGGGGCTTTGGAGAAGATGGTGAACATCTGGCAAGTTAATG GAGGAAAAGGATTAGTAGATATT

CAGCCTCACTGGGTATCTGCCCTGGCCTGGCCAGAAGAAGGTCCAGCTACAACCTGGTCAGGAGAGTCTCCAGAGTTACTGCTGGTGGGACGGATGGACG

GATCTCTAGGACTGATTGAAGTTGTTGATGTGTCCACTATGCACCGCCGAGAACTGGAACACTGCTATCGAAAGGATG TATCTGTCACCTGCATTGCTT

GGTTCAGTGAAGACAGACCATTTGCAGTAGGTTATTTTGATGGAAAATTGTTAATGGGAACAAAAGAACCACTTGAGAAAGGAGGCATTGTTCTTATTGA

TGCACATAAG GAAACTCTTGTTAGTATGAAATGGGACCCAACAGGCCATATTCTCATGACATGTGCCAAAGAAGAAAATGTGAAACTCTGGGGGCCTGT

TTCAGGATGCTGGCGCTGTCTACATTCACTCTGCCATCCATCCACTGTAAATGGCATCGCCTGGTGCAGCCTTCCAGGGAAAGGATCCAAGATGCAGTTA

CTGATGGCCAC TGGCTGTCAAAATGGCTTAGTGTGTGTTTGGCGTATTCCTCAAGATACCACACAGACCAGTATGACTAGCTCAGAAGGATGGTGGGAC

CAGGAATCAAATTGTCAG GATGGCTATAGGAAATCGGCAGGAGCCAAGTGTGTTTATCAGCTGCGGGGACACATCACACCTGTTCGGACTGTGGCCTTC

AGTTCTGATGGCTTGGCCTTGGTGTCTGGTGGACTTGGAGGGCTTATGAACATTTGGTCTTTAAGG GATGGCTCTGTCTTGCAAACTGTTGTAATCGGC

TCTGGAGCTATTCAGACCACAGTATGGATTCCAGAAGTTGGGGTAGCTGCCTGCTCAAATAGATCAAAG GATGTTTTGGTTGTTAATTGCACAGCAGAA

TGGGCTTCTGCCAATCACATTTTAGCAACTTGTAGAACAGCCCTGAAACAACAGGGTGTTCTGGGATTAAACATGGCTCCCTGCATGAGAGCATTTCTGG

AACGGCTACCCATGATGCTTCAAGAGCAATATGCCTATGAAAAG CCTCATGTAGTTTGTGGTGACCAGCTTGTTCATAGCCCCTACATGCAATGTTTGG

CTTCCCTTGCTGTGGGACTTCATCTGGATCAGCTGTTGTGTAACCCTCCAGTGCCACCACACCATCAGAACTGTCTCCCTGACCCTACATCCTGGAATCC

CAATGAGTGGGCCTGGTTAGAATGCTTTTCAACCACTATCAAGGCTGCAGAGGCCCTCACCAACGGAGCGCAGTTTCCAGAGTCATTTACTGTCCCAGAT

CTAGAGCCTGTTCCTGAGGATGAGCTAGTGCTCCTCATG GATAACAGCAAGTGGATCAATGGCATGGATGAACAAATTATGTCTTGGGCAACTTCCAGA

CCTGAG GACTGGCACCTGGGAGGTAAATGTGATGTCTACTTATGGGGTGCTGGCAGGCATGGACAGCTGGCCGAAGCTGGAAGAAACGTAATGGTACCA

GCTACAGCGCCTTCATTCTCACAAGCCCAACAG GTTATATGTGGTCAGAACTGTACCTTTGTCATCCAGGCCAATGGGACAGTGTTGGCTTGTGGGGAA

GGAAGTTATGGCAGATTAGGGCAAGGAAATTCAGACGACCTTCATGTGCTGACTGTGATTTCAGCCCTACAAG - UNI4352 - GTTTTGTGGTGACC

CAGCTGGTGACTTCCTGTGGCTCAGATGGGCACTCAATGGCCTTGACAGAAAGTGGTGAAGTCTTTAGCTGGGGAGATGGGGACTATGGCAAACTTGGCC

ATGGAAACAGTGACAGACAGCGGCGACCAAGGCAGATTGAGGCCTTACAAGGAGAAGAAGTGGTACAG ATGTCTTGTGGCTTCAAGCATTCAGCAGTGG

TAACATCGGATGGCAAACTCTTTACCTTTGGGAACGGTGACTATGGCCGTCTGGGTCTTGGAAACACCTCTAACAAAAAACTTCCAGAGAGAGTGACCGC

GCTGGAGGGATATCAGATTGGACAG GTGGCCTGTGGGCTGAACCACACCTTGGCAGTGTCAGCAGATGGCTCCATGGTGTGGGCCTTTGGAGATGGAGA

CTATGGTAAACTGGGCCTAGGAAATTCCACAGCAAAGTCTTCTCCTCAG AAAGTTGATGTCCTCTGTGGAATTGGAATTAAAAAGGTTGCTTGTGGAAC

TCAGTTTTCTGTGGCTTTGACTAAAGATGGTCATGTGTACACTTTTGGTCAAG ACCGATTGATAGGCTTGCCTGAGGGACGTGCTCGTAATCATAATCG

ACCCCAACAAATCCCTGTCCTGGCTGGAGTGGTCATTGAAGATGTGGCTGTTGGAGCTGAACACACACTTGCTTTGGCATCAACTGGAGATGTTTATGCC

TGGGGTAGCAATTCAGAAGGGCAG CTTGGCCTAGGCCACACCAACCACGTTCGAGAACCAACCCTGGTAACAGTTCTGCAAGGGAAAAACATCCGCCAG

ATTTCAGCGGGCCGTTGCCACAGTGCTGCATGGACAGCCCCACCAGTCCCACCAAGAGCACCAG GTGTGTCAGTGCCTCTCCAGTTGGGCCTGCCTGAC

GCAGTGCCCCCACAGTATGGGGCACTGAGAGAGGTGAGCATTCACACCGTGCGAGCAAGGCTCCGGCTGCTCTACCACTTCTCTGACCTCATGTACTCAT

CATGGAGGTTGCTGAATCTCAGCCCCAACAACCAG AACAGCACATCTCATTACAATGCTGGAACATGGGGCATTGTACAGGGACAACTTCGGCCTCTGT

TAGCTCCAAGAGTCTACACACTCCCCATGGTGCGCTCCATAGGGAAGACCATGGTTCAAGGCAAAAATTATGGGCCTCAGATTACTGTAAAGAGAATATC

AACCAG AGGACGGAAGTGTAAGCCCATCTTTGTTCAAATAGCAAGACAAGTGGTAAAGCTAAATGCATCAGACCTCCGTCTACCCTCCCGAGCATGGAA

GGTTAAGCTGGTTGGAGAAGGAGCTGATGATGCTGGAGGAGTGTTTGATGACACTATCACAGAAATGTGCCAG GAACTTGAGACTGGAATAGTTGACCT

TCTCATACCCTCTCCTAATGCTACTGCAGAAGTCGGTTACAACAGAGACAG - UNI30919 - GTTCCTTTTTAATCCCTCTGCCTGCCTTGATGAAC

ATTTAATGCAGTTTAAGTTCTTAGGAATTTTAATGGGGGTTGCCATTCGTACAAAGAAGCCTCTGGACCTTCACCTGGCCCCTCTGGTGTGGAAACAGCT

GTGCTGTGTCCCACTTACCCTGGAGGACCTGGAGGAAGTGGATCTGCTTTATGTGCAGACCCTTAACAGCATCCTTCACATTGAAGACAGTGGCATTACT

GAAGAGAGTTTCCATGAG ATGATTCCTCTTGATTCTTTTGTCGGCCAGAGTGCTGATGGTAAAATGGTCCCCATAATCCCTGGAGGAAATAGTATCCCA

CTCACGTTTTCCAATAGGAAGGAGTACGTAGAGAGGGCCATTGAGTATCGACTTCATGAGATGGACCGGCAG - UNI29253 - GTGGCTGCAGTTCG

AGAAGGGATGTCCTGGATTGTCCCTGTACCATTGCTGTCCCTCCTCACGGCCAAACAGCTGGAACAGATGGTGTGTGGGATGCCTGAGATCTGTGTGGAT

GTCCTGAAGAAGGTGGTGCGATACCGTGAGGTGGACGAGCAGCACCAGCTGGTGCAGTGGCTCTGGCGCACACTGGAAGAGTTCTCCAATGAGGAGCGGG

TGCTCTTCATGCGCTTCGTGTCTGGGCGCTCCCGACTGCCAGCCAACACCGCTGACATTTCCCAGAGGTTTCAAATCATGAAGGTCGATAGG - UNI29

195 - CCTTATGACAGTCTGCCTACCTCACAGACCTGCTTCTTCCAGCTGCGGCTGCCCCCCTATTCCAGCCAGCTGGTCATGGCTGAGCGCTTGAGAT

ATGCCATCAACAACTGCCGTTCAATTGACATGGACAACTACATGCTCTCAAGAAATGTGGACAATGCAGAGGGCTCTGACACTGACTACTGACTGATGAG

GGTGCTGTCACCCACCTTTCTCAATAATGCTCACTTCCAATTTGATGTTGGTATACTTTTATGGTAACTACATAGATGTTTTAGGAACATAAGCTGATAC

AAACAGTGGCCACATTTAGTTACTTCAAATGAAACAAAGAAATTAGATGGTTTTATTTTTCTGTGATTGTACAAAACAAAGAGCAGAAACTGCTCAGTCA

GGTTTTCCTCTGTATTTTTTGGTCACTGTGGATAAGTTTGCATGGAGCCATTTTGGTGTATTTTTAGTTGAGAATGATACATTTTTGTAAGCCCACCCAG

TGAACATGAAATTGTACATTGTGTATAATTGTTCATTAGAAAGGACAGTTTTACATGAATATTCATATATTTATTTTGTTTTAGTTTGATTCGCCTGTGC

AGGGTTCCTTATGCAGAGAAATAAAGCAGATTCAGGAATCAGA
Translation
 - UNI4010 - - UNI16525 - MATMVPPVKLKWLEHLNSSWITEDSESIATREGVTVLYSKLISNKEVVPLPQQVLCLKGPQLPDFERESLSSDE

QDHYLDALLSSQLALAKMVCSDSPFAGALRKRLLVLQRVFYALSNKYHDKGKVKQQQHSPESSSGSADVHSVSERPRSSTDALIEMGVRTGLSLLFALLR

QSWMMPVSGPGLSLCNDVIHTAIEVVSSLPPLSLANESKIPPMGLDCLSQVTTFLKGVTIPNSGADTLGRRLASELLLGLAAQRGSLRYLLEWIEMALGA

SAVVYTMEKNKLLSSQEGMISFDCFMAILMQMRRSL GSSADRSQWREPTRTSEGLCSLYEAALCLFEE - UNI35616 - VCRMASDYSRTCASPDS

IQTGDAPIVSETCEVYVWGSNSSHQLVEGTQEKILQPKLAPSFSDAQT IEAGQYCTFVISTDGSVRACGKGSYGRLGLGDSNNQSTLKKLTFEPHRSIK

KVSSSKGSDGHTLAFTTEGEVFSWGDGDYGKLGHGNSSTQKYPKLIQGPLQGK VVVCVSAGYRHSAAVTEDGELYTWGEGDFGRLG GHGDSNSRNIPT

LVKDISNVGEVSCGSSHTIALSKDGRTVWSFGGGDNG GKLGHGDTNRVYKPKVIEALQGMFIRKVCAGSQSSLALTSTGQ VYAWGCGACLGCGSSEAT

ALRPKLIEELAATRIVDISIGDSHCLSLSHD DNEVYAWGNNSMGQCGQGNSTGPITKPKKVSGLDGIAIQQISAGTSHSLAWTALPRDR RQVVAWHRP

YCVDLEESTFSHLRSFLERYCDKINSEIPPLPFPSSR REHHNFLKLCLKLLSNHLALALAGGVATSILGRQAGPLRNLLFRLMDSTVPDEIQE VVIET

LSVGATMLLPPLRERMELLHSLLPQGPDRWESLSKGQ RMQLDIILTSLQDHTHVASLLGYSSPSDAADLSTVCMGYGNLSDQPYGSQICHPDTHLAEIL

MKTLLRNLGFYT DQAFGELEKNSDKYLLGTSSSENSQPAHLHELLCSLQKQLLAFCHINNVTE NSSSVALLHKHLQLLLPHATDIYSRSANLLKESPW

NGSVGEKLRD DVIYVSAAGSMLCQIVNSLLLLPVSVARPLLSYLLDLLPPLDCLNRLLPAAALLEDQELQWPLHG GGPEVIDPAGVPLPQPAQSWVWL

VDLERTIALLIGRCLGGMLQGSPVSPEEQDTAYWMKTPLFSDGVEMDTPQLD DKCMSCLLEVALSGNEEQKPFDYKLRPEVAVYVDLALGCSKEPARSL

WISMQDYAVSKD - UNI6246 - DWDSATLSNESLLDTVSRFVLAALLKHTNLLSQACGESR RYQPGKSLSEVYRCVYKVRSRLLACKNLELIQTRS

SSRDRW ITDNQDSADVDPQEHSFTRTIDEEAEMEELAERDREDGHPEPEDEEEEREHEVMTAGK KIFQCFLSAREVARSRDRDRMNSGAGSGVRADDP

PPQSQQERRVSTDLPEGQDVYTAACNSVIHRCALLILGVSPVIDELQKRREEGQLQQPSVSASEGTGLMTR RSESLTAESRLVHASPSYRLIKSRSESD

LSQPESDEEGYAL SGRRNVDFDLASSHRKRG GPMHSQLESLSDSWTRLKHTRDWFYNSSYSFESDFDLTKSLGVHTLIENVVSFVSGDVGNAPGFKEP

EESMSTSPQASIIAMEQQQLRAE LRLEALHQILTLLSGMEEKGNVLLTGSRSSSGFQSSTLLTSVRLQFLAGCFGLGTVGHTGTKGESGRLHHYQ DGI

RAAKRNIQVEIQVAVHKIYQQLSATLERALQANKHHIE EAQQRLLLVTVFALSVHYQPVDVSLAISTGLLNVLSQLCGTDTMLGQPLQLLPKTGVSQLS

TALKVASTRLLQILAITTG GTYADKLSPKVVQSLLDLLCSQLKNLLSQTGVLFMASFGEGEEGEEEEKKVDSSGEAEKRDFRA AALRKQHAAELHLGD

FLVFLRRVVSSKAIQSKMASPKWTEVLLNIASQKCSSG GIPLVGNLRTRLLALHVLEAVLPACESGVEDDQMAQ VVERLFSLLSDCMWETPIAQAKHA

IQIKEKEQEIKLQ KQGELEEEDENLPIQEVSFDPEKAQCCIVENGQILTHGSGGKGYGLASTGVTSGCYQWK FYIVKENRGNEGTCVGVSRWPVHDFN

HRTTSDMWLYRAYSGNLYHNGEQTLTLSSFTQGDFITCVLDMEARTISFGKNGE EPKLAFEDVDAAELYPCVMFYSSNPGEK VKICDMQMRGTPRDLL

PGDPICSPVAAVLAEATIQLIRILHRTDRWTYCINKKMMERLHKIKICIKESGQKLKKSRSVQSREENEMREEKENKEEEKGKHNRHGLADLSEPQLRTL

CIEVWPVLAVIGGVDAGLRVGGRCVHKQTGRHATLLGVVKEGSTSAKVQWDEAEITIS SFPTFWSPSDTPLYNLEPCEPLLFDVARFRGLTASVLLDLT

YLTGIHEDVGKQSIKRHEKKHRHESEEKGDIEQKPESESILDVRTGLMSDDVKSQGTTSSKSENEIASFSLEPTLPGVESQHQITEGKRKNHEHISKTHD

IAQSEIRAVQLSYLYLGAMKSLSALLGCSKYAELLLIPKVLAENGHNSDCASSPVVHEDVEMRAALQFLMRHMVKRAVMRSPIKRALGLADLERAQAMIY

KLVVHGLLEDQFGGKIKQE EIDQQAEESDQAQQAQTPVTTSPSASSTTSFMSSSLEDTTTATTPVTDTETVPASESPGVMPLSLLR RQMFSSYPTTTV

LPTRRAQTPPISSLPASPSDEVGRRQSLTSPDSQSTRPANRTA ALSDPSSRLSTSPPPPAIAVPLLEMGFSLRQIAKAMEATG GARGEADAQSITVLA

MWMIEHPGHEDEEEPQPSSTADSRHGATVLGSGGKSNDPCYLQSPGDIPSADAAEMEEGFSES SPDNLDHTENAASGSGPPTRGRSTVTRRHKFDLAAR

TLLARAA AGLYRSVQAHRNQSRREGISLQQDPGALYDFNLDEELEIDLDDEAMEAMFGQDLTSDNDILGMWIPEVLDWPTW - UNI7800 - HVCES

EDREEVVVCELCECNVVSFNQHMKRNHPGCGRSANRQGYRSNGSYVDGWFGGECGSGNPYYLLCGSCREKYLALKTKTKTTNSER RYKGQAPDLIGKQD

SVYEE EDWDMLDVDEDEKLTGEEEFELLAGPLGLNDRRIVPEPVQFPDSDPLGASVAMVTATNSMEETLMQIG GCHGSVEKSSSGRVTLGEQAAALAN

PHDRVVALRRVTAAAQVLLARTMVMRALSLLSVS SGSSCSLAAGLESLGLTDIRTLVRLMCLAAAGRAGLSTSPSAIASTSERSRGGHSKASKPISCLA

YLSTAVGCLASNTPSAAKLLVQLCTQ NLISAATGVNLTTVDDPIQRKFLPSFLRGIAEENKLVTSPNFVVTQALVALLADKGAKLRPNYDKTEIEKKG 

GPLELANALAACCLSSRLSSQHRQWAAQQLVRTLAAHDRDNQTAPQTLADMGGDLRKCSFIKLEAHQNR VMTCVWCNKKGLLATSGNDGTIRVWNVTKK

QYSLQQTCVFNRL LEGDAEESLGSPSDPSFSPVSWSISGKYLAGALEKMVNIWQVNG GGKGLVDIQPHWVSALAWPEEGPATTWSGESPELLLVGRMD

GSLGLIEVVDVSTMHRRELEHCYRKDV VSVTCIAWFSEDRPFAVGYFDGKLLMGTKEPLEKGGIVLIDAHK ETLVSMKWDPTGHILMTCAKEENVKLW

GPVSGCWRCLHSLCHPSTVNGIAWCSLPGKGSKMQLLMAT TGCQNGLVCVWRIPQDTTQTSMTSSEGWWDQESNCQ DGYRKSAGAKCVYQLRGHITPV

RTVAFSSDGLALVSGGLGGLMNIWSLR DGSVLQTVVIGSGAIQTTVWIPEVGVAACSNRSK DVLVVNCTAEWASANHILATCRTALKQQGVLGLNMAP

CMRAFLERLPMMLQEQYAYEK PHVVCGDQLVHSPYMQCLASLAVGLHLDQLLCNPPVPPHHQNCLPDPTSWNPNEWAWLECFSTTIKAAEALTNGAQFP

ESFTVPDLEPVPEDELVLLM DNSKWINGMDEQIMSWATSRPE DWHLGGKCDVYLWGAGRHGQLAEAGRNVMVPATAPSFSQAQQ VICGQNCTFVIQA

NGTVLACGEGSYGRLGQGNSDDLHVLTVISALQG - UNI4352 - GFVVTQLVTSCGSDGHSMALTESGEVFSWGDGDYGKLGHGNSDRQRRPRQIEA

LQGEEVVQ MSCGFKHSAVVTSDGKLFTFGNGDYGRLGLGNTSNKKLPERVTALEGYQIGQ VACGLNHTLAVSADGSMVWAFGDGDYGKLGLGNSTAKS

SPQ KVDVLCGIGIKKVACGTQFSVALTKDGHVYTFGQD DRLIGLPEGRARNHNRPQQIPVLAGVVIEDVAVGAEHTLALASTGDVYAWGSNSEGQ LG

LGHTNHVREPTLVTVLQGKNIRQISAGRCHSAAWTAPPVPPRAPG GVSVPLQLGLPDAVPPQYGALREVSIHTVRARLRLLYHFSDLMYSSWRLLNLSP

NNQ NSTSHYNAGTWGIVQGQLRPLLAPRVYTLPMVRSIGKTMVQGKNYGPQITVKRISTR RGRKCKPIFVQIARQVVKLNASDLRLPSRAWKVKLVGE

GADDAGGVFDDTITEMCQ ELETGIVDLLIPSPNATAEVGYNRDR - UNI30919 - RFLFNPSACLDEHLMQFKFLGILMGVAIRTKKPLDLHLAPL

VWKQLCCVPLTLEDLEEVDLLYVQTLNSILHIEDSGITEESFHE MIPLDSFVGQSADGKMVPIIPGGNSIPLTFSNRKEYVERAIEYRLHEMDRQ - U

NI29253 - VAAVREGMSWIVPVPLLSLLTAKQLEQMVCGMPEICVDVLKKVVRYREVDEQHQLVQWLWRTLEEFSNEERVLFMRFVSGRSRLPANTAD

ISQRFQIMKVDR - UNI29195 - PYDSLPTSQTCFFQLRLPPYSSQLVMAERLRYAINNCRSIDMDNYMLSRNVDNAEGSDTDY
Transcript ENSMUST00000098618
Sequence
ATGGCGACTATGGTTCCACCAGTGAAACTGAAATGGCTTGAACATCTGAATAGCTCCTGGATCACAGAGGACAGTGAATCTATTGCTACAAGAGAGGGAG

TTACCGTTTTGTATTCTAAACTGATCAGCAATAAGGAAGTAGTACCTTTGCCTCAACAGGTTTTATGCCTCAAAGGACCACAGTTGCCTGATTTTGAACG

TGAGTCTCTTTCAAGTGATGAGCAGGACCATTATTTGGATGCCCTTCTTAGCAGCCAGCTAGCACTAGCAAAGATGGTATGTTCAGATTCTCCATTTGCT

GGGGCGCTAAGAAAACGACTGCTTGTACTCCAACGTGTCTTTTATGCACTTTCTAATAAGTACCATGACAAAGGCAAAGTGAAACAGCAGCAGCATTCTC

CGGAGAGCAGCTCTGGTTCAGCAGATGTCCATTCTGTCAGTGAACGCCCCCGGTCAAGCACTGATGCACTTATAGAAATGGGTGTCCGAACTGGTCTAAG

TTTGTTATTTGCACTTCTGAGACAGAGTTGGATGATGCCTGTGTCGGGACCTGGCCTCAGTCTCTGCAACGACGTTATTCATACTGCAATTGAAGTTGTG

AGCTCTTTGCCTCCATTGTCTTTAGCAAATGAAAGCAAGATTCCTCCTATGGGCTTGGACTGCTTATCACAAGTCACAACATTTCTTAAAGGAGTAACTA

TTCCCAATTCTGGGGCAGACACTTTAGGTCGTCGATTAGCTTCTGAGTTGCTGCTTGGCTTAGCAGCTCAGAGAGGCTCTTTGAGATACCTTCTTGAATG

GATAGAAATGGCTTTGGGGGCTTCAGCAGTTGTATATACTATGGAGAAAAACAAACTACTGTCAAGCCAGGAAGGAATGATCAGCTTTGACTGCTTTATG

GCTATATTAATGCAGATGAGGCGATCTTTG GGTTCATCTGCTGATCGGAGTCAGTGGAGAGAACCAACTAGAACATCTGAAGGCTTATGTTCACTCTAT

GAGGCAGCATTATGTCTTTTTGAAGAG - UNI35616 - GTTTGCAGAATGGCTTCTGACTATTCAAGAACATGTGCTAGCCCAGATAGCATTCAGAC

TGGTGATGCTCCCATTGTTTCTGAAACCTGTGAGGTATATGTTTGGGGCAGCAACAGCAGCCATCAGTTGGTAGAAGGCACACAGGAGAAAATACTACAA

CCCAAACTGGCTCCTAGTTTCTCTGATGCTCAGACC ATTGAAGCTGGACAGTATTGCACTTTTGTCATTTCTACAGATGGCTCTGTCAGAGCTTGTGGT

AAAGGCAGCTATGGGAGACTGGGCCTCGGAGATTCCAATAATCAGTCTACCTTAAAAAAGTTAACTTTTGAGCCTCACCGGTCTATTAAGAAAGTTTCAT

CATCTAAAGGCTCTGATGGTCACACTTTAGCTTTTACTACAGAAGGTGAAGTCTTCAGCTGGGGAGATGGTGACTATGGGAAACTAGGACATGGGAATAG

TTCAACCCAGAAATACCCCAAGCTTATCCAGGGCCCACTACAGGGAAAG GTAGTAGTTTGTGTATCAGCTGGATACAGACATAGTGCTGCTGTCACAGA

GGATGGTGAATTATATACTTGGGGTGAAGGAGACTTCGGAAGATTAG GTCATGGTGACAGTAACAGCCGTAACATTCCAACATTAGTAAAAGACATTAG

CAATGTAGGAGAGGTTTCTTGTGGAAGTTCACATACGATTGCTTTGTCCAAAGATGGAAGAACTGTGTGGTCTTTTGGAGGAGGTGACAATG GCAAACT

TGGCCATGGTGATACCAACAGAGTATATAAACCTAAAGTTATTGAAGCTTTACAAGGAATGTTTATTCGCAAAGTCTGTGCTGGGAGCCAATCTTCACTT

GCTTTGACATCAACAGGGCAG GTTTATGCTTGGGGCTGTGGGGCTTGTCTAGGTTGTGGTTCTTCAGAAGCTACTGCTTTGAGACCCAAGCTCATTGAA

GAATTGGCTGCCACAAGAATAGTTGATATTTCAATTGGAGATAGTCATTGTTTGTCTCTTTCTCATG ATAATGAAGTTTATGCCTGGGGCAATAACTCC

ATGGGACAATGTGGCCAGGGAAATTCAACAGGTCCTATTACTAAACCAAAGAAAGTGAGTGGCTTAGATGGCATAGCTATTCAGCAGATCTCAGCTGGAA

CATCACATAGTTTGGCATGGACTGCTCTCCCTAGGGACAG ACAAGTTGTTGCATGGCACAGGCCTTATTGTGTAGATCTTGAAGAGAGTACCTTCTCAC

ATCTGCGATCTTTTCTTGAGAGATACTGTGATAAAATAAACAGTGAGATTCCCCCACTGCCTTTCCCTTCATCAAG GGAACACCATAATTTTCTTAAGC

TGTGCCTGAAGTTGCTTTCAAATCACCTTGCTCTGGCCCTGGCGGGAGGGGTAGCTACCAGTATTCTTGGGAGACAGGCAGGTCCACTTCGAAATTTGCT

CTTCAGGCTGATGGATTCAACTGTCCCAGATGAAATCCAAGAG GTGGTAATTGAAACACTCTCAGTTGGAGCAACCATGCTGTTACCTCCATTAAGAGA

ACGGATGGAATTACTTCATTCTCTTTTACCCCAAGGACCTGATAGATGGGAAAGCTTATCCAAAGGACAG AGAATGCAACTGGATATAATTCTGACAAG

TTTACAAGATCATACCCATGTAGCCTCCCTTCTTGGCTATAGCTCACCTTCCGATGCTGCTGACCTTTCGACTGTGTGCATGGGATATGGAAACCTGTCA

GACCAACCATATGGTTCTCAGATCTGCCATCCAGACACTCATCTGGCCGAAATTTTAATGAAGACTCTCTTAAGAAATTTAGGGTTTTATACG GATCAA

GCATTTGGAGAGCTGGAAAAGAATAGTGATAAATATCTTCTTGGAACATCATCTTCAGAGAACAGTCAGCCTGCTCATCTTCATGAACTACTGTGTTCAT

TACAGAAACAACTGCTTGCTTTTTGCCACATTAATAATGTTACTGAG AACTCAAGCAGTGTGGCATTGCTTCATAAACATCTTCAGCTTTTGCTGCCTC

ATGCCACAGATATTTATTCACGTTCCGCAAATTTGCTTAAAGAAAGTCCTTGGAATGGCAGTGTTGGAGAAAAATTGAGAG ATGTGATCTATGTGTCAG

CTGCAGGCAGTATGCTCTGCCAGATTGTTAACTCCCTACTGTTACTCCCGGTGTCAGTGGCTCGTCCTTTATTGAGTTACCTCCTCGACCTCTTGCCACC

TCTTGATTGCCTTAATAGACTCCTGCCAGCTGCTGCTCTTTTAGAAGACCAAGAATTACAGTGGCCTCTTCATG GGGGGCCAGAAGTAATTGACCCAGC

TGGTGTGCCATTACCTCAACCAGCTCAATCTTGGGTATGGCTTGTGGATCTGGAAAGAACAATTGCTCTCCTCATTGGCCGATGTCTTGGTGGCATGCTT

CAAGGCTCCCCCGTGTCTCCAGAGGAACAGGATACTGCATATTGGATGAAAACACCACTTTTCAGTGATGGTGTGGAAATGGACACCCCTCAGTTGG AT

AAATGCATGAGCTGCCTACTAGAAGTAGCTCTTTCAGGAAATGAAGAACAGAAGCCTTTTGATTACAAATTGCGGCCTGAAGTTGCTGTCTATGTAGACT

TGGCATTGGGTTGTTCTAAAGAGCCTGCCAGAAGCCTTTGGATCAGCATGCAGGATTATGCTGTCAGTAAAG - UNI6246 - ATTGGGACAGTGCAA

CTTTAAGCAATGAGTCACTCTTGGACACTGTGTCTAGATTTGTTCTTGCAGCACTCCTCAAACACACAAACTTACTTAGTCAAGCATGTGGAGAAAGCCG

GCAA CCTGGTAAAAGCTTATCAGAAGTGTATCGTTGTGTATACAAAGTTCGAAGTCGTTTACTTGCTTGCAAGAACCTTGAACTTATTCAAACCAGGTC

ATCATCACGAGACAGATGG ATCACAGATAACCAAGACTCTGCAGATGTTGATCCTCAAGAACATTCCTTTACCCGGACCATTGATGAAGAAGCTGAAAT

GGAAGAGCTGGCTGAGAGAGACAGGGAGGATGGCCACCCAGAGCCAGAAGATGAGGAAGAAGAGCGGGAACATGAAGTAATGACAGCTGGCA AAATCTT

TCAGTGTTTCCTCTCAGCCCGAGAAGTAGCTCGTAGCCGAGACCGAGATAGAATGAACAGTGGGGCAGGGTCTGGGGTCCGAGCTGATGACCCACCTCCA

CAGTCTCAGCAGGAACGACGGGTCAGCACAGACCTTCCTGAGGGGCAGGATGTGTACACTGCTGCCTGCAACTCTGTGATCCACCGCTGTGCGCTCCTAA

TATTAGGAGTAAGTCCTGTGATTGATGAGCTTCAGAAGCGAAGGGAAGAAGGGCAGTTACAGCAGCCTTCTGTGAGTGCCTCTGAAGGCACCGGACTTAT

GACCAG GAGTGAAAGTCTCACTGCTGAGAGCCGCCTAGTTCATGCAAGTCCAAGTTACAGACTGATCAAATCAAGGAGTGAATCTGATCTGTCTCAGCC

TGAATCAGATGAAGAGGGTTATGCACTG AGTGGCCGACGAAATGTTGACTTTGATTTGGCATCATCTCATAGGAAGAGAG GTCCTATGCATAGTCAAT

TAGAATCTCTGAGTGACTCTTGGACTCGCCTGAAACATACCAGAGATTGGTTCTACAACTCCTCTTACTCCTTTGAGTCTGATTTTGACCTTACCAAGTC

TTTGGGAGTTCACACATTGATTGAAAATGTTGTAAGTTTTGTGAGTGGAGATGTGGGGAATGCTCCAGGTTTCAAGGAGCCGGAAGAAAGTATGTCTACA

AGTCCCCAGGCCTCCATCATTGCAATGGAGCAGCAACAGCTAAGGGCAGAG CTTCGTTTGGAGGCACTTCACCAGATCCTCACTCTATTGTCTGGCATG

GAAGAAAAGGGTAACGTCTTGCTGACAGGAAGCAGATCAAGTTCAGGCTTTCAGTCATCCACCCTGCTCACATCTGTGAGATTGCAGTTTCTAGCAGGGT

GTTTCGGTTTAGGTACTGTTGGACACACGGGAACCAAAGGAGAGAGTGGCCGACTGCATCACTATCAA GATGGAATCAGAGCAGCTAAAAGAAATATTC

AGGTTGAAATCCAGGTGGCTGTTCATAAGATTTACCAACAATTGTCTGCTACTCTAGAGAGAGCCCTTCAAGCAAATAAGCATCACATTG AAGCCCAGC

AACGTCTTCTTTTGGTTACAGTTTTTGCCCTCAGTGTTCATTATCAGCCAGTTGATGTTTCTTTGGCAATTTCCACTGGTCTGCTGAATGTATTGTCACA

GTTATGTGGAACAGACACCATGTTAGGACAGCCCCTGCAATTGTTGCCAAAGACTGGTGTTTCTCAGCTTAGCACAGCTTTGAAAGTGGCCAGTACAAGG

TTGCTCCAGATTCTAGCCATCACTACTGG AACTTATGCTGATAAACTGAGCCCCAAAGTAGTTCAGTCCTTGTTGGATCTACTCTGTAGTCAACTGAAG

AATTTATTGTCTCAAACTGGTGTGCTGTTTATGGCCTCTTTTGGAGAAGGAGAAGAGGGTGAAGAAGAAGAAAAAAAAGTTGACTCCAGTGGAGAAGCTG

AGAAGAGAGATTTCAGAG CTGCTCTTAGGAAACAACATGCAGCTGAACTCCATCTAGGGGATTTTTTAGTTTTTCTTCGCAGAGTTGTGTCTTCAAAAG

CAATTCAATCAAAAATGGCTTCCCCCAAATGGACAGAAGTGCTTCTAAACATAGCATCTCAGAAGTGTTCTTCAG GTATTCCTCTAGTTGGTAACTTAA

GAACGAGGCTTCTTGCTCTTCACGTCCTTGAGGCTGTGCTACCAGCTTGTGAATCTGGTGTAGAAGATGACCAAATGGCCCAG GTTGTTGAACGCTTAT

TTTCCCTTCTGTCGGATTGTATGTGGGAGACTCCCATTGCTCAGGCCAAACATGCTATTCAAATAAAGGAAAAAGAACAAGAAATAAAATTGCAG AAGC

AAGGAGAGTTGGAGGAAGAGGATGAAAATCTCCCCATACAAGAAGTCTCTTTTGATCCAGAGAAGGCTCAGTGTTGCATTGTGGAAAATGGACAGATTTT

AACTCATGGCAGTGGAGGAAAAGGATATGGATTAGCATCTACTGGGGTGACTTCTGGGTGCTATCAGTGGAAG TTTTATATTGTGAAGGAAAATAGAGG

TAATGAAGGTACATGTGTTGGAGTCTCTCGCTGGCCAGTACATGATTTTAATCACCGCACTACCTCAGACATGTGGCTCTATAGGGCGTATAGTGGTAAC

CTCTATCACAATGGAGAACAGACTCTGACGCTGTCCAGCTTTACTCAAGGGGATTTTATTACCTGTGTGTTAGACATGGAAGCCAGGACCATTTCCTTTG

GGAAAAATGGAGAG GAACCCAAATTAGCCTTTGAAGATGTGGATGCAGCAGAGTTGTACCCATGTGTAATGTTCTATAGCAGCAACCCGGGTGAGAAG 

GTGAAAATTTGTGATATGCAGATGCGTGGCACGCCACGGGATTTACTTCCAGGAGACCCTATTTGTAGTCCAGTAGCAGCAGTGCTAGCTGAAGCCACTA

TTCAGCTTATCCGTATCCTTCACCGGACAGACCGTTGGACTTACTGTATTAATAAAAAGATGATGGAAAGACTCCATAAAATTAAGATATGTATTAAAGA

GTCAGGTCAGAAGCTAAAGAAAAGCCGCTCGGTTCAAAGCCGAGAGGAAAATGAAATGAGAGAGGAGAAAGAGAACAAAGAAGAAGAGAAAGGTAAACAT

AACAGGCATGGTCTTGCTGACCTCTCAGAACCGCAGCTGAGGACTCTTTGCATAGAGGTGTGGCCCGTGCTAGCAGTAATAGGAGGAGTTGATGCTGGTC

TTAGAGTTGGAGGTCGGTGTGTTCACAAGCAAACTGGGCGCCATGCCACGCTGCTGGGAGTGGTCAAAGAAGGCAGCACATCTGCCAAGGTCCAGTGGGA

TGAAGCAGAGATTACCATCAG CTTCCCAACTTTTTGGTCGCCTAGTGATACTCCATTATATAACCTGGAACCCTGTGAACCACTGCTGTTTGATGTGGC

GCGATTCCGAGGCCTGACAGCTTCTGTGCTGCTGGACCTAACATATCTGACAGGCATTCATGAAGATGTGGGGAAACAGAGCATCAAGCGACATGAAAAG

AAACACCGTCATGAGTCTGAGGAAAAGGGGGACATTGAGCAGAAACCTGAGAGTGAATCCATTTTAGATGTGCGAACAGGGTTAATGTCTGATGATGTCA

AAAGTCAGGGTACCACAAGCTCCAAATCAGAAAATGAAATAGCTTCATTTTCTTTAGAACCAACACTGCCAGGTGTGGAATCCCAACATCAAATAACAGA

AGGAAAGAGAAAAAATCATGAACACATATCCAAAACCCATGACATAGCTCAGTCAGAAATCAGAGCAGTCCAGCTTTCCTATCTCTACCTCGGTGCTATG

AAGTCACTTAGTGCTCTTCTTGGCTGTAGTAAATATGCTGAGCTCTTGCTGATCCCAAAAGTTCTCGCTGAAAATGGCCACAACTCAGACTGTGCAAGTT

CTCCAGTTGTTCACGAGGATGTGGAGATGCGTGCTGCTCTACAGTTCCTGATGCGGCACATGGTGAAGCGAGCAGTCATGCGGTCACCCATAAAGCGAGC

ATTGGGGTTAGCTGATCTGGAACGGGCTCAAGCTATGATTTACAAACTGGTGGTCCATGGGCTTTTGGAAGACCAGTTTGGGGGCAAAATTAAGCAAG A

GATTGATCAACAAGCTGAAGAAAGTGACCAAGCACAGCAGGCACAGACACCAGTGACCACCAGCCCGTCAGCATCCAGCACAACTTCCTTTATGAGCAGC

TCCCTCGAGGACACCACAACTGCCACCACTCCTGTCACGGACACAGAAACGGTGCCTGCATCTGAGTCCCCTGGAGTGATGCCACTTAGTCTTCTCAG G

CAAATGTTTTCTAGTTATCCAACTACCACTGTACTTCCTACACGCCGTGCACAGACTCCTCCAATATCATCATTACCAGCCTCTCCTTCTGATGAAGTAG

GAAGAAGGCAGAGTTTAACTTCTCCAGATTCCCAGTCTACAAGGCCAGCTAATCGCACAG CCTTGTCAGACCCAAGCAGCAGACTTTCAACGTCCCCTC

CTCCCCCAGCAATTGCAGTCCCTTTACTGGAAATGGGCTTCTCTCTTCGACAGATTGCTAAAGCCATGGAAGCTACAG GTGCTCGAGGAGAGGCTGATG

CCCAGAGCATCACTGTTCTTGCTATGTGGATGATAGAGCACCCTGGGCATGAGGATGAGGAGGAGCCCCAGCCCAGCAGCACAGCAGACTCCAGACATGG

AGCGACAGTTCTGGGAAGTGGTGGGAAGTCAAATGATCCCTGTTATTTACAGTCACCAGGAGACATACCATCAGCTGATGCTGCTGAAATGGAGGAAGGC

TTTAGTGAAAG CCCTGATAATCTGGATCATACAGAGAACGCAGCTTCTGGAAGTGGACCACCAACTAGAGGTCGCTCAACAGTAACAAGAAGGCACAAA

TTTGACTTAGCAGCGCGCACTCTGCTTGCAAGAGCAG CGGGGTTATACCGCTCTGTGCAGGCTCACAGGAATCAAAGTCGGAGAGAAGGAATATCTTTG

CAGCAAGACCCAGGGGCGTTGTATGACTTTAATTTAGATGAGGAATTGGAAATTGATCTTGATGATGAAGCAATGGAAGCTATGTTTGGACAAGACCTGA

CCAGTGACAATGATATTCTGGGAATGTGGATCCCAGAGGTACTGGATTGGCCTACCTGG - UNI7800 - CATGTTTGTGAGTCTGAAGACAGGGAAG

AAGTGGTCGTGTGTGAACTTTGCGAATGCAATGTCGTCAGCTTCAACCAGCACATGAAGAGAAACCACCCTGGCTGTGGGCGCAGTGCAAACCGCCAGGG

GTATCGCAGCAATGGCTCCTATGTGGATGGCTGGTTTGGTGGTGAATGTGGGAGTGGAAATCCATACTACCTGCTGTGTGGCAGCTGCAGGGAGAAGTAC

TTAGCCCTGAAGACCAAGACCAAGACTACAAATTCTGAAAG GTACAAGGGACAAGCCCCAGATCTAATTGGCAAGCAGGACAGTGTGTATGAAG AAGA

CTGGGACATGTTAGATGTTGATGAAGATGAAAAACTAACAGGTGAAGAAGAATTTGAATTGCTTGCTGGACCACTTGGTTTAAATGACCGGCGCATTGTG

CCAGAACCAGTTCAGTTCCCTGACAGTGACCCCCTGGGAGCATCAGTAGCAATGGTCACAGCTACCAACAGTATGGAAGAGACTTTGATGCAAATTG GT

TGCCATGGGTCTGTGGAAAAGAGTTCCTCTGGGAGAGTAACATTAGGAGAGCAGGCAGCAGCCCTTGCAAATCCTCATGACCGAGTAGTGGCTTTAAGGA

GGGTGACTGCTGCCGCTCAAGTCCTTCTGGCCAGAACCATGGTCATGCGAGCACTGTCTCTTCTCTCAGTCAG TGGTTCCAGTTGTAGCCTGGCTGCTG

GTCTCGAGTCTCTGGGGTTAACAGATATCCGTACACTGGTTCGGTTAATGTGCTTAGCTGCAGCAGGGAGAGCTGGCCTTTCCACCAGCCCTTCTGCCAT

AGCCAGTACCTCAGAACGTTCACGAGGTGGACACAGTAAGGCCAGCAAGCCCATTTCTTGCCTGGCCTACCTGAGCACAGCAGTGGGATGCCTGGCATCA

AATACTCCAAGTGCTGCAAAGCTGCTGGTCCAGCTGTGTACACAG AACTTGATTTCTGCTGCAACAGGTGTCAATCTCACCACAGTGGATGATCCCATT

CAGCGGAAGTTCCTACCAAGCTTTCTCCGTGGAATTGCTGAAGAGAATAAGCTTGTAACATCCCCAAACTTTGTTGTAACTCAAGCCCTTGTGGCATTAT

TGGCAGACAAAGGGGCCAAACTGAGACCTAACTATGATAAGACAGAAATAGAAAAAAAAG GCCCTCTGGAGCTGGCTAATGCCCTGGCAGCCTGCTGCC

TCTCCTCTAGGCTATCCTCACAGCATAGGCAATGGGCTGCTCAACAACTTGTGCGCACTCTTGCTGCACATGACCGTGACAACCAAACTGCTCCACAAAC

ACTTGCTGATATGGGAGGAGATCTCAGAAAATGCTCTTTTATCAAGTTGGAGGCTCACCAAAACAGA GTAATGACATGTGTTTGGTGTAATAAAAAAGG

CCTATTGGCTACCAGTGGCAATGATGGCACTATTCGGGTGTGGAATGTTACCAAGAAGCAATACTCACTACAGCAAACCTGTGTGTTCAATAGACT GGA

AGGGGATGCTGAGGAAAGCCTTGGGTCACCCAGCGACCCAAGCTTCTCACCTGTTTCCTGGAGTATCAGTGGCAAATACCTTGCTGGGGCTTTGGAGAAG

ATGGTGAACATCTGGCAAGTTAATG GAGGAAAAGGATTAGTAGATATTCAGCCTCACTGGGTATCTGCCCTGGCCTGGCCAGAAGAAGGTCCAGCTACA

ACCTGGTCAGGAGAGTCTCCAGAGTTACTGCTGGTGGGACGGATGGACGGATCTCTAGGACTGATTGAAGTTGTTGATGTGTCCACTATGCACCGCCGAG

AACTGGAACACTGCTATCGAAAGGATG TATCTGTCACCTGCATTGCTTGGTTCAGTGAAGACAGACCATTTGCAGTAGGTTATTTTGATGGAAAATTGT

TAATGGGAACAAAAGAACCACTTGAGAAAGGAGGCATTGTTCTTATTGATGCACATAAG GAAACTCTTGTTAGTATGAAATGGGACCCAACAGGCCATA

TTCTCATGACATGTGCCAAAGAAGAAAATGTGAAACTCTGGGGGCCTGTTTCAGGATGCTGGCGCTGTCTACATTCACTCTGCCATCCATCCACTGTAAA

TGGCATCGCCTGGTGCAGCCTTCCAGGGAAAGGATCCAAGATGCAGTTACTGATGGCCAC TGGCTGTCAAAATGGCTTAGTGTGTGTTTGGCGTATTCC

TCAAGATACCACACAGACCAGTATGACTAGCTCAGAAGGATGGTGGGACCAGGAATCAAATTGTCAG GATGGCTATAGGAAATCGGCAGGAGCCAAGTG

TGTTTATCAGCTGCGGGGACACATCACACCTGTTCGGACTGTGGCCTTCAGTTCTGATGGCTTGGCCTTGGTGTCTGGTGGACTTGGAGGGCTTATGAAC

ATTTGGTCTTTAAGG GATGGCTCTGTCTTGCAAACTGTTGTAATCGGCTCTGGAGCTATTCAGACCACAGTATGGATTCCAGAAGTTGGGGTAGCTGCC

TGCTCAAATAGATCAAAG GATGTTTTGGTTGTTAATTGCACAGCAGAATGGGCTTCTGCCAATCACATTTTAGCAACTTGTAGAACAGCCCTGAAACAA

CAGGGTGTTCTGGGATTAAACATGGCTCCCTGCATGAGAGCATTTCTGGAACGGCTACCCATGATGCTTCAAGAGCAATATGCCTATGAAAAG CCTCAT

GTAGTTTGTGGTGACCAGCTTGTTCATAGCCCCTACATGCAATGTTTGGCTTCCCTTGCTGTGGGACTTCATCTGGATCAGCTGTTGTGTAACCCTCCAG

TGCCACCACACCATCAGAACTGTCTCCCTGACCCTACATCCTGGAATCCCAATGAGTGGGCCTGGTTAGAATGCTTTTCAACCACTATCAAGGCTGCAGA

GGCCCTCACCAACGGAGCGCAGTTTCCAGAGTCATTTACTGTCCCAGATCTAGAGCCTGTTCCTGAGGATGAGCTAGTGCTCCTCATG GATAACAGCAA

GTGGATCAATGGCATGGATGAACAAATTATGTCTTGGGCAACTTCCAGACCTGAG GACTGGCACCTGGGAGGTAAATGTGATGTCTACTTATGGGGTGC

TGGCAGGCATGGACAGCTGGCCGAAGCTGGAAGAAACGTAATGGTACCAGCTACAGCGCCTTCATTCTCACAAGCCCAACAG GTTATATGTGGTCAGAA

CTGTACCTTTGTCATCCAGGCCAATGGGACAGTGTTGGCTTGTGGGGAAGGAAGTTATGGCAGATTAGGGCAAGGAAATTCAGACGACCTTCATGTGCTG

ACTGTGATTTCAGCCCTACAAG - UNI4352 - GTTTTGTGGTGACCCAGCTGGTGACTTCCTGTGGCTCAGATGGGCACTCAATGGCCTTGACAGAA

AGTGGTGAAGTCTTTAGCTGGGGAGATGGGGACTATGGCAAACTTGGCCATGGAAACAGTGACAGACAGCGGCGACCAAGGCAGATTGAGGCCTTACAAG

GAGAAGAAGTGGTACAG ATGTCTTGTGGCTTCAAGCATTCAGCAGTGGTAACATCGGATGGCAAACTCTTTACCTTTGGGAACGGTGACTATGGCCGTC

TGGGTCTTGGAAACACCTCTAACAAAAAACTTCCAGAGAGAGTGACCGCGCTGGAGGGATATCAGATTGGACAG GTGGCCTGTGGGCTGAACCACACCT

TGGCAGTGTCAGCAGATGGCTCCATGGTGTGGGCCTTTGGAGATGGAGACTATGGTAAACTGGGCCTAGGAAATTCCACAGCAAAGTCTTCTCCTCAG A

AAGTTGATGTCCTCTGTGGAATTGGAATTAAAAAGGTTGCTTGTGGAACTCAGTTTTCTGTGGCTTTGACTAAAGATGGTCATGTGTACACTTTTGGTCA

AG ACCGATTGATAGGCTTGCCTGAGGGACGTGCTCGTAATCATAATCGACCCCAACAAATCCCTGTCCTGGCTGGAGTGGTCATTGAAGATGTGGCTGT

TGGAGCTGAACACACACTTGCTTTGGCATCAACTGGAGATGTTTATGCCTGGGGTAGCAATTCAGAAGGGCAG CTTGGCCTAGGCCACACCAACCACGT

TCGAGAACCAACCCTGGTAACAGTTCTGCAAGGGAAAAACATCCGCCAGATTTCAGCGGGCCGTTGCCACAGTGCTGCATGGACAGCCCCACCAGTCCCA

CCAAGAGCACCAG GTGTGTCAGTGCCTCTCCAGTTGGGCCTGCCTGACGCAGTGCCCCCACAGTATGGGGCACTGAGAGAGGTGAGCATTCACACCGTG

CGAGCAAGGCTCCGGCTGCTCTACCACTTCTCTGACCTCATGTACTCATCATGGAGGTTGCTGAATCTCAGCCCCAACAACCAG AACAGCACATCTCAT

TACAATGCTGGAACATGGGGCATTGTACAGGGACAACTTCGGCCTCTGTTAGCTCCAAGAGTCTACACACTCCCCATGGTGCGCTCCATAGGGAAGACCA

TGGTTCAAGGCAAAAATTATGGGCCTCAGATTACTGTAAAGAGAATATCAACCAG AGGACGGAAGTGTAAGCCCATCTTTGTTCAAATAGCAAGACAAG

TGGTAAAGCTAAATGCATCAGACCTCCGTCTACCCTCCCGAGCATGGAAGGTTAAGCTGGTTGGAGAAGGAGCTGATGATGCTGGAGGAGTGTTTGATGA

CACTATCACAGAAATGTGCCAG GAACTTGAGACTGGAATAGTTGACCTTCTCATACCCTCTCCTAATGCTACTGCAGAAGTCGGTTACAACAGAGACAG

 - UNI30919 - GTTCCTTTTTAATCCCTCTGCCTGCCTTGATGAACATTTAATGCAGTTTAAGTTCTTAGGAATTTTAATGGGGGTTGCCATTCGTA

CAAAGAAGCCTCTGGACCTTCACCTGGCCCCTCTGGTGTGGAAACAGCTGTGCTGTGTCCCACTTACCCTGGAGGACCTGGAGGAAGTGGATCTGCTTTA

TGTGCAGACCCTTAACAGCATCCTTCACATTGAAGACAGTGGCATTACTGAAGAGAGTTTCCATGAG ATGATTCCTCTTGATTCTTTTGTCGGCCAGAG

TGCTGATGGTAAAATGGTCCCCATAATCCCTGGAGGAAATAGTATCCCACTCACGTTTTCCAATAGGAAGGAGTACGTAGAGAGGGCCATTGAGTATCGA

CTTCATGAGATGGACCGGCAG - UNI29253 - GTGGCTGCAGTTCGAGAAGGGATGTCCTGGATTGTCCCTGTACCATTGCTGTCCCTCCTCACGGC

CAAACAGCTGGAACAGATGGTGTGTGGGATGCCTGAGATCTGTGTGGATGTCCTGAAGAAGGTGGTGCGATACCGTGAGGTGGACGAGCAGCACCAGCTG

GTGCAGTGGCTCTGGCGCACACTGGAAGAGTTCTCCAATGAGGAGCGGGTGCTCTTCATGCGCTTCGTGTCTGGGCGCTCCCGACTGCCAGCCAACACCG

CTGACATTTCCCAGAGGTTTCAAATCATGAAGGTCGATAGG - UNI29195 - CCTTATGACAGTCTGCCTACCTCACAGACCTGCTTCTTCCAGCTG

CGGCTGCCCCCCTATTCCAGCCAGCTGGTCATGGCTGAGCGCTTGAGATATGCCATCAACAACTGCCGTTCAATTGACATGGACAACTACATGCTCTCAA

GAAATGTGGACAATGCAGAGGGCTCTGACACTGACTACTGACTGATGAGGGTGCTGTCACCCACCTTTCTCAATAATGCTCACTTCCAATTTGATGTTGG

TATACTTTTATGGTAACTACATAGATGTTTTAGGAACATAAGCTGATACAAACAGTGGCCACATTTAGTTACTTCAAATGAAACAAAGAAATTAGATGGT

TTTATTTTTCTGTGATTGTACAAAACAAAGAGCAGAAACTGCTCAGTCAGGTTTTCCTCTGTATTTTTTGGTCACTGTGGATAAGTTTGCATGGAGCCAT

TTTGGTGTATTTTTAGTTGAGAATGATACATTTTTGTAAGCCCACCCAGTGAACATGAAATTGTACATTGTGTATAATTGTTCATTAGAAAGGACAGTTT

TACATGAATATTCATATATTTATTTTGTTTTAGTTTGATTCGCCTGTGCAGGGTTCCTTATGCAGAGAAATAAAGCAGATTCAGG
Translation
MATMVPPVKLKWLEHLNSSWITEDSESIATREGVTVLYSKLISNKEVVPLPQQVLCLKGPQLPDFERESLSSDEQDHYLDALLSSQLALAKMVCSDSPFA

GALRKRLLVLQRVFYALSNKYHDKGKVKQQQHSPESSSGSADVHSVSERPRSSTDALIEMGVRTGLSLLFALLRQSWMMPVSGPGLSLCNDVIHTAIEVV

SSLPPLSLANESKIPPMGLDCLSQVTTFLKGVTIPNSGADTLGRRLASELLLGLAAQRGSLRYLLEWIEMALGASAVVYTMEKNKLLSSQEGMISFDCFM

AILMQMRRSL GSSADRSQWREPTRTSEGLCSLYEAALCLFEE - UNI35616 - VCRMASDYSRTCASPDSIQTGDAPIVSETCEVYVWGSNSSHQL

VEGTQEKILQPKLAPSFSDAQT IEAGQYCTFVISTDGSVRACGKGSYGRLGLGDSNNQSTLKKLTFEPHRSIKKVSSSKGSDGHTLAFTTEGEVFSWGD

GDYGKLGHGNSSTQKYPKLIQGPLQGK VVVCVSAGYRHSAAVTEDGELYTWGEGDFGRLG GHGDSNSRNIPTLVKDISNVGEVSCGSSHTIALSKDGR

TVWSFGGGDNG GKLGHGDTNRVYKPKVIEALQGMFIRKVCAGSQSSLALTSTGQ VYAWGCGACLGCGSSEATALRPKLIEELAATRIVDISIGDSHCL

SLSHD DNEVYAWGNNSMGQCGQGNSTGPITKPKKVSGLDGIAIQQISAGTSHSLAWTALPRDR RQVVAWHRPYCVDLEESTFSHLRSFLERYCDKINS

EIPPLPFPSSR REHHNFLKLCLKLLSNHLALALAGGVATSILGRQAGPLRNLLFRLMDSTVPDEIQE VVIETLSVGATMLLPPLRERMELLHSLLPQG

PDRWESLSKGQ RMQLDIILTSLQDHTHVASLLGYSSPSDAADLSTVCMGYGNLSDQPYGSQICHPDTHLAEILMKTLLRNLGFYT DQAFGELEKNSDK

YLLGTSSSENSQPAHLHELLCSLQKQLLAFCHINNVTE NSSSVALLHKHLQLLLPHATDIYSRSANLLKESPWNGSVGEKLRD DVIYVSAAGSMLCQI

VNSLLLLPVSVARPLLSYLLDLLPPLDCLNRLLPAAALLEDQELQWPLHG GGPEVIDPAGVPLPQPAQSWVWLVDLERTIALLIGRCLGGMLQGSPVSP

EEQDTAYWMKTPLFSDGVEMDTPQLD DKCMSCLLEVALSGNEEQKPFDYKLRPEVAVYVDLALGCSKEPARSLWISMQDYAVSKD - UNI6246 - D

WDSATLSNESLLDTVSRFVLAALLKHTNLLSQACGESRQ PGKSLSEVYRCVYKVRSRLLACKNLELIQTRSSSRDRW ITDNQDSADVDPQEHSFTRTI

DEEAEMEELAERDREDGHPEPEDEEEEREHEVMTAGK KIFQCFLSAREVARSRDRDRMNSGAGSGVRADDPPPQSQQERRVSTDLPEGQDVYTAACNSV

IHRCALLILGVSPVIDELQKRREEGQLQQPSVSASEGTGLMTR RSESLTAESRLVHASPSYRLIKSRSESDLSQPESDEEGYAL SGRRNVDFDLASSH

RKRG GPMHSQLESLSDSWTRLKHTRDWFYNSSYSFESDFDLTKSLGVHTLIENVVSFVSGDVGNAPGFKEPEESMSTSPQASIIAMEQQQLRAE LRLE

ALHQILTLLSGMEEKGNVLLTGSRSSSGFQSSTLLTSVRLQFLAGCFGLGTVGHTGTKGESGRLHHYQ DGIRAAKRNIQVEIQVAVHKIYQQLSATLER

ALQANKHHIE EAQQRLLLVTVFALSVHYQPVDVSLAISTGLLNVLSQLCGTDTMLGQPLQLLPKTGVSQLSTALKVASTRLLQILAITTG GTYADKLS

PKVVQSLLDLLCSQLKNLLSQTGVLFMASFGEGEEGEEEEKKVDSSGEAEKRDFRA AALRKQHAAELHLGDFLVFLRRVVSSKAIQSKMASPKWTEVLL

NIASQKCSSG GIPLVGNLRTRLLALHVLEAVLPACESGVEDDQMAQ VVERLFSLLSDCMWETPIAQAKHAIQIKEKEQEIKLQ KQGELEEEDENLPI

QEVSFDPEKAQCCIVENGQILTHGSGGKGYGLASTGVTSGCYQWK FYIVKENRGNEGTCVGVSRWPVHDFNHRTTSDMWLYRAYSGNLYHNGEQTLTLS

SFTQGDFITCVLDMEARTISFGKNGE EPKLAFEDVDAAELYPCVMFYSSNPGEK VKICDMQMRGTPRDLLPGDPICSPVAAVLAEATIQLIRILHRTD

RWTYCINKKMMERLHKIKICIKESGQKLKKSRSVQSREENEMREEKENKEEEKGKHNRHGLADLSEPQLRTLCIEVWPVLAVIGGVDAGLRVGGRCVHKQ

TGRHATLLGVVKEGSTSAKVQWDEAEITIS SFPTFWSPSDTPLYNLEPCEPLLFDVARFRGLTASVLLDLTYLTGIHEDVGKQSIKRHEKKHRHESEEK

GDIEQKPESESILDVRTGLMSDDVKSQGTTSSKSENEIASFSLEPTLPGVESQHQITEGKRKNHEHISKTHDIAQSEIRAVQLSYLYLGAMKSLSALLGC

SKYAELLLIPKVLAENGHNSDCASSPVVHEDVEMRAALQFLMRHMVKRAVMRSPIKRALGLADLERAQAMIYKLVVHGLLEDQFGGKIKQE EIDQQAEE

SDQAQQAQTPVTTSPSASSTTSFMSSSLEDTTTATTPVTDTETVPASESPGVMPLSLLR RQMFSSYPTTTVLPTRRAQTPPISSLPASPSDEVGRRQSL

TSPDSQSTRPANRTA ALSDPSSRLSTSPPPPAIAVPLLEMGFSLRQIAKAMEATG GARGEADAQSITVLAMWMIEHPGHEDEEEPQPSSTADSRHGAT

VLGSGGKSNDPCYLQSPGDIPSADAAEMEEGFSES SPDNLDHTENAASGSGPPTRGRSTVTRRHKFDLAARTLLARAA AGLYRSVQAHRNQSRREGIS

LQQDPGALYDFNLDEELEIDLDDEAMEAMFGQDLTSDNDILGMWIPEVLDWPTW - UNI7800 - HVCESEDREEVVVCELCECNVVSFNQHMKRNHP

GCGRSANRQGYRSNGSYVDGWFGGECGSGNPYYLLCGSCREKYLALKTKTKTTNSER RYKGQAPDLIGKQDSVYEE EDWDMLDVDEDEKLTGEEEFEL

LAGPLGLNDRRIVPEPVQFPDSDPLGASVAMVTATNSMEETLMQIG GCHGSVEKSSSGRVTLGEQAAALANPHDRVVALRRVTAAAQVLLARTMVMRAL

SLLSVS SGSSCSLAAGLESLGLTDIRTLVRLMCLAAAGRAGLSTSPSAIASTSERSRGGHSKASKPISCLAYLSTAVGCLASNTPSAAKLLVQLCTQ N

LISAATGVNLTTVDDPIQRKFLPSFLRGIAEENKLVTSPNFVVTQALVALLADKGAKLRPNYDKTEIEKKG GPLELANALAACCLSSRLSSQHRQWAAQ

QLVRTLAAHDRDNQTAPQTLADMGGDLRKCSFIKLEAHQNR VMTCVWCNKKGLLATSGNDGTIRVWNVTKKQYSLQQTCVFNRL LEGDAEESLGSPSD

PSFSPVSWSISGKYLAGALEKMVNIWQVNG GGKGLVDIQPHWVSALAWPEEGPATTWSGESPELLLVGRMDGSLGLIEVVDVSTMHRRELEHCYRKDV 

VSVTCIAWFSEDRPFAVGYFDGKLLMGTKEPLEKGGIVLIDAHK ETLVSMKWDPTGHILMTCAKEENVKLWGPVSGCWRCLHSLCHPSTVNGIAWCSLP

GKGSKMQLLMAT TGCQNGLVCVWRIPQDTTQTSMTSSEGWWDQESNCQ DGYRKSAGAKCVYQLRGHITPVRTVAFSSDGLALVSGGLGGLMNIWSLR 

DGSVLQTVVIGSGAIQTTVWIPEVGVAACSNRSK DVLVVNCTAEWASANHILATCRTALKQQGVLGLNMAPCMRAFLERLPMMLQEQYAYEK PHVVCG

DQLVHSPYMQCLASLAVGLHLDQLLCNPPVPPHHQNCLPDPTSWNPNEWAWLECFSTTIKAAEALTNGAQFPESFTVPDLEPVPEDELVLLM DNSKWIN

GMDEQIMSWATSRPE DWHLGGKCDVYLWGAGRHGQLAEAGRNVMVPATAPSFSQAQQ VICGQNCTFVIQANGTVLACGEGSYGRLGQGNSDDLHVLTV

ISALQG - UNI4352 - GFVVTQLVTSCGSDGHSMALTESGEVFSWGDGDYGKLGHGNSDRQRRPRQIEALQGEEVVQ MSCGFKHSAVVTSDGKLFT

FGNGDYGRLGLGNTSNKKLPERVTALEGYQIGQ VACGLNHTLAVSADGSMVWAFGDGDYGKLGLGNSTAKSSPQ KVDVLCGIGIKKVACGTQFSVALT

KDGHVYTFGQD DRLIGLPEGRARNHNRPQQIPVLAGVVIEDVAVGAEHTLALASTGDVYAWGSNSEGQ LGLGHTNHVREPTLVTVLQGKNIRQISAGR

CHSAAWTAPPVPPRAPG GVSVPLQLGLPDAVPPQYGALREVSIHTVRARLRLLYHFSDLMYSSWRLLNLSPNNQ NSTSHYNAGTWGIVQGQLRPLLAP

RVYTLPMVRSIGKTMVQGKNYGPQITVKRISTR RGRKCKPIFVQIARQVVKLNASDLRLPSRAWKVKLVGEGADDAGGVFDDTITEMCQ ELETGIVDL

LIPSPNATAEVGYNRDR - UNI30919 - RFLFNPSACLDEHLMQFKFLGILMGVAIRTKKPLDLHLAPLVWKQLCCVPLTLEDLEEVDLLYVQTLNS

ILHIEDSGITEESFHE MIPLDSFVGQSADGKMVPIIPGGNSIPLTFSNRKEYVERAIEYRLHEMDRQ - UNI29253 - VAAVREGMSWIVPVPLLS

LLTAKQLEQMVCGMPEICVDVLKKVVRYREVDEQHQLVQWLWRTLEEFSNEERVLFMRFVSGRSRLPANTADISQRFQIMKVDR - UNI29195 - PY

DSLPTSQTCFFQLRLPPYSSQLVMAERLRYAINNCRSIDMDNYMLSRNVDNAEGSDTDY

For any suggestions or comments, please send an email to unitrap@crg.es