The residues that make up the domains are coloured to make the domain ranges clearer, each domain colour alternates between blue and green. The linker regions are displayed in black
and the mutations are displayed in red. The sequence for MCP is that of the major isoform (Isoform A).
FH Protein Sequence
MRLLAKIICL 
MLWAICVAED 
CNELPPRRNT 
EILTGSWSDQ 
TYPEGTQAIY 
KCRPGYRSLG  60
NVIMVCRKGE 
WVALNPLRKC 
QKRPCGHPGD 
TPFGTFTLTG 
GNVFEYGVKA 
VYTCNEGYQL  120
LGEINYRECD 
TDGWTNDIPI 
CEVVKCLPVT 
APENGKIVSS 
AMEPDREYHF 
GQAVRFVCNS  180
GYKIEGDEEM 
HCSDDGFWSK 
EKPKCVEISC 
KSPDVINGSP 
ISQKIIYKEN 
ERFQYKCNMG  240
YEYSERGDAV 
CTESGWRPLP 
SCEEKSCDNP 
YIPNGDYSPL 
RIKHRTGDEI 
TYQCRNGFYP  300
ATRGNTAKCT 
STGWIPAPRC 
TLKPCDYPDI 
KHGGLYHENM 
RRPYFPVAVG 
KYYSYYCDEH  360
FETPSGSYWD 
HIHCTQDGWS 
PAVPCLRKCY 
FPYLENGYNQ 
NHGRKFVQGK 
SIDVACHPGY  420
ALPKAQTTVT 
CMENGWSPTP 
RCIRVKTCSK 
SSIDIENGFI 
SESQYTYALK 
EKAKYQCKLG  480
YVTADGETSG 
SIRCGKDGWS 
AQPTCIKSCD 
IPVFMNARTK 
NDFTWFKLND 
TLDYECHDGY  540
ESNTGSTTGS 
IVCGYNGWSD 
LPICYERECE 
LPKIDVHLVP 
DRKKDQYKVG 
EVLKFSCKPG  600
FTIVGPNSVQ 
CYHFGLSPDL 
PICKEQVQSC 
GPPPELLNGN 
VKEKTKEEYG 
HSEVVEYYCN  660
PRFLMKGPNK 
IQCVDGEWTT 
LPVCIVEEST 
CGDIPELEHG 
WAQLSSPPYY 
YGDSVEFNCS  720
ESFTMIGHRS 
ITCIHGVWTQ 
LPQCVAIDKL 
KKCKSSNLII 
LEEHLKNKKE 
FDHNSNIRYR  780
CRGKEGWIHT 
VCINGRWDPE 
VNCSMAQIQL 
CPPPPQIPNS 
HNMTTTLNYR 
DGEKVSVLCQ  840
ENYLIQEGEE 
ITCKDGRWQS 
IPLCVEKIPC 
SQPPQIEHGT 
INSSRSSQES 
YAHGTKLSYT  900
CEGGFRISEE 
NETTCYMGKW 
SSPPQCEGLP 
CKSPPEISHG 
VVAHMSDSYQ 
YGEEVTYKCF  960
EGFGIDGPAI 
AKCLGEKWSH 
PPSCIKTDCL 
SLPSFENAIP 
MGEKKDVYKA 
GEQVTYTCAT  1020
YYKMDGASNV 
TCINSRWTGR 
PTCRDTSCVN 
PPTVQNAYIV 
SRQMSKYPSG 
ERVRYQCRSP  1080
YEMFGDEEVM 
CLNGNWTEPP 
QCKDSTGKCG 
PPPPIDNGDI 
TSFPLSVYAP 
ASSVEYQCQN  1140
LYQLEGNKRI 
TCRNGQWSEP 
PKCLHPCVIS 
REIMENYNIA 
LRWTAKQKLY 
SRTGESVEFV  1200
CKRGYRLSSR 
SHTLRTTCWD 
GKLEYPTCAK 
R FH Nucleotide Sequence
ATGAGACTTC TAGCAAAGAT TATTTGCCTT ATGTTATGGG CTATTTGTGT AGCAGAAGAT 60
M R L L A K I I C L M L W A I C V A E D 20
TGCAATGAAC TTCCTCCA
AG AAGAAATACA GAAATTCTGA CAGGTTCCTG GTCTGACCAA 120
C N E L P P R R N T E I L T G S W S D Q 40
ACATATCCAG AAGGCACCCA GGCTATCTAT AAATGCC
GCC CTGGATATAG ATCTCTTGGA 180
T Y P E G T Q A I Y K C R P G Y R S L G 60
AAT
GTAATAA TGGTATGCAG GAAGGGAGAA TGGGTTGCTC TTAATCCATT A
AGGAAATGT 240
N V I M V C R K G E W V A L N P L R K C 80
CAGAAAAGGC CCTGTGGACA TCCTGGAGAT ACTCCTTTTG GTACTTTTAC CCTTACAGGA 300
Q K R P C G H P G D T P F G T F T L T G 100
GGAAATGTGT TTGAATATGG TGTAAAAGCT GTGTATACAT GTAATGAGGG
GTATCAATTG 360
G N V F E Y G V K A V Y T C N E G Y Q L 120
CTAGGTGAGA
TTAATTACC
G TGAATGTGAC ACAGATGGAT GGACCAATGA TATTCCTATA 420
L G E I N Y R E C D T D G W T N D I P I 140
TGTGAAGTTG TGAAGTGTTT ACCAGTGACA GCACCAGAGA ATGGAAAAAT TGTCAGTAGT 480
C E V V K C L P V T A P E N G K I V S S 160
GCAATGGAAC CAGATCGGGA ATACCATTTT GGACAAGCAG TACGGTTTGT ATGTAACTCA 540
A M E P D R E Y H F G Q A V R F V C N S 180
GGCTACAAGA TTGAAGGAGA TGAA
GAAATG CATTGTTCAG ACGATGGTTT TTGGAGTAAA 600
G Y K I E G D E E M H C S D D G F W S K 200
GAGAAACCAA AGTGTGTGGA AATTTCATGC AAATCCCCAG ATGTTATAAA TGGATCTCCT 660
E K P K C V E I S C K S P D V I N G S P 220
ATATCTCAG
A AGATTATTTA TAAGGAGAAT GAACG
ATTTC AATATAAATG TAACATGGGT 720
I S Q K I I Y K E N E R F Q Y K C N M G 240
TATGAATACA GTGAAAGA
GG AGATGCTGTA TGCACTGAAT CTGGATGGCG TC
CGTTGCCT 780
Y E Y S E R G D A V C T E S G W R P L P 260
TCATGTGAAG AAAAATCATG TGATAATCCT TATATTCCAA ATGGTGACTA CTCACCTTTA 840
S C E E K S C D N P Y I P N G D Y S P L 280
AGGATTAAAC ACAGAACTGG AGATGAAATC ACGTACCAGT GTAGAAATGG TTTTTATCCT 900
R I K H R T G D E I T Y Q C R N G F Y P 300
GCAACCCGGG GAAATACAGC
CAAATGCACA AGTACTGGCT GGATACCTGC TCCGAGATGT 960
A T R G N T A K C T S T G W I P A P R C 320
ACCTTGAAAC CTTGTGATTA TCCAGACATT AAACA
TGGAG GTCTATATCA TGAGAATATG 1020
T L K P C D Y P D I K H G G L Y H E N M 340
CGTAGACCAT ACTTTCCAGT AGCTGTAGGA AAATATT
ACT CCTATTACTG TGATGAACAT 1080
R R P Y F P V A V G K Y Y S Y Y C D E H 360
TTTGAGACTC CGTCAGGAAG TTACTGGGAT CACATTCATT GCACACAAGA TGGATGGTCG 1140
F E T P S G S Y W D H I H C T Q D G W S 380
CCAGCAGTAC CATGCCTCAG AAAATGTTAT TTTCCTTATT TGGAAAATGG ATATAAT
CAA 1200
P A V P C L R K C Y F P Y L E N G Y N Q 400
AAT
CATGGAA GAAAGTTTGT ACAGGGTAAA TCTATAGACG TTGCCTGCCA TCCTGGCTAC 1260
N H G R K F V Q G K S I D V A C H P G Y 420
GCTCTTCCAA AAGCGCAGAC CACAGTTACA T
GTATGGAGA ATGGCTGGTC TCCTACTCCC 1320
A L P K A Q T T V T C M E N G W S P T P 440
AGATGCATCC GTGTCAAAAC ATGTTCCAAA TCAAGTATAG ATATTGAGAA TGGGTTTATT 1380
R C I R V K T C S K S S I D I E N G F I 460
TCTGAATCTC AGTATACATA TGCCTTAAAA GAAAAAGC
GA AATATCAATG CAAACTAGGA 1440
S E S Q Y T Y A L K E K A K Y Q C K L G 480
TATGTAACAG CAGATGGTGA AACATCAGGA TCAATTA
GAT GTGGGAAAGA TGG
ATGGTCA 1500
Y V T A D G E T S G S I R C G K D G W S 500
GCTCAACCCA CGTGCATTAA ATCTTGTGAT ATCCCAGTAT TTATGAATGC CAGA
ACTAAA 1560
A Q P T C I K S C D I P V F M N A R T K 520
AATGACTTCA CATGGTTTAA GCTGAATGAC ACATTGGACT ATGAATGCCA TGATGGTTAT 1620
N D F T W F K L N D T L D Y E C H D G Y 540
GAAAGCAATA CTGGAA
GCAC CACTGGTTCC ATAGTGTGTG GTTACAATGG TTGGTCTGAT 1680
E S N T G S T T G S I V C G Y N G W S D 560
TTACCCATAT GTTATGAAAG AGAATGCGAA CTTCCTAAAA TAGATGTACA CTTAGTTCCT 1740
L P I C Y E R E C E L P K I D V H L V P 580
GATCGCAAGA AAGACCAGTA TAAAGTTGGA GAGGTGTTGA AATTCTCCTG CAAACCAGGA 1800
D R K K D Q Y K V G E V L K F S C K P G 600
TTTACAATAG TTGGACCTAA TTCCGTTCAG TGCTACCACT TTGGATTGTC TCCTGACCTC 1860
F T I V G P N S V Q C Y H F G L S P D L 620
CCAATATGTA AAGAGCAAGT ACAATCATG
T GGTCCACCTC CTGAACTCCT CAATGGGAAT 1920
P I C K E Q V Q S C G P P P E L L N G N 640
GTTAAGGAAA AAACGAAAGA AGAATATGGA CACAGTGAAG TGGTGGAATA TTATTGCAAT 1980
V K E K T K E E Y G H S E V V E Y Y C N 660
CCTAGATTTC TAATGAAGGG ACCTAATAAA ATTCA
ATGTG TTGATGGAGA GTGGACAACT 2040
P R F L M K G P N K I Q C V D G E W T T 680
TTACCAGTGT GTATTGTGGA GGAGAGTACC TGTGGAGATA TACCTGAACT TGAACATGGC 2100
L P V C I V E E S T C G D I P E L E H G 700
TGGGCCCAGC TTTCTTCCCC TCCTTATTAC TATGGAGATT
CAGTGGAATT CAATTGCTCA 2160
W A Q L S S P P Y Y Y G D S V E F N C S 720
GAATCATTTA CAATGATTGG ACACAGATCA ATTACGTGTA TTCATGGAGT
ATGGACCCAA 2220
E S F T M I G H R S I T C I H G V W T Q 740
CTTCCCCAGT GTGTGGCAAT AGATAAACTT AAGAAGTGCA AATCATCAAA TTTAATTATA 2280
L P Q C V A I D K L K K C K S S N L I I 760
CTTGAGGAAC ATTTAAAAAA CA
AGAAGGAA TTCGATCATA ATTCTAACAT AAGGTACA
GA 2340
L E E H L K N K K E F D H N S N I R Y R 780
TGTAGAGGAA AAGAAGGATG GATACACACA GTCTGCATAA ATGGAAGATG GGATCCAGAA 2400
C R G K E G W I H T V C I N G R W D P E 800
GTGAACTGCT CAATGGCACA AATACAATTA TGCCCACCTC CACCTCAGAT TCCCAATTCT 2460
V N C S M A Q I Q L C P P P P Q I P N S 820
CACAATATGA CAACCACACT GAATTATCGG GATGGAGAAA AA
GTATCTGT TCTTTGCCAA 2520
H N M T T T L N Y R D G E K V S V L C Q 840
GAAAATTATC TAATTCAGGA AGGAGAA
GAA ATTACATGCA AAGATGGAAG ATGGCAGTCA 2580
E N Y L I Q E G E E I T C K D G R W Q S 860
ATACCACTCT GTGTTGAAAA AATTCCATGT TCACAACCAC CTCAGATAGA ACA
CGGAACC 2640
I P L C V E K I P C S Q P P Q I E H G T 880
ATTAATTCAT CCAGGTCTTC ACAAGAAAGT TATGCAC
ATG GGACT
AAATT GAGTTATACT 2700
I N S S R S S Q E S Y A H G T K L S Y T 900
TGTGAGGGTG GTTTCAGGAT ATCTGAAGAA AATGAAACAA CA
TGCTACAT GGGAAAATGG 2760
C E G G F R I S E E N E T T C Y M G K W 920
AGTTCTCCAC CT
CAGTGTGA AGGCCTTCCT TGTAAATCTC CACCTGA
GAT TTCTCATGGT 2820
S S P P Q C E G L P C K S P P E I S H G 940
GTTGTAGCTC ACATGTCAGA CAGTTATCA
G TATGGAGAAG AAGTTA
CGTA CAAAT
GTTTT 2880
V V A H M S D S Y Q Y G E E V T Y K C F 960
GAAGGTTTTG GAATTGATGG GCCTGCA
ATT GCAAAAT
GCT TAGGAGAAAA ATG
GTCTCAC 2940
E G F G I D G P A I A K C L G E K W S H 980
CCTCCATCAT GCATAAAA
AC AGATTGTCTC AGTTTACCTA GCTTTGAAA
A TGCCATACCC 3000
P P S C I K T D C L S L P S F E N A I P 1000
ATGGGAGAGA AGAAGGAT
GT GTATAAG
GC
G G
GTGAGCAAG TGACTTACAC TTGTGCAACA 3060
M G E K K D V Y K A G E Q V T Y T C A T 1020
T
ATTACAAAA TGGATGGAGC CAGTAATGTA ACATGCATTA ATAGCAGATG GACAGGAAGG 3120
Y Y K M D G A S N V T C I N S R W T G R 1040
CCAACA
TGCA GAGACAC
CTC CTGTGTG
AAT CCGCCCACAG TACAAAATGC TTATATA
GTG 3180
P T C R D T S C V N P P T V Q N A Y I V 1060
TCGAGACAGA TGAGTAAATA TCCATCTGGT GAGAGAGTAC GTTAT
CAATG TAGGAGCCCT 3240
S R Q M S K Y P S G E R V R Y Q C R S P 1080
TATGAAATGT TTGGGGATGA AGAAGTGATG TGTTTAAATG GAAACTGGAC
GGAACCACCT 3300
Y E M F G D E E V M C L N G N W T E P P 1100
CAATGCAAAG ATTCTACAGG AAAATGTGGG CCCCCTCCAC CTATTGACAA TGGGG
ACATT 3360
Q C K D S T G K C G P P P P I D N G D I 1120
ACTTCATTCC CGTTGTCAGT ATATGCTCCA GCTTCATCAG
TTGAGTACCA ATGCCAGAAC 3420
T S F P L S V Y A P A S S V E Y Q C Q N 1140
TTG
TAT
CAAC TTGAGGGTAA CAAGCGAATA ACATGTAGAA ATGGACAA
TG GTCAGAACCA 3480
L Y Q L E G N K R I T C R N G Q W S E P 1160
CCAAA
ATG
CT TA
CATCCGTG TGTAATATCC CGA
GAAATTA TGGAAAATTA TAACATAGCA 3540
P K C L H P C V I S R E I M E N Y N I A 1180
TTAAG
GTGGA
CAGCCAAACA GAAG
CTTTAT T
CGAGAACAG
GTGAATCAG
T T
GAAT
TTGTG 3600
L R W T A K Q K L Y S R T G E S V E F V 1200
TGTAAACGGG
GATATCGTC
T TTCATCA
CGT T
CTCACACAT TG
CGA
ACAAC ATGTTGGGAT 3660
C K R G Y R L S S R S H T L R T T C W D 1220
GGGAAACTGG AGT
AT
CCAAC TTGTGCAAAA AGAT
AGAATC AATCATAAAG TGCACACCTT 3720
G K L E Y P T C A K R 1240
TATTCAGAAC TTTAGTATTA AATCAGTTCT CAATTTCATT TTTTATGTAT TGTTTTACTC 3780
1260
CTTTTTATTC ATACGTAAAA TTTTGGATTA ATTTGTGAAA ATGTAATTAT AAGCTGAGAC 3840
1280
CGGTGGCTCT CTT                                                     3853
                                                     1284
FI Protein Sequence
MKLLHVFLLF 
LCFHLRFCKV 
TYTSQEDLVE 
KKCLAKKYTH 
LSCDKVFCQP 
WQRCIEGTCV  60
CKLPYQCPKN 
GTAVCATNRR 
SFPTYCQQKS 
LECLHPGTKF 
LNNGTCTAEG 
KFSVSLKHGN  120
TDSEGIVEVK 
LVDQDKTMFI 
CKSSWSMREA 
NVACLDLGFQ 
QGADTQRRFK 
LSDLSINSTE  180
CLHVHCRGLE 
TSLAECTFTK 
RRTMGYQDFA 
DVVCYTQKAD 
SPMDDFFQCV 
NGKYISQMKA  240
CDGINDCGDQ 
SDELCCKACQ 
GKGFHCKSGV 
CIPSQYQCNG 
EVDCITGEDE 
VGCAGFASVA  300
QEETEILTAD 
MDAERRRIKS 
LLPKLSCGVK 
NRMHIRRKRI 
VGGKRAQLGD 
LPWQVAIKDA  360
SGITCGGIYI 
GGCWILTAAH 
CLRASKTHRY 
QIWTTVVDWI 
HPDLKRIVIE 
YVDRIIFHEN  420
YNAGTYQNDI 
ALIEMKKDGN 
KKDCELPRSI 
PACVPWSPYL 
FQPNDTCIVS 
GWGREKDNER  480
VFSLQWGEVK 
LISNCSKFYG 
NRFYEKEMEC 
AGTYDGSIDA 
CKGDSGGPLV 
CMDANNVTYV  540
WGVVSWGENC 
GKPEFPGFYT 
KVANYFDWIS 
YHVGRPFISQ 
YNV FI Nucleotide Sequence
ATGAAGCTTC TTCATGTTTT CCTGTTATTT CTGTGCTTCC ACTTAAGGTT TTGCAAGGTC 60
M K L L H V F L L F L C F H L R F C K V 20
ACTTATACAT CTCAAGAGGA TCTGGTGGAG AAAAAGTGCT TAGCAAAAAA ATATACTCAC 120
T Y T S Q E D L V E K K C L A K K Y T H 40
CTCTCCTGCG ATAAAGTCTT CTGCCAGCCA TGGCAGAGAT GCATTGAGGG CACCTGTGTT 180
L S C D K V F C Q P W Q R C I E G T C V 60
TGTAAACTAC CGTATCAGTG CCCAAAGAAT GGCACTGCAG TGTGTGCAAC TAACAGGAGA 240
C K L P Y Q C P K N G T A V C A T N R R 80
AGCTTCCCAA CATACTGTCA ACAAAAGAGT TTGGAATGTC TTCATCCAGG GACAAAGTTT 300
S F P T Y C Q Q K S L E C L H P G T K F 100
TTAAATAACG GAACATGCAC AGCCGAAGGA AAGTTTAGTG TTTCCTTGAA GCATGGAAAT 360
L N N G T C T A E G K F S V S L K H G N 120
ACAGATTCAG AGGGAATAGT TGAAGTAAAA CTTGTGGACC AAGATAAGAC AATGTTCATA 420
T D S E G I V E V K L V D Q D K T M F I 140
TGCAAAAGCA GCTG
GAGCAT GAGGGAAGCC AACGTGGCCT GCCTTGACCT TGGGTTTCAA 480
C K S S W S M R E A N V A C L D L G F Q 160
CAAGGTGCTG ATACTCAAAG AAGGTTTAAG TTGTCTGATC TCTCTATAAA TTCCACTGAA 540
Q G A D T Q R R F K L S D L S I N S T E 180
TGTCTACATG TGCATTGCCG AGGATTAGAG ACCAGTTTGG CTGAATGTAC TTTTACTAAG 600
C L H V H C R G L E T S L A E C T F T K 200
AGAAGAACTA TGGGTTACCA GGATTTCGCT GATGTGGTTT GTTATACACA GAAAGCAGAT 660
R R T M G Y Q D F A D V V C Y T Q K A D 220
TCTCCAATGG ATGACTTCTT TCAGTGTGTG AATGGGAAAT ACATTTCTCA GATGAAAG
CC 720
S P M D D F F Q C V N G K Y I S Q M K A 240
TGTGATGGTA TCAATGAT
TG TGGAGACCAA AGTGATGAAC TGTGTTGTAA A
GCATGCCAA 780
C D G I N D C G D Q S D E L C C K A C Q 260
G
GCAAAGGCT TCCATTGCAA ATC
GGGTGTT TGCATTCCAA GCCAGTATCA ATGCAATGGT 840
G K G F H C K S G V C I P S Q Y Q C N G 280
GAGGTGGACT GCATTACAGG GGAAGATGAA GTTGGCTGTG CAGGCTTTGC AT
CTGTG
GCT 900
E V D C I T G E D E V G C A G F A S V A 300
CAAGAA
GAAA CAGAAATTTT GACTGCTGAC ATGGATGCAG AAAGAAGA
CG GATAAAATCA 960
Q E E T E I L T A D M D A E R R R I K S 320
TTATTACCTA AACTATCTTG TGGAGTTAAA AACAGAATGC ACATTCGAAG GAAACGAATT 1020
L L P K L S C G V K N R M H I R R K R I 340
GTGGGAGGAA AGCGAGCACA ACTGGGAGAC CTCCCATGGC AGGTGGCAAT TAAGGATGCC 1080
V G G K R A Q L G D L P W Q V A I K D A 360
AGTGGAATCA CCTGTGGGGG AATTTATATT GGTGGCTGTT GGATTCTGAC TGCTGCACAT 1140
S G I T C G G I Y I G G C W I L T A A H 380
TGTCTCAGAG CCAGTAAAAC TCATCGTTAC CAAATATGGA CAACAGTAG
T AGACTGGATA 1200
C L R A S K T H R Y Q I W T T V V D W I 400
CACCCCGACC TTAAAC
GTAT AGTAATTGAA TACGTGGATA GAATTATTTT CCATGAAAAC 1260
H P D L K R I V I E Y V D R I I F H E N 420
TACAATGCAG GCACTTACCA AAATGACATC GCTTTGATTG AAATGAAAAA AGACGGAAAC 1320
Y N A G T Y Q N D I A L I E M K K D G N 440
A
AAAAAGATT GTGAGCTGCC TCGTTCCATC CCTGCCTGTG TCCCCTGGTC TCCTTACCTA 1380
K K D C E L P R S I P A C V P W S P Y L 460
TTCCAACCTA ATGATACATG CATCGTTTCT GGCTGGGGAC GAGAAAAAGA TAACGAAAGA 1440
F Q P N D T C I V S G W G R E K D N E R 480
GTCTTTTCAC TTCAGTGGGG TGAAGTTAAA CTAATAAGCA ACTGCTCTAA GTTTTACGGA 1500
V F S L Q W G E V K L I S N C S K F Y G 500
AATCGTTTCT ATGAAAAAGA AATGGAATGT GCAGGTACAT ATGATGGTTC CATCGATGCC 1560
N R F Y E K E M E C A G T Y D G S I D A 520
TGTAAAGGGG
ACTCTGGAGG CCCCTTAGTC TGTATGGATG CCAACAATG
T GACTTATGTC 1620
C K G D S G G P L V C M D A N N V T Y V 540
TGGGGTGTTG TGAGTT
GGGG G
GAAAACTGT GGAAAACCAG AGTTCCCAGG TTTTTACACC 1680
W G V V S W G E N C G K P E F P G F Y T 560
AAAGTGGCCA ATTATTTTGA CTGGATTAGC T
ACCATGTAG GAAGGCCTTT TATTTCTCAG 1740
K V A N Y F D W I S Y H V G R P F I S Q 580
TACAATGTAT AAAATTGTGA TCTCTCTCTT CATTCTATTC TTTTTCTCTC AAGAGTTCCA 1800
Y N V 600
TTTAATGGAA ATAAAACGGT ATAATTAATA ATTCTCTAGG GGGGAAAAAT GAAGCAAATC 1860
620
TCATTGGATA TTTTTAAAGG TCTCCACAGA GTTTATGCCA TATTGGAATT TTGTTGTATA 1920
640
ATTCTCAAAT AAATATTTTG GTGAAGCAT                                    1949
                                    650
MCP Protein Sequence
MEPPGRRECP 
FPSWRFPGLL 
LAAMVLLLYS 
FSDACEEPPT 
FEAMELIGKP 
KPYYEIGERV  60
DYKCKKGYFY 
IPPLATHTIC 
DRNHTWLPVS 
DDACYRETCP 
YIRDPLNGQA 
VPANGTYEFG  120
YQMHFICNEG 
YYLIGEEILY 
CELKGSVAIW 
SGKPPICEKV 
LCTPPPKIKN 
GKHTFSEVEV  180
FEYLDAVTYS 
CDPAPGPDPF 
SLIGESTIYC 
GDNSVWSRAA 
PECCRF 
PVVENGKQIS  240
GFGKKFYYKA 
TVMFECDKGF 
YLDGDTSIVC 
DSNSTWDPPV 
PKCLKVLPPS 
STKPPALSHS  300
VSTSSTTKSP 
ASSASGPRPT 
YKPPVSNYPG 
YPKPEEGILD 
SLDVWVIAVI 
VIAIVVGVAV  360
ICVVPYRYLQ 
RRKKKGTYLT 
DETHREVKFT 
SL MCP Nucleotide Sequence
ATGGAGCCTC CCGGCCGCCG CGAGTGTCCC TTTCCTTCCT GGCGCTTTCC TGGGTTGCTT 60
M E P P G R R E C P F P S W R F P G L L 20
CTGGCGGCCA TGGTGTTGCT GCTGTACTCC TTCTCCGATG CCT
GTGAGGA GCCACCAACA 120
L A A M V L L L Y S F S D A C E E P P T 40
TTTGAAGCTA TGGAGCTCAT TGGTAAACCA AAACCCTACT ATGAGATTGG TGAACGAGTA 180
F E A M E L I G K P K P Y Y E I G E R V 60
GATTATAAGT G
TAAAAAAGG ATACTTCTAT ATACCTCCTC TTGCCACCCA TACTATTTGT 240
D Y K C K K G Y F Y I P P L A T H T I C 80
GATCGGAATC ATACATGGCT ACCTGTCTCA GATGACGCCT GTTA
TAGAGA AACA
TGTCCA 300
D R N H T W L P V S D D A C Y R E T C P 100
TATATA
CGGG ATCCTTTAAA TGGCCAAGCA GTCCCTGCAA ATGGGACTTA CGAGTTTGGT 360
Y I R D P L N G Q A V P A N G T Y E F G 120
TATCAGATGC ACTTTATTTG TAATGAGGGT TATTACTTAA TTGGTGAAGA AATTCT
ATAT 420
Y Q M H F I C N E G Y Y L I G E E I L Y 140
TGTGAACTTA AAGGATCAGT AGCAATTTGG AGCGGTAAGC CCCCAATATG TGAAAAGGTT 480
C E L K G S V A I W S G K P P I C E K V 160
TTGTGTACAC CACCTCCAAA AATAAAAAAT GGAAAACACA CCTTTAGTGA AGTAGAAGTA 540
L C T P P P K I K N G K H T F S E V E V 180
TTTGAGTATC TT
GATGCAGT AACT
TATAGT TGTGATCCTG CACCT
GGACC AGATCCATTT 600
F E Y L D A V T Y S C D P A P G P D P F 200
TCACTTATTG GAGAGAGCAC GATTTATTGT GGTGACAATT CAGTGTGGAG TCGTGCTGCT 660
S L I G E S T I Y C G D N S V W S R A A 220
CCAGAGTGTA AAGTGGTCAA ATGTCGATTT CCAGTAGTCG AAAATGGAAA ACAGATA
TCA 720
P E C C R F P V V E N G K Q I S 240
GGAT
TTGGAA AAAAATTTTA CTA
CAA
AGCA ACAGTTATGT TTGAATGCGA TAAGGGTTTT 780
G F G K K F Y Y K A T V M F E C D K G F 260
TACCTCGATG GCAGC
GAC
AC AATTGTCTGT
GACA
GTAACA GTACTTGGGA TCCCCCAGTT 840
Y L D G D T S I V C D S N S T W D P P V 280
CCAAAGTGTC TTAAAGTGCT GCCTCCATCT AGTACAAAAC CTCCAGCTTT GAGTCATTCA 900
P K C L K V L P P S S T K P P A L S H S 300
GTGTC
GACTT CTTCCACTAC AAAATCTCCA GCGTCCAGTG CCTCAGGTCC TAGGCCTACT 960
V S T S S T T K S P A S S A S G P R P T 320
TACAAGCCTC CAGTCTCAAA TTATCCAGGA TATCCTAAAC CTGAGGAAGG AATACTTGAC 1020
Y K P P V S N Y P G Y P K P E E G I L D 340
AGTTTGGATG TTTGGGTCAT TGCTGTGATT GTTATTG
CCA TAGTTGTTGG AGTTGCAGTA 1080
S L D V W V I A V I V I A I V V G V A V 360
ATTTGTGTTG TCCCGTACAG ATATCTTCAA AGGAGGAAGA AGAAAG
GCAC ATACCTAACT 1140
I C V V P Y R Y L Q R R K K K G T Y L T 380
GATGAGACCC ACAGAGAAGT AAAATTTACT TCTCTCTGAG AAGGAGAGAT GAGAGAAAGG 1200
D E T H R E V K F T S L 400
TTTGCTTTTA TCATTAAAAG GAAAGCAGAT GGTGGAGCTG AATATGCCAC TTACCAGACT 1260
420
AAATCAACCA CTCCAGCAGA GCAGAGAGGC TGAATAGATT CCACAACCTG GTTTGCCAGT 1320
440
TCATCTTTTG ACTCTATTAA AATCTTCAAT AGTTGTTATT CTGTAGTTTC ACTCTCATGA 1380
460
GTGCAACTGT GGCTTAGCTA ATATTGCAAT GTGGCTTGAA TGTAGGTAGC ATCCTTTGAT 1440
480
GCTTCTTTGA AACTTGTATG AATTTGGGTA TGAACAGATT GCCTGCTTTC CCTTAAATAA 1500
500
CACTTAGATT TATTGGACCA GTCAGCACAG CATGCCTGGT TGTATTAAAG CAGGGATATG 1560
520
CTGTATTTTA TAAAATTGGC AAAATTAGAG AAATATAGTT CACAATGAAA TTATATTTTC 1620
540
TTTGTAAAGA AAGTGGCTTG AAATCTTTTT TGTTCAAAGA TTAATGCCAA CTCTTAAGAT 1680
560
TATTCTTTCA CCAACTATAG AATGTATTTT ATATATCGTT CATTGTAAAA AGCCCTTAAA 1740
580
AATATGTGTA TACTACTTTG GCTCTTGTGC ATAAAAACAA GAACACTGAA AATTGGGAAT 1800
600
ATGCACAAAC TTGGCTTCTT TAACCAAGAA TATTATTGGA AAATTCTCTA AAAGTTAATA 1860
620
GGGTAAATTC TCTATTTTTT GTAATGTGTT CGGTGATTTC AGAAAGCTAG AAAGTGTATG 1920
640
TGTGGCATTT GTTTTCACTT TTTAAAACAT CCCTAACTGA TCGAATATAT CAGTAATTTC 1980
660
AGAATCAGAT GCATCCTTTC ATAAGAAGTG AGAGGACTCT GACAGCCATA ACAGGAGTGC 2040
680
CACTTCATGG TGCGAAGTGA ACACTGTAGT CTTGTTGTTT TCCCAAAGAG AACTCCGTAT 2100
700
GTTCTCTTAG GTTGAGTAAC CCACTCTGAA TTCTGGTTAC ATGTGTTTTT CTCTCCCTCC 2160
720
TTAAATAAAG AGAGGGGTTA AACATGCCCT CTAAAAGTAG GTGGTTTTGA AGAGAATAAA 2220
740
TTCATCAGAT AACCTCAAGT CACATGAGAA TCTTAGTCCA TTTACATTGC CTTGGCTAGT 2280
760
AAAAGCCATC TATGTATATG TCTTACCTCA TCTCCTAAAA GGCAGAGTAC AAAGTA
AGCC 2340
780
ATGTATCTCA GGAAGGTAAC TTCATTTTGT CTATTTGCTG TTGATTGTAC CAAGGGATGG 2400
800
AAGAAGTAAA TATAGCTCAG GTAGCACTTT ATACTCAGGC AGATCTCAGC CCTCTACTGA 2460
820
GTCCCTTAGC CAAGCAGTTT CTTTCAAAGA AGCCAGCAGG CGAAAAGCAG GGACTGCCAC 2520
840
TGCATTTCAT ATCACACTGT TAAAAGTTGT GTTTTGAAAT TTTATGTTTA GTTGCACAAA 2580
860
TTGGGCCAAA GAAACATTGC CTTGAGGAAG ATATGATTGG AAAATCAAGA GTGTAGAAGA 2640
880
ATAAATACTG TTTTACTGTC CAAAGACATG TTTATAGTGC TCTGTAAATG TTCCTTTCCT 2700
900
TTGTAGTCTC TGGCAAGATG CTTTAGGAAG ATAAAAGTTT GAGGAGAACA AACAGGAATT 2760
920
CTGAATTAAG CACAGAGTTG AAGTTTATAC CCGTTTCACA TGCTTTTCAA GAATGTCGCA 2820
940
ATTACTAAGA AGCAGATAAT GGTGTTTTTT AGAAACCTAA TTGAAGTATA TTCAACCAAA 2880
960
TACTTTAATG TATAAAATAA ATATTATACA ATATACTTGT ATAGCAGTTT CTGCTTCACA 2940
980
TTTGATTTTT TCAAATTTAA TATTTATATT AGAGATCTAT ATATGTATAA ATATGTATTT 3000
1000
TGTCAAATTT GTTACTTAAA TATATAGAGA CCAGTTTTCT CTGGAAGTTT GTTTAAATGA 3060
1020
CAGAAGCGTA TATGAATTCA AGAAAATTTA AGCTGCAAAA ATGTATTTGC TATAAAATGA 3120
1040
GAAGTCTCAC TGATAGAGGT TCTTTATTGC TCATTTTTTA AAAAATGGAC TCTTGAAATC 3180
1060
TGTTAAAATA AAATTGTACA TTTGGAGATG TTTCA                             3215
                             1072