This page is concerned with a method for determining which
family a gene belongs to. Genes names are of the form XXXXn, where XXXX
is the recognized symbol for a gene family and n is a number, given in
order of gene discovery. Related issues may be found on the
Nomenclature Page.
ROBUST PROCEDURE:
The recommended procedure for identifying a gene's family is
based on phylogeny. Constructing phylogenies can be a bewildering world for the
new comer. See
PHYLOGENY PROGAMS for links to many methods. Here is our protocol in
brief:
-
Make an alignment of the new gene with representative
sequences from the expansin superfamily.
-
Inspect the alignment to make sure that it makes sense; if
not, adjust the alignment parameters or pick a different method or adjust by
hand. CLUSTAL W with default parameters works well in many, but not all cases.
Click HERE for a representative alignment. CLUSTAL W
is found in many software packages or you may try a
CLUSTAL W Web server.
-
Construct a phylogeny using established sequenced-based
methods, e.g. neighbor joining. There are many methods and programs for doing
this, such as
MEGA,
PHYLIP,
PAUP and, regretably for the neophyte,
numerous others. Some web servers are listed
here. My recommendation: use protein sequences with neighbor joining.
-
Examine the phylogeneic tree to valuate whether the new gene
falls within established families or is something new. If it falls outside the
established families,
special rules may apply. For genes that are
close to the root of the family, some bootstrap analysis and further work may
be needed. In most cases, however, the answer is clear.
QUICK AND DIRTY PROCEDURE:
To make life easier for those not routinely constructing
phylogenies, here is some help:
-
Add your protein sequence, in FASTA format, to the list of
representative sequences given HERE.
-
Submit the sequences to this
web site, which will return
to you a phylogeny of the sequences, based on neighbor joining. Choose the
Postscript option. A tree will be returned in Postscript (PS) format, which may be read by various graphics programs, including Acrobat
Professional and Adobe Illustrator. The downside of this web server is that the alignment is not
returned for inspection and no bootstrap values are given, so be cautious about bizarre results.
-
Determine whether your gene falls within the families
designated by the colored boxes in this
phylogeny. If the answer is clear, you are done. If not, then additional
analysis and discussion may be needed.
Representative sequences in FASTA format (with signal peptides removed):
>AtEXPA1
MTSHVNGYAGGGWVNAHATFYGGGDASGTMGGACGYGNLYSQGYGTNTAALSTALFNNGLSCGACFEIRCQNDGKWCLPGSIVVTATNFCPPNNALPNNAGGWCNPPQQHFDLSQPVFQRIAQYRAGIVPVAYRRVPCVRRGGIRFTINGHSYFNLVLITNVGGAGDVHSAMVKGSRTGWQAMSRNWGQNWQSNSYLNGQSLSFKVTTSDGQTIVSNNVANAGWSFGQTFTGAQLR
>AtEXPA2
DDNGGWERGHATFYGGADASGTMGGACGYGNLHSQGYGLQTAALSTALFNSGQKCGACFELQCEDDPEWCIPGSIIVSATNFCPPNFALANDNGGWCNPPLKHFDLAEPAFLQIAQYRAGIVPVAFRRVPCEKGGGIRFTINGNPYFDLVLITNVGGAGDIRAVSLKGSKTDQWQSMSRNWGQNWQSNTYLRGQSLSFQVTDSDGRTVVSYDVVPHDWQFGQTFEGGQF
>AtEXPA3
KIPGVYSGGPWQNAHATFYGGSDASGTMGGACGYGNLYSQGYGVNTAALSTALFNNGFSCGACFEIKCTDDPRWCVPGNPSILVTATNFCPPNFAQPSDDGGWCNPPREHFDLAMPMFLKIGLYRAGIVPVSYRRVPCRKIGGIRFTVNGFRYFNLVLVTNVAGAGDINGVSVKGSKTDWVRMSRNWGQNWQSNAVLIGQSLSFRVTASDRRSSTSWNVAPATWQFGQTFSGKNFRV
>AtEXPA4
RIPGIYSGGAWQNAHATFYGGSDASGTMGGACGYGNLYSQGYGTNTAALSTALFNNGMSCGACFELKCANDPQWCHSGSPSILITATNFCPPNLAQPSDNGGWCNPPREHFDLAMPVFLKIAQYRAGIVPVSYRRVPCRKRGGIRFTINGHRYFNLVLITNVAGAGDIVRASVKGSRTGWMSLSRNWGQNWQSNAVLVGQALSFRVTGSDRRTSTSWNMVPSNWQFGQTFVGKNFRV
>AtEXPA5
GYRRGGHHPGGHMGPWINAHATFYGGGDASGTMGGACGYGNLYSQGYGLETAALSTALFDQGLSCGACFELMCVNDPQWCIKGRSIVVTATNFCPPGGACDPPNHHFDLSQPIYEKIALYKSGIIPVMYRRVRCKRSGGIRFTINGHSYFNLVLVTNVGGAGDVHSVSMKGSRTKWQLMSRNWGQNWQSNSYLNGQSLSFVVTTSDRRSVVSFNVAPPTWSFGQTYTGGQFRY
>AtEXPA6
RIPGVYNGGGWETAHATFYGGSDASGTMGGACGYGNLYSQGYGVNTAALSTALFNNGFSCGACFELKCASDPKWCHSGSPSIFITATNFCPPNFAQPSDNGGWCNPPRPHFDLAMPMFLKIAEYRAGIVPVSFRRVPCRKRGGIRFTINGFRYFNLVLVTNVAGAGNIVRLGVKGTHTSWMTMSRNWGQNWQSNSVLVGQSLSFRVTSSDRRSSTSWNIAPANWKFGQTFMGKNFRV
>AtEXPA7
YYRPGPWRYAHATFYGDETGGETMGGACGYGNLFNSGYGLSTAALSTTLFNDGYGCGQCFQITCSKSPHCYSGKSTVVTATNLCPPNWYQDSNAGGWCNPPRTHFDMAKPAFMKLAYWRAGIIPVAYRRVPCQRSGGMRFQFQGNSYWLLIFVMNVGGAGDIKSMAVKGSRTNWISMSHNWGASYQAFSSLYGQSLSFRVTSYTTGETIYAWNVAPANWSGGKTYKSTANFR
>AtEXPA8
DDGGWQGGHATFYGGEDASGTMGGACGYGNLYGQGYGTNTAALSTALFNNGLTCGACYEMKCNDDPRWCLGSTITVTATNFCPPNPGLSNDNGGWCNPPLQHFDLAEPAFLQIAQYRAGIVPVSFRRVPCMKKGGIRFTINGHSYFNLVLISNVGGAGDVHAVSIKGSKTQSWQAMSRNWGQNWQSNSYMNDQSLSFQVTTSDGRTLVSNDVAPSNWQFGQTYQGGQF
>AtEXPA9
KIPGVYTGGPWINAHATFYGEADASGTMGGACGYGNLYSQGYGVNTAALSTALFNNGLSCGSCFELKCINDPGWCLPGNPSILITATNFCPPNFNQASDNGGWCNPPREHFDLAMPMFLSIAKYKAGIVPVSYRRIPCRKKGGIRFTINGFKYFNLVLVTNVAGAGDVIKVSVKGSNTQWLDLSRNWGQNWQSNALLVGQSLSFRVKTSDGRSSTSNNIAPSNWQFGQTYSGKNFRV
>AtEXPA10
YGGGWINAHATFYGGGDASGTMGGACGYGNLYSQGYGTSTAALSTALFNNGLSCGSCFEIRCENDGKWCLPGSIVVTATNFCPPNNALANNNGGWCNPPLEHFDLAQPVFQRIAQYRAGIVPVSYRRVPCRRRGGIRFTINGHSYFNLVLITNVGGAGDVHSAAIKGSRTVWQAMSRNWGQNWQSNSYLNGQALSFKVTTSDGRTVVSFNAAPAGWSYGQTFAGGQFR
>AtEXPA11
PSGLTNGHATFYGGSDASGTMGGACGYGDLYSAGYGTMTAALSTALFNDGASCGECYRITCDHAADSRWCLKGASVVITATNFCPPNFALPNNNGGWCNPPLKHFDMAQPAWEKIGIYRGGIVPVVFQRVSCYKKGGVRFRINGRDYFELVNIQNVGGAGSIKSVSIKGSKTGWLAMSRNWGANWQSNAYLDGQALSFSITTTDGATRVFLNVVPSSWSFGQIYSSNVQF
>AtEXPA12
SNGWIRAHATYYGVNDSPASLGGACGYDNPYHAGFGAHTAALSGELFRSGESCGGCYQVRCDFPADPKWCLRGAAVTVTATNFCPTNNNNGWCNLPRHHFDMSSPAFFRIARRGNEGIVPVFYRRVGCKRRGGVRFTMRGQGNFNMVMISNVGGGGSVRSVAVRGSKGKTWLQMTRNWGANWQSSGDLRGQRLSFKVTLTDSKTQTFLNVVPSSWWFGQTFSSRGRQFV
>AtEXPA13
HYSSSTSSPSSSSVSSDASEWRPARATYYAATNPRDAVGGACGYGDLVKSGYGMATVGLSETLFERGQICGACFELRCVDDLRWCIPGTSIILTATNFCAPNYGFDPDGGGHCNPPNKHFVLPIEAFEKIAIWKAGNMPVQYRRINCRKEGSMRFTVDGGGIFISVLITNVAGSGDIAAVKIKGSRTGWLPMGRNWGQNWHINADLRNQALSFEVTSSDRSXVTSYNVSPKNWNYGQTFEGKQFETP
>AtEXPA14
YSSGWVNARATFYGGADASGTMGGACGYGNLYSQGYGTNTAALSTALFNGGQSCGACFQIKCVDDPKWCIGGTITVTGTNFCPPNFAQANNAGGWCNPPQHHFDLAQPIFLRIAQYKAGVVPVQYRRVACRRKGGIRFTINGHSYFNLVLITNVAGAGDVISVSIKGTNTRWQSMSRNWGQNWQSNAKLDGQALSFKVTTSDGRTVISNNATPRNWSFGQTYTGKQFRAQR
>AtEXPA15
YDAGWVNAHATFYGGSDASGTMGGACGYGNLYSQGYGTNTAALSTALFNNGLSCGACFEIKCQSDGAWCLPGAIIVTATNFCPPNNALPNNAGGWCNPPLHHFDLSQPVFQRIAQYKAGVVPVSYRRVPCMRRGGIRFTINGHSYFNLVLVTNVGGAGDVHSVAVKGSRTRWQQMSRNWGQNWQSNNLLNGQALSFKVTASDGRTVVSNNIAPASWSFGQTFTGRQFR
>AtEXPA16
GIPRVFSGGSWQTAHATFYGGNDASGTMGGACGYGNLYSQGYGTNTAALSTSLFNSGQSCGACFEIKCVNDPKWCHPGNPSVFVTATNFCPPNLAQPSDNGGWCNPPRSHFDLAMPVFLKIAEYRAGIVPISYRRVACRKSGGIRFTINGHRYFNLVLITNVAGAGDIARTSVKGSKTGWMSLTRNWGQNWQSNAVLVGQSLSFRVTSSDRRTSTSWNIAPSNWQFGQTFVGKNFRV
>AtEXPA17
GWLQAHATFYGGSDASGTMGGACGYGNLYTDGYKTNTAALSTALFNDGKSCGGCYQILCDATKVPQWCLKGKSITITATNFCPPNFAQASDNGGWCNPPRPHFDMAQPAFLTIAKYKAGIVPILYKKVGCRRSGGMRFTINGRNYFELVLISNVAGGGEISKVWIKGSKSNKWETMSRNWGANYQSNTYLNGQSLSFKVQLSDGSIKAALNVVPSNWRFGQSFKSNVNF
>AtEXPA18
TYAGTPWRTASATFYGDDTGSATMGGACGYGNMYDSGYGVATTALSTALFNEGYACGQCFQLKCVSSPNCYYGSPATVVTATNICPPNYGQASNNGGWCNPPRVHFDLTKPAFMKIANWKAGIIPVSYRRVACKKIGGIRFKFEGNGYWLLVYVMNVGGPGDIKTMAVKGSRTGWINMSHNWGASYQAFSSLYGQSLSFRLTSYTTRQTIYAYNAAPASWSAGKTYQSKANFN
>AtEXPA19
HVGLTNIDPSWYDAHATFYGDMSGGETMQGACGYGDLFKQGYGLETAALSTALFNNGQTCGACFELMCVSSKWCKPNAGSIKITATNFCPPNYQEPVQYHWCNPPNKHFDLSMKMFTTVAEYRAGIVPVKFRRVACHKRGGVRFEIKGNPYYIMVLVYNVGGAGDVSNVEIRGQKSNWIVMKRNWGQIWDTGLDLVGQSLSFIVRTSDGRSMTFFNVAPPNWGFGQTYEAKSNF
>AtEXPA20
EDDWKIATATLSRDRDGSSSVATGGACGYGDLRQSSFAGYSAGLSGKLFNRGSSCGACLEVRCVNHIRWCLQGSPSVVVTATDFCPPNSGLSSDYGGWCNFPKEHLELSHAAFTGIAETRAEMIPIQYRRVKCGRRGGLRFSLSGSSHFFQVLISNVGLDGEVVGVKVKGHTTAWIPMARNWGQNWHSSLDLIGQSLSFEVTLKGGKTIASYDVAPPYWRFGMTYQGKQFHS
>AtEXPA21
ANVAAAPGTNGLDTAWYDARAAYYGDIHGGGTELEGACGYGDLNKHGYGLATAALSTALFNSGASCGACYEIMCSPNPQGCLSGSIKITATDLCPPGSAWCYLPNKHFDLSLPMFIKIAQVKAKMVPVRYRRVPCAKTGGVKFEVKGNPNILTILPYNVGGAGDIIAVSAKGSKTAWVVMSRYWGQNWTTNVNLTGQSVSLRVTTSDGITKDFTDVMPASWGFGQTFDGKTNF
>AtEXPA22
HGAMIGNAVEAPDVAEAPGINDPSKALDTNWYDARATFYGDIHGGDTQQGACGYGNLFRQGYGLATAALSTALFNDGYTCGACYEIMCTRDPQWCLPGSVKITATNFCPANYSKTTDLWCNPPQKHFDLSLAMFLKIAKYKAGVVPVRYRRIPCSKTGGVKFETKGNPYFLMVLIYNVGGAGDIKYVQVKGNKTGWITMKKNWGQNWTTITVLTGQGLSFRVTTSDGITKDFWNVMPKNWGFGQTFDGRINF
>AtEXPA23
HRAMINDVAEAPVFDDVVSPNGLDSSWYDARATFYGDIHGGETQQGACGYGDLFKQGYGLETAALSTALFNEGYTCGACYQIMCVNDPQWCLPGSVKITATNFCPPDYSKTEGVWCNPPQKHFDLSLPMFLKIAQYKAGVVPVKYRRISCARTGGVKFETKGNPYFLMILPYNVGGAGDIKLMQVKGDKTGWITMQKNWGQNWTTGVNLTGQGISFRVTTSDGVTKDFNNVMPNNWGFGQTFDGKINF
>AtEXPA25
HRAMINDVAEAPVIDNVGSPTNGLDSSWYDARATFYGDIHGGETQQGACGYGDLFKQGYGLETAALSTALFNEGYTCGACYQIMCVHDPQWCLPGTIKITATNFCPPDYSKTEGVWCNPPQKHFDLSLPMFLKIAQYKAGVVPVKYRRISCARTGGVKFETKGNPYFLMILPYNVGGAGDIKLMQVKGDKTGWITMQKNWGQNWTTGVNLTGQGISFRVTTSDGVTKDFNNVMPNNWGFGQTFDGKINF
>AtEXPA26
HGAMIGNAVEAPDVAEAPGINDPSKALDPNWYDARATFYGDIHGGDTQQGACGYGNLFRQGYGLATAALSTALFNDGYTCGACYEIMCTRDPQWCLPGSVKITATNFCPANYSKTTDLWCNPPQKHFDLSLAMFLKIAKYKAGVVPVRYRRIPCSKTGGVKFETKGNPYFLMVLIYNVGGAGDIKYVQVKENKTGWITMKKNWGQNWTTSTVLTGQGLSFRVTTTDGITKDFWNVMPKNWGFGQTFDGKINF
>AtEXPB1
TPPLTHSNQQVAATRWLPATATWYGSAEGDGSSGGACGYGSLVDVKPFKARVGAVSPILFKGGEGCGACYKVRCLDKTICSKRAVTIIATDQSPSGPSAKAKHTHFDLSGAAFGHMAIPGHNGVIRNRGLLNILYRRTACKYRGKNIAFHVNAGSTDYWLSLLIEYEDGEGDIGSMHIRQAGSKEWISMKHIWGANWCIVEGPLKGPFSVKLTTLSNNKTLSATDVIPSNWVPKATYTSRLNFSPVL
>AtEXPB2
FSPKKFNISAATTSDSDWSIAGSTWYGNPTGYGSDGGACGYGNAVAQPPFSKMVSAGGPSLFKSGKGCGACYQVKCTSKSACSKNPVTVVITDECPGCVKESVHFDLSGTAFGAMAISGQDSQLRNVGELQILYKKVECNYIGKTVTFQVDKGSNANSFAVLVAYVNGDGEIGRIELKQALDSDKWLSMSQSWGAVWKLDVSSPLRAPLSLRVTSLESGKTVVASNVIPANWQPGAIYKSNVNF
>AtEXPB3
LATTNRHVSNSHWLPAVATWYGSPNGDGSDGGACGYGTLVDVKPLHARVGAVNPILFKNGEGCGACYKVRCLDKSICSRRAVTVIITDECPGCSKTSTHFDLSGAVFGRLAIAGESGPLRNRGLIPVIYRRTACKYRGKNIAFHVNEGSTDFWLSLLVEFEDGEGDIGSMHIRQAGAREWLEMKHVWGANWCIIGGPLKGPFSIKLTTLSAGKTLSATDVVPRNWAPKATYSSRLNFSPVL
>AtEXPB4
QNETIDVAGSGTAGVTWYGEPFGAGSTGGACGYGSAVANPPLYAMVSAGGPSLFNNGKGCGTCYQVVCIGHPACSGSPITVTITDECPGGPCASEPVHIDLSGKAMGALAKPGQADQLRSAGVIRVNYKRAACLYRGTNIVFRMDAGANPYYISFVVEYENGDGDLSNVEIQPAGGSFISMQEMRSAVWKVNSGSALRGPFNIRLTSGESHKVIVAYNVIPANWKPDESYRSIVNF
>AtEXPB5
HNKTHWNTAGITWYGDREGPGTTGGACGYGDAVAKHPYRCMVSAGGPSLFKDGKGCGACYRLKCDHPLCTKKPIKVMISDECPGCTKESVHFDLSGKAFGALAKRGKGDQLRNLGELKVSYKRACCKHPKTMIAIHVDAGANPYYMSFAVKFANGDGNFACIEVQPAGGQYMKMEEMRSAVWRLSPGVPLKGPFNIRLTSAVSGKKIIAKGVIPEKWSPGAIYHSKVNFPVQRKQK
>AtEXLA1
CDRCLHRSKAAYFSSASALSSGACAYGSMATSFFAGHIAAAIPSIYKDGAGCGACFQVRCKNPKLCSTKGTIVMITDLNKSNQTDLVLSSRAFRAMAKPIVGADKDLLKQGIVDIEYQRVPCDYGNKNMNVRVEEASKKPNYLEIKLLYQGGQTEVVSIDIAQVGSSPNWGYMTRSHGAVWVTDKVPTGAIQFRFVVTGGYDGKMIWSQSVLPSNWEAGKIYDAGVQITDIAQEGCDPCDAHIWN
>AtEXLA2
CDRCLHSSKAAYFSSASALSSGACAYGSMATGFFAGHIAAALPSIYKDGSGCGACFQVRCKNPTLCSSKGTTVIVTDLNKTNQTDLVLSSRAFRAMAKPVVGADRDLLKQGIVDIEYRRVPCDYGNKKMNVRVEESSKNPNYLAIKLLYQGGQTEVVAIYIAQVGSSHWSYMTRSHGAVWVTDKVPNGALQFRFVVTAGYDGKMVWSQRVLPANWEAGKSYDAGVQITDIAQEGCDPCDDHIWN
>AtEXLA3
CDRCLHRSKASYFSSASALSSGACAYGPMATSFFAGHIAAAIPSIYKDGAGCGACFQVRCKNPKLCNSKGTIVMVTDLNTSNQTDLVLSSRAFRAMAKPVVGVDKYLLKQGIVDVEYQRVPCNYGKRNLNVRVEEASKKPNYLAIKLLYQGGQTEVVGIDIAPVGSSQWSYMSRSHGAVWATDKVPTGALQFKFTVTGGYDGKTVWSKRVLPANWNSGRIYDAGVQITDIAQEGCDTCGHIWN
>AtEXLB1
DDFVNSRATYYGSPDCKANPRGHCGYGEFGRDINNGEVSGVSWRLWNNGTGCGACYQVRCKIPPHCSEEGVYVVATDSGEGDGTDFILSPKAYGRMARPGTENQLYSFGVVNVEYQRIPCRYAGYNLVYKIHEKSYNPHYLAILVLYVGGVNDILAVEVWQEDCKEWRRMRRVFGAVHDLQNPPRGTLTLRFLVYGSAGINWIQSPNAIPADWTAGATYDSNILLT
>OsEXLB1
DANFTVSRAAYYPNSDIKGTENGACEYGAFGATLNNGDVSASASLYRDGVGCGACYQVRCTNPYYCSPNGVTIVITDSGASDGTDFILSQHAFTRMAQSTDAGTALLTLGVVGIEYRRVSCTYPNKNIVFKITESSNFPNYLEFEIWYQQGNQDIIAVQLCETVNLTCQLLSRTHGAVWAAVSPPSGPLSIRMLFSSGAPRGGDTWLVPTNIVPQNWTAGATYDSGVQVQLQ
>mediTC77984
QDSFVCSRATYYGSPDCYANPKGACGYGDYGQTVNDGNVAGVSWLWKNGSGCGACYQVRCKIPELCDENGAYVVVTDFGVGDRTDFIMSPRGYSKLGKNGDASAELFKYGVVDIEYKRIPCKYNGYNILFKVHERSKNPHYLAILILYVGGTNDVTAVQLWQEDCKEWRPMRRAFGTVFDAENPPRGEIKLRLQVSGSAGLYWVESKNVISSDWEAGSVYDSQIQFD
>letTC982
SQDYYVSSRATYYGSPDCLGTPTGACGYKAYGSTINGGEVSGVSRLFKNGTGCGACYQVRCKSPKHCSENGVKVVVTDHGEGDNTDFILSTRAYSKLALPGLAEALFSYGVVDVEYKRIPCQYSGYNLMFKVHEHSRFPDYLSLIPIYQAGVSEITAVEIWQEDCQEWVCMRRAYGAVWDMPNPPRGQPEGAINVRIQVTGSSGSKWVQLKNLIPSAWKVGAAYDTSIQLD
>cotTC21359
QDYFVKSRATYYGSPDCLGTPSGACGFGEYGKSVNDANVAGVSRLYKNGTACGACYQVRCTNPQICADNGVNIVVTDYGEGDNTDFILSPPAYARMARPDTAAHLFAYGVVDVEYQRIPCQYSGYKTQVKVQEHSKYPNYLAIVVLYQAGKSEILSVDIWQEDCKEWIGMTRAYGDRFRHGKSAIGRHQF
>mediTC88001
CDRCLHQSKASYFSKASALSSGACGYGSLALDFSGGHLAAGVSSLFYNGAGCGACFQVRCKNQAICTKEGTKVVLTDLNHNNQTDFVLSSRAFTAMAQKGMSQQILKLGIVDIEYKRVPCEYKKQNLAVRVEESSKKPDYLAIKFLYQGGQTEIVGVDVAQVGSSNWSFLSRNHGAVWDTSRVPQGALQFRIVVTSGYDGKWLWAKKVLPADWKNGVIYDSDIQITEIAQEGCSPCNDETWS
>OsEXLA1
CDRCVRRSRAAYYTSSLTLTAGSCGYGTAAATFNGGGFLAAAGPALYRGGVGCGACYQVRCKDKKLCSNAGARVVVTDRARTNRTGLVLSSPAFAAMARPGMAASLTELAAVDVEYKRVPCEYRHRSLSVRVDERSRGPNELTISFLYQGGQTDIVAVDVAQVGSSSWKFMTREHGPSWSMANAPPGPLQMRLVVTGGYDGKWVWADREVLPRRWRAGEVYDTGVQITDIAQEGCFPCDTHEWK
>OsEXLA2
CDRCVRRSKAGFRDSSIALNAGSCGYGSLAASFNGGHLAAASPALFRGGVGCGACFQVRCKDGKLCSTAGAKVVVTDEARSTNRTDLVLSAAAYAAMARPGMAAQLRTRRAVDVEYKRVPCEYAAGRNLSIRVEEKSRPPRELSIRFLYQGGQTDIVAVDVATVGSSNWKFMTRDYGPAWSTAQAPAGPLQFRVVVTGGYDGKWVWADGEVLPRRWTAGRVYDAGVQIADVAQEGCYPCDTQEWK
>OsEXLA3
TPRASACERCVRNGKAAYSPSLSPLPPGGGGGCGYGAMAMEMELNGGFLAAGGPRQHRGGLGCGRCFQMRCRNAEVCSNAGVRVVLTDFHRSNSTDFLLGGPAFAGLAKPGMAHKLKKLDALSVEYRRIPCDYKDKNLSILVEEQSKRPNNLVIKFLYQGGQTDILAVDVAQVGSSDWRFMTRVYGPVWSIDRAPNGPLQFRAVVTGGYDGKWVWADREVLPANWQPGQVYDTGARIADVARESCLDCATLDWK
>cotCD809349
ETCSNCFTHSRAAYYPNSDEQGTDVGACGFGSFGATINGGDVSAVSDLYRNGVGCGACYQVRCTNSNYCSDKGVTVVITDQGSSHDTDFILSQRAFGRMAQTKDAAASLLALGVVDIEYRRVSCSYPNKNITIKIDENSNYPHYFAFVLWYQQGDKDITAVQLCETQNFVCKLLDRSHGAVWTTNSPPSGPLSLRMLLSGEDGR
>tomaTC118460
QNFIQSRAAYYPNSEEKGTETGACGFGTFGATINGGDVSAASDLYRDGLGCGACYQVRCTNSNYCSENGVTVVITDHGASDSTDFILSQRAFSRMAQTKDAASSLLSLGNVGIEYRRVSCSYPNKNITFKIDESSDNPYYLAFVIWYQQGKTDISAVQLCETENFVCKLLDRTRGAVWTSSSPPKGQLQIRMLLSSDDGDEKWIIPLNNIPQNWKGGETYDSGIQVD
|