Rice Expansins

 

EXPA1
EXPA2   LOC_Os01g60770.1
Genomic sequence length: 1823 nucleotides
CDS length: 756 nucleotides
Protein length: 251 amino acids       MW: 26673.6     pI: 7.2121

Genomic Sequence
>13101.t05383
ACAACCAAGCAACCATCGATTCGATCCTAGCTTAAGCCGAGTGAAGTAGCTAGCAGGACA
TTGTTGGTAGTGTGATCTTCTCCCTGTTTTGCAACAATGGCCTCACGCAGTAGTGCCCTG
CTCCTCCTCTTCTCGGCCTTCTGCTTCCTCGCCCGGCGAGCCGCCGCCGACTACGGCTCC
TGGCAGAGCGCCCACGCCACGTTCTACGGCGGCGGCGATGCGTCCGGCACGATGGGTACG
TGTGCACTGCTCCTCGTAACACTTTGTTTTTTTTTCTTTTGGACGCATTGACAAAACCGG
GAGTAATCTGACTCGTGTCGTGTTGTTTGTGTACATGAAAAGGCGGGGCGTGTGGGTATG
GGAACCTGTACAGCACCGGGTACGGCACCAACACGGCGGCGCTGAGCACGGTGCTGTTCA
ACGACGGCGCGGCGTGCGGGTCGTGCTACGAGCTGCGGTGCGACAACGATGGGCAGTGGT
GCCTGCCGGGCAGCGTCACCGTCACCGCCACCAACCTGTGCCCGCCGAACTACGCGCTCC
CCAACGACGACGGCGGCTGGTGCAACCCGCCGCGCCCCCACTTCGACATGGCCGAGCCGG
CCTTCCTCCAGATCGGCGTGTACCGCGCCGGCATCGTGCCCGTCTCCTACAGAAGGTGAG
CTCTACTTGGGTCAATGGGTCACGAGTGAGTGGCAAACTGTTTGCAACTTGAAGCCATGG
GCCCCCCGTGCGGTGTGTCGGCTTCGATGGGCCGATCCCCAATCGGCTGATCAGGCCATG
TGTAGGTGGGCCACCCGGCAGATTTGGTTTGTCCACGATCTGTCCGTGTCGGCTGTAGGC
AAAGCCATGCACGCACGACTTGACACCGACGTGGACTTGACTTCGCTGTCCCGGAATTAC
TTTACGCATCCCATTTCACATCCATATGATTAATCGTTGCAACAAATAGTAACAAATTAC
AGTTAATTCTACTGTTTCACTGAACTTTGCTTTTTTTTTTTGCTGGTGTGTTGCAGGGTG
CCGTGCGTGAAGAAGGGCGGGATCAGGTTCACCATCAACGGGCACTCCTACTTCAACCTG
GTTCTTGTGACCAACGTGGCCGGCCCAGGCGACGTGCAGTCCGTGTCGATCAAGGGGTCC
AGCACCGGGTGGCAGCCCATGTCCCGCAACTGGGGCCAGAACTGGCAGAGCAACTCCTAC
CTCGACGGCCAGAGCCTCTCCTTCCAGGTCGCCGTCAGCGACGGCCGCACCGTCACCAGC
AACAACGTCGTGCCGGCCGGCTGGCAGTTCGGCCAGACCTTCGAGGGCGGCCAGTTCTGA
TCGAGTACTCGGCTAATTCCGTTTTTCAGTTCACTTTTCTGCAGAAATGTCAGTGTGAGA
TTGAGATTGAGGTGGAGGAGGAGGGAGGCCTCTTTGCTATAGTAAAAGAGAAGAGGCTTT
GGCTATTCTGAGGCTGCTTATTAATTAGCACCCGCTTAGGCCTTTGCTTTCTCCCATAGG
TCCAGTAGACATAAATAAGTTATCATGGAGTTGGGCATTTTTTTTTTCTGCATTGTTGTA
GCCTTGGTTTAAAATTTGGCAAGGCAATTTAAGGATTATATTGTAGAACAGCAGGGAGGT
GGTGTCTTGTAACTTTTGTTGTTACATTGTAACCCCCCGAATCAGGCCACATGGGCCGGA
GGAGTTTGTGGGGGAGCACCGGAGCACTCTTGTGCTTTTGGGACCATGAACTATGCTAGT
TTATGGGTCTATGTATATGATATGTTTGTGATTTGATTATACTTTCTTTTGTCGCCTTAT
CAAGTACCAATCATTATTAAGTA

CDS
>13101.m06469
ATGGCCTCACGCAGTAGTGCCCTGCTCCTCCTCTTCTCGGCCTTCTGCTTCCTCGCCCGG
CGAGCCGCCGCCGACTACGGCTCCTGGCAGAGCGCCCACGCCACGTTCTACGGCGGCGGC
GATGCGTCCGGCACGATGGGCGGGGCGTGTGGGTATGGGAACCTGTACAGCACCGGGTAC
GGCACCAACACGGCGGCGCTGAGCACGGTGCTGTTCAACGACGGCGCGGCGTGCGGGTCG
TGCTACGAGCTGCGGTGCGACAACGATGGGCAGTGGTGCCTGCCGGGCAGCGTCACCGTC
ACCGCCACCAACCTGTGCCCGCCGAACTACGCGCTCCCCAACGACGACGGCGGCTGGTGC
AACCCGCCGCGCCCCCACTTCGACATGGCCGAGCCGGCCTTCCTCCAGATCGGCGTGTAC
CGCGCCGGCATCGTGCCCGTCTCCTACAGAAGGGTGCCGTGCGTGAAGAAGGGCGGGATC
AGGTTCACCATCAACGGGCACTCCTACTTCAACCTGGTTCTTGTGACCAACGTGGCCGGC
CCAGGCGACGTGCAGTCCGTGTCGATCAAGGGGTCCAGCACCGGGTGGCAGCCCATGTCC
CGCAACTGGGGCCAGAACTGGCAGAGCAACTCCTACCTCGACGGCCAGAGCCTCTCCTTC
CAGGTCGCCGTCAGCGACGGCCGCACCGTCACCAGCAACAACGTCGTGCCGGCCGGCTGG
CAGTTCGGCCAGACCTTCGAGGGCGGCCAGTTCTGA

Protein
>13101.m06469
MASRSSALLLLFSAFCFLARRAAADYGSWQSAHATFYGGGDASGTMGGACGYGNLYSTGY
GTNTAALSTVLFNDGAACGSCYELRCDNDGQWCLPGSVTVTATNLCPPNYALPNDDGGWC
NPPRPHFDMAEPAFLQIGVYRAGIVPVSYRRVPCVKKGGIRFTINGHSYFNLVLVTNVAG
PGDVQSVSIKGSSTGWQPMSRNWGQNWQSNSYLDGQSLSFQVAVSDGRTVTSNNVVPAGW
QFGQTFEGGQF*
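
The CDS and Protein blocks of an entry are related by the standard genetic code: each three-nucleotide codon of the CDS gives one residue, and the final stop codon is written as `*`. The following is a minimal sketch of that translation, not the tool used to build this page. The names `CODON_TABLE` and `translate` are illustrative, and the two test strings are the first and last CDS lines of the EXPA2 entry above.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    # A character other than A, C, G, T raises KeyError in this sketch.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the EXPA2 CDS shown above.
first_line = "ATGGCCTCACGCAGTAGTGCCCTGCTCCTCCTCTTCTCGGCCTTCTGCTTCCTCGCCCGG"
last_line = "CAGTTCGGCCAGACCTTCGAGGGCGGCCAGTTCTGA"

print(translate(first_line))  # MASRSSALLLLFSAFCFLAR
print(translate(last_line))   # QFGQTFEGGQF*
```

Both outputs match the start and end of the EXPA2 protein sequence listed above.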
 

EXPA3
EXPA4
EXPA5
EXPA6
EXPA7
EXPA8   LOC_Os01g14650.1
Genomic sequence length: 1162 nucleotides
CDS length: 756 nucleotides
Protein length: 251 amino acids  MW: 26333.7    pI: 8.7813
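
The CDS and protein lengths reported for each entry are tied together by simple arithmetic: the CDS includes the stop codon, so protein length = CDS length / 3 - 1 (for this entry, 756 / 3 - 1 = 251). The sketch below checks that relation for the length pairs listed on this page. The dictionary `ENTRIES` and the function name are illustrative; only the numbers come from the page.

```python
# (CDS length in nucleotides, protein length in amino acids), copied from this page.
ENTRIES = {
    "EXPA2": (756, 251),
    "EXPA8": (756, 251),
    "EXPA9": (765, 254),
    "EXPA11": (705, 234),
    "EXPA13": (789, 262),
    "EXPA14": (789, 262),
    "EXPA23": (804, 267),
    "EXPA24": (837, 278),
    "EXPB16": (819, 272),
    "EXLB1": (771, 256),
}


def expected_protein_length(cds_nt: int) -> int:
    """Residues encoded by a CDS that ends in a stop codon."""
    if cds_nt % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return cds_nt // 3 - 1  # subtract the stop codon


# Collect any entry whose reported protein length disagrees with its CDS length.
mismatches = {
    name: (cds_nt, aa)
    for name, (cds_nt, aa) in ENTRIES.items()
    if expected_protein_length(cds_nt) != aa
}
print(mismatches)  # {}
```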

Genomic Sequence
>13101.t01314
ACAACACAAGAAGCCAGTTACAATAATTTAGAAGCAAAAGAAAAACTGTACCGAGCTTGC
ATTGCAGTGACGCGACAATGGCGGCCGCCAGAATGCTCGTGCTCCTGGCCTCTCTCTGCG
CTCTCCTGCTCACGGCGTCCGCGGCCAAATGGACTCCTGCCTTCGCGACGTTCTACGGCG
GCAGCGACGCCTCCGGCACCATGGGTTAGTAGCAAATTAAGCTCAGCTAATCACCATGCA
TGTGTTAATTAATCTTTTGCAAAGTTTAATTGGCATTTACAATAATGGAGTTTGGTCTCT
CTCTCGTCTCGTGTGTGATATCCAGGCGGCGCATGCGGGTACGGCGACCTGTACGGCGCC
GGGTACGGGACGCGGACGGCGGCGCTGAGCACGGCGCTGTTCAACGGCGGCGCGTCGTGC
GGCGCCTGCTTCACCATCGCCTGCGACACCCGCAAGACGCAGTGGTGCAAGCCGGGGACG
TCCATCACCGTGACGGCGACCAACTTCTGCCCTCCAAACTACGCCCTGTCCGGCGACGCC
GGCGGGTGGTGCAACCCGCCGCGCCGCCACTTCGACATGTCGCAGCCGGCGTGGGAGACG
ATCGCGGTGTACCGCGCCGGGATCGTGCCCGTGAACTACCGCCGCGTGCCGTGCCAGCGG
AGCGGCGGCATCCGGTTCGCCGTGAACGGGCACAGCTACTTCGAGCTGGTGCTGGTGACG
AACGTCGGCGGCAGCGGCGCGGTGGCGCAGATGTGGATCAAGGGGTCCGGGACGGGGTGG
ATGGCGATGAGCCGCAACTGGGGCGCGAACTGGCAGAGCAACGCGCGCCTCGACGGGCAG
GCGCTGTCGTTCCGGGTGCAGGCCGACGACGGCCGCGTCGTCACGGCGGCCGACGTCGCG
CCGGCGGGGTGGTCGTTCGGCGCCACCTACACCTCCTCGGCTCAGTTCTACTGATGAGCA
TTAATTGCAAGCCTATCTCATTTAATTAATCTGGACCGTTCGATGTAGTATATTGTATGC
TTTGATCGACGTGGCATAGGATGGAAGAGGCCGCCCACGAAGAAAATTTGGGACTTGTGT
ATATATCATTGTACCGCACTTTGTTTTTTTCTTTTTTGATTTGCTACTGTACTATACGCA
CGTAAGTTGTGCTTTTTCTTGT

CDS
>13101.m01611
ATGGCGGCCGCCAGAATGCTCGTGCTCCTGGCCTCTCTCTGCGCTCTCCTGCTCACGGCG
TCCGCGGCCAAATGGACTCCTGCCTTCGCGACGTTCTACGGCGGCAGCGACGCCTCCGGC
ACCATGGGCGGCGCATGCGGGTACGGCGACCTGTACGGCGCCGGGTACGGGACGCGGACG
GCGGCGCTGAGCACGGCGCTGTTCAACGGCGGCGCGTCGTGCGGCGCCTGCTTCACCATC
GCCTGCGACACCCGCAAGACGCAGTGGTGCAAGCCGGGGACGTCCATCACCGTGACGGCG
ACCAACTTCTGCCCTCCAAACTACGCCCTGTCCGGCGACGCCGGCGGGTGGTGCAACCCG
CCGCGCCGCCACTTCGACATGTCGCAGCCGGCGTGGGAGACGATCGCGGTGTACCGCGCC
GGGATCGTGCCCGTGAACTACCGCCGCGTGCCGTGCCAGCGGAGCGGCGGCATCCGGTTC
GCCGTGAACGGGCACAGCTACTTCGAGCTGGTGCTGGTGACGAACGTCGGCGGCAGCGGC
GCGGTGGCGCAGATGTGGATCAAGGGGTCCGGGACGGGGTGGATGGCGATGAGCCGCAAC
TGGGGCGCGAACTGGCAGAGCAACGCGCGCCTCGACGGGCAGGCGCTGTCGTTCCGGGTG
CAGGCCGACGACGGCCGCGTCGTCACGGCGGCCGACGTCGCGCCGGCGGGGTGGTCGTTC
GGCGCCACCTACACCTCCTCGGCTCAGTTCTACTGA

Protein
>13101.m01611
MAAARMLVLLASLCALLLTASAAKWTPAFATFYGGSDASGTMGGACGYGDLYGAGYGTRT
AALSTALFNGGASCGACFTIACDTRKTQWCKPGTSITVTATNFCPPNYALSGDAGGWCNP
PRRHFDMSQPAWETIAVYRAGIVPVNYRRVPCQRSGGIRFAVNGHSYFELVLVTNVGGSG
AVAQMWIKGSGTGWMAMSRNWGANWQSNARLDGQALSFRVQADDGRVVTAADVAPAGWSF
GATYTSSAQFY*

 

EXPA9    LOC_Os01g14660.1

Genomic sequence length: 903 nucleotides
CDS length: 765 nucleotides
Protein length: 254 amino acids    MW:  26986.9      pI: 6.4625

Genomic Sequence
>13101.t01315
GTTTCAGCAGCGAGATGGAGAAGAAGCTGTTGGTCGTCTTGTTCCTAAGCCTGTGCTGCG
CGTCTCGGCTCCGCGGCGAGGCGGCGCAGCAGTGGACGTCGGCCACCGCCACGTTCTACG
GCGGCAGCGACGCGTCCGGCACCATGGGTAACCTGGACGAACACCTTAACTGCGTGACAG
CTTTGATCACCATCCGGCCAATGCGGCAGTGCAGCACGCATCGCTTAGCTGCTTTGATCA
TGCAGGTGGATCGTGCGGGTACGGCAACATGTACAGCGCCGGGTACGGGACGAACACGAC
GGCGCTGAGCTCGGCGCTGTACGGCGACGGCGCGTCGTGCGGCGCGTGCTACCTCGTCAC
CTGCGACGCCTCGGCGACGCGGTGGTGCAAGAACGGCACGTCGGTGACCGTGACGGCGAC
CAACTACTGCCCGCCCAACTACAGCGAGTCCGGCGACGCCGGCGGGTGGTGCAACCCGCC
GCGGCGCCACTTCGACATGTCGCAGCCGGCGTGGGAGGCGATCGCCGTGTACAGCTCCGG
CATCGTCCCCGTCAGGTACGCGCGGACGCCGTGCAGGCGCGTCGGCGGCATCCGGTTCGG
CATCGCCGGGCACGACTACTACGAGCTGGTGCTCGTCACCAACGTCGCCGGCAGCGGCGC
CGTGGCGGCGGCGTGGGTGAAGGGCTCCGGGACGGAGTGGCTGTCGATGAGCCGGAACTG
GGGGGAGAACTGGCAGAGCAACGCGTACCTCACCGGCCAGGCGCTGTCGTTCAGGGTGCA
GGCCGACGACGGCGGCGTCGTCACGGCGTACGACGTCGCTCCGGCGAACTGGCAGTTCGG
GTCCACCTACCAGTCCGACGTCAACTTCTCCTACTAGGCCTGTCACCTGTGCAGGAATTT
CTT

CDS
>13101.m01612
ATGGAGAAGAAGCTGTTGGTCGTCTTGTTCCTAAGCCTGTGCTGCGCGTCTCGGCTCCGC
GGCGAGGCGGCGCAGCAGTGGACGTCGGCCACCGCCACGTTCTACGGCGGCAGCGACGCG
TCCGGCACCATGGGTGGATCGTGCGGGTACGGCAACATGTACAGCGCCGGGTACGGGACG
AACACGACGGCGCTGAGCTCGGCGCTGTACGGCGACGGCGCGTCGTGCGGCGCGTGCTAC
CTCGTCACCTGCGACGCCTCGGCGACGCGGTGGTGCAAGAACGGCACGTCGGTGACCGTG
ACGGCGACCAACTACTGCCCGCCCAACTACAGCGAGTCCGGCGACGCCGGCGGGTGGTGC
AACCCGCCGCGGCGCCACTTCGACATGTCGCAGCCGGCGTGGGAGGCGATCGCCGTGTAC
AGCTCCGGCATCGTCCCCGTCAGGTACGCGCGGACGCCGTGCAGGCGCGTCGGCGGCATC
CGGTTCGGCATCGCCGGGCACGACTACTACGAGCTGGTGCTCGTCACCAACGTCGCCGGC
AGCGGCGCCGTGGCGGCGGCGTGGGTGAAGGGCTCCGGGACGGAGTGGCTGTCGATGAGC
CGGAACTGGGGGGAGAACTGGCAGAGCAACGCGTACCTCACCGGCCAGGCGCTGTCGTTC
AGGGTGCAGGCCGACGACGGCGGCGTCGTCACGGCGTACGACGTCGCTCCGGCGAACTGG
CAGTTCGGGTCCACCTACCAGTCCGACGTCAACTTCTCCTACTAG

Protein
>13101.m01612
MEKKLLVVLFLSLCCASRLRGEAAQQWTSATATFYGGSDASGTMGGSCGYGNMYSAGYGT
NTTALSSALYGDGASCGACYLVTCDASATRWCKNGTSVTVTATNYCPPNYSESGDAGGWC
NPPRRHFDMSQPAWEAIAVYSSGIVPVRYARTPCRRVGGIRFGIAGHDYYELVLVTNVAG
SGAVAAAWVKGSGTEWLSMSRNWGENWQSNAYLTGQALSFRVQADDGGVVTAYDVAPANW
QFGSTYQSDVNFSY*
 

 

EXPA10
EXPA11   LOC_Os01g16770.1

Genomic sequence length: 798 nucleotides
CDS length: 705 nucleotides
Protein length: 234 amino acids    MW:  23642    pI: 7.6803
Note: the predicted start site may lie several amino acids upstream of what is shown below, which would give a longer signal peptide.

Genomic Sequence
>13101.t01522
CTACGCTTGCTAGCTGTTGCCGCCGTGGCCGCGATGGCCGCGGAGGTCGCCGCCGGCGGC
GACTCCGGCTGGAGCAGCGGCAGCGCGACGTTCTACGGCGGCAGCGACGCGTCGGGGACG
ATGGGTGGCGCGTGCGGGTACGGCAACCTGTACAGCGCCGGGTACGGGACGAGCACGGCG
GCGCTGAGCACGGCGCTGTTCAACAACGGGCAGAGCTGCGGCGCGTGCTTCGAGGTCCGG
TGCGGCGGCGGCGGCAGCTGCCTCGCCGGCACCGTGGCGGTGACGGCCACCAACCTCTGC
CCGCCCAACTACGCGCTCGCCGGCGACGCCGGCGGGTGGTGCAACCCGCCGCGGCCGCAC
TTCGACATGGCGGAGCCGGCGTTCACCAGGATCGCGCAGGCCCGCGCCGGCGTGGTGCCG
GTGCAGTACCGGCGCGTGGCGTGCGCGAAGCAGGGCGGCATCCGGTTCACCATCACCGGC
CACTCCTACTTCAACCTGGTGCTCGTCACCAACGTCGGCGGCGCCGGCGACGTGACGGCG
GTGTCGGTGAAGGGGTCGCGGTCGGGGTGGCAGGCCATGAGCCACAACTGGGGCGCCAAC
TGGCAGAACGGCGCCAACCTCGACGGCCAGCCGCTCTCCTTCAGGGTCACCGCCAGCGAC
GGCCGCACCGTCACCTCCGACAACGTCGCCCCCTCCGGCTGGTCCTTCGGCCAGACCTTC
TCCGGCGGCCAGTTCTAGCCGTTCCGCCGTCGATCGCCGGAGGAATTGAAGGGGCTCTGT
TTTGAAGAAGCTCCAAAC

CDS
>13101.m01874
ATGGCCGCGGAGGTCGCCGCCGGCGGCGACTCCGGCTGGAGCAGCGGCAGCGCGACGTTC
TACGGCGGCAGCGACGCGTCGGGGACGATGGGTGGCGCGTGCGGGTACGGCAACCTGTAC
AGCGCCGGGTACGGGACGAGCACGGCGGCGCTGAGCACGGCGCTGTTCAACAACGGGCAG
AGCTGCGGCGCGTGCTTCGAGGTCCGGTGCGGCGGCGGCGGCAGCTGCCTCGCCGGCACC
GTGGCGGTGACGGCCACCAACCTCTGCCCGCCCAACTACGCGCTCGCCGGCGACGCCGGC
GGGTGGTGCAACCCGCCGCGGCCGCACTTCGACATGGCGGAGCCGGCGTTCACCAGGATC
GCGCAGGCCCGCGCCGGCGTGGTGCCGGTGCAGTACCGGCGCGTGGCGTGCGCGAAGCAG
GGCGGCATCCGGTTCACCATCACCGGCCACTCCTACTTCAACCTGGTGCTCGTCACCAAC
GTCGGCGGCGCCGGCGACGTGACGGCGGTGTCGGTGAAGGGGTCGCGGTCGGGGTGGCAG
GCCATGAGCCACAACTGGGGCGCCAACTGGCAGAACGGCGCCAACCTCGACGGCCAGCCG
CTCTCCTTCAGGGTCACCGCCAGCGACGGCCGCACCGTCACCTCCGACAACGTCGCCCCC
TCCGGCTGGTCCTTCGGCCAGACCTTCTCCGGCGGCCAGTTCTAG

Protein
>13101.m01874
MAAEVAAGGDSGWSSGSATFYGGSDASGTMGGACGYGNLYSAGYGTSTAALSTALFNNGQ
SCGACFEVRCGGGGSCLAGTVAVTATNLCPPNYALAGDAGGWCNPPRPHFDMAEPAFTRI
AQARAGVVPVQYRRVACAKQGGIRFTITGHSYFNLVLVTNVGGAGDVTAVSVKGSRSGWQ
AMSHNWGANWQNGANLDGQPLSFRVTASDGRTVTSDNVAPSGWSFGQTFSGGQF*
 
EXPA12
EXPA13    LOC_Os02g16730.1

Genomic sequence length: 1630 nucleotides
CDS length: 789 nucleotides
Protein length: 262 amino acids       MW:  27971.1     pI: 9.1068

Genomic Sequence
>13102.t01479
GTTTCACCATTCCTCGAACACCATCATCCACAAACACACAAGTACATCTCGATCGAAGCT
AGCTAGACTAGAGCACGCACACGTACGATCTCGTCCATGGCGGGCGTCGCTCGCATGCTC
GCGGCCGTGGTCTGCGCGATCATGCCGGCGGCGGCGATGGCGGCCGGCGGCGTGGGCGCG
CTGGAGCCGAGCGGGTGGGTGAGGGCGCACGCGACGTTCTACGGCGGCGCGGACGCGTCG
GGAACCATGGGCGGGGCGTGCGGGTACGGCAACCTGTACGCGCAGGGGTACGGCACGAGG
ACGGCGGCGCTCAGCACGGCGCTGTTCAACGACGGCCTCGCCTGCGGGCAGTGCTACAAG
CTCGTCTGCGACCGCAAGACGGACCGGACGTGGTGCAAGCCGGGCGTCTCCGTCACCATC
ACCGCCACCAACTTCTGCCCGCCCAACTGGGACCTCCCCAGCGACAGCGGCGGCTGGTGC
AACCCGCCGCGCCCCCACTTCGACATGGCCCAGCCCGCCTGGGAGAAGATCGGCATCTAC
CGCGGCGGCATCATCCCCGTCATCTACCAAAGGTAAACAACACAATCACATATGTCTAGT
GGTCGTTGCAGTGACCTGAATAGCACTCTAAGGTTCTAAGTTCAGATCTTCATAGGAGTG
AATTTCAGATTATGTTGTTTGAGGGACTGAGTTTCCGATTCAAGACGATTTGAGAGGCTG
GATAAAAAATACCATTCTCTAAAAAAAAATTATCACCTTCGACATCGCAACATCGATCAA
GCATTGATGATTGATTCTACTGGGTGGCGGAGTCAAAAAAAAAAACATTCACATATAAGG
CTGAGCTAATGAAAGTTCAAATATTTTCACAACCACATAGTATTTTTTAAGATTATAATG
AGAGCTATGGATCTAGGCATTAGGCTCGAGTCCTGGCTGCCCCATGCCTGATGCGGCCTC
AAATTTGTACTACCACCGTAAAAATCACTTTATTTTTGATGTTTTTGTGTAGGGTTCCTT
GCATGAAGAAGGGCGGGGTGCGGTTCACCATCAACGGGCACGACTACTTCCAGCTGGTGC
TGCTGACCAACGTCGGGGCGGCCGGCTCGATCAAGGCCATGGACGTGAAGGGCTCCAAGT
CGCCGGACTGGATGGCCATGGCGCACAACTGGGGCGCCCAGTGGCACTCGCTGGCCTACC
TCACCGGCCAGGGCCTCTCCTTCCGGGTCACCATCACCGATGGCCAGACGCTCGTCTTCC
CCAACGTCGTTCGCCCCGGATGGAGGTTCGGCCAAACGTTCGCGAGCAACATACAGTTCA
AGTGAGCCACCTGATCGATCGAAGCTTCTATATAGGCGCAGCTAGCATCGTCGTGCATAT
GCGTGAAGGATTTAATTTGGCGTCCAAACTTTATATATACTTCTTTTCCAAAATGATCGA
TCACTCTCTCAATTTGGATGCTGATGGGGGTTGATTTACTATGGTATTGTGTAATTAATT
ACCAGCCTGAAACCAACATTTTCATTGGGCTTTTGTATGTATGTGCGATGTACATAAAGA
GTTCGATCTAAATGATTCTTATGGAAGTATGATTATCACGCGATTTGAGTAAACTCCAAA
TTTGGTGTTT

CDS
>13102.m01872
ATGGCGGGCGTCGCTCGCATGCTCGCGGCCGTGGTCTGCGCGATCATGCCGGCGGCGGCG
ATGGCGGCCGGCGGCGTGGGCGCGCTGGAGCCGAGCGGGTGGGTGAGGGCGCACGCGACG
TTCTACGGCGGCGCGGACGCGTCGGGAACCATGGGCGGGGCGTGCGGGTACGGCAACCTG
TACGCGCAGGGGTACGGCACGAGGACGGCGGCGCTCAGCACGGCGCTGTTCAACGACGGC
CTCGCCTGCGGGCAGTGCTACAAGCTCGTCTGCGACCGCAAGACGGACCGGACGTGGTGC
AAGCCGGGCGTCTCCGTCACCATCACCGCCACCAACTTCTGCCCGCCCAACTGGGACCTC
CCCAGCGACAGCGGCGGCTGGTGCAACCCGCCGCGCCCCCACTTCGACATGGCCCAGCCC
GCCTGGGAGAAGATCGGCATCTACCGCGGCGGCATCATCCCCGTCATCTACCAAAGGGTT
CCTTGCATGAAGAAGGGCGGGGTGCGGTTCACCATCAACGGGCACGACTACTTCCAGCTG
GTGCTGCTGACCAACGTCGGGGCGGCCGGCTCGATCAAGGCCATGGACGTGAAGGGCTCC
AAGTCGCCGGACTGGATGGCCATGGCGCACAACTGGGGCGCCCAGTGGCACTCGCTGGCC
TACCTCACCGGCCAGGGCCTCTCCTTCCGGGTCACCATCACCGATGGCCAGACGCTCGTC
TTCCCCAACGTCGTTCGCCCCGGATGGAGGTTCGGCCAAACGTTCGCGAGCAACATACAG
TTCAAGTGA

Protein
>13102.m01872
MAGVARMLAAVVCAIMPAAAMAAGGVGALEPSGWVRAHATFYGGADASGTMGGACGYGNL
YAQGYGTRTAALSTALFNDGLACGQCYKLVCDRKTDRTWCKPGVSVTITATNFCPPNWDL
PSDSGGWCNPPRPHFDMAQPAWEKIGIYRGGIIPVIYQRVPCMKKGGVRFTINGHDYFQL
VLLTNVGAAGSIKAMDVKGSKSPDWMAMAHNWGAQWHSLAYLTGQGLSFRVTITDGQTLV
FPNVVRPGWRFGQTFASNIQFK*
 
EXPA14   LOC_Os02g16780.1

Genomic sequence length: 898 nucleotides
CDS length: 789 nucleotides
Protein length: 262 amino acids      MW: 28387.3      pI: 8.7785
 

Genomic Sequence
>13102.t01484
ATGGCTTCATCGCCTCGAGCTTTCGCGTTGGTGTTTTTCGCAATCGCGGCCGTCGGCTGC
ACTCAGCTGACCACGGCCGACGACGCGGCTCCGCCCGTGTGGCAGAAGGCGCACGCGACG
TTCTACGGCGGCGCCGACGCGTCGGGCACCATGGGAGGCGGGTGCGGGTATGGCGACCTG
TACTCGCAGGGGTACGGCACGCGGAACGCGGCGCTGAGCACGGCGCTGTTCAACGACGGC
GCGTCGTGCGGGCAGTGCTACAAGATCGCCTGCGACCGGAAGAGGGCGCCGCAGTGGTGC
AAGCCCGGCGTCACGGTCACCATCACCGCCACCAACTTCTGCCCGCCCAACTGGGACCTG
CCCAGCGACAATGGTGGCTGGTGCAACCCGCCGCGGCCGCACTTCGACATGGCGCAGCCG
GCGTGGGAGAAGATTGGCATCTACAGCGCGGGCATCATCCCGGTTATCTACCAAAGGTAC
ATAAAATATAATTTGACAGCAAAACAAAAATATCATTTTCGGACATATTGTACCTATGAC
ATGACGATAGAACAGATTTGTAACAATTTTTTGTAATGTTTTCAGGGTTCCATGCATAAA
GAAGGGTGGGGTGCGGTTTACCATTAACGGGCACGACTACTTCAATCTTGTGCTCGTGAC
AAACGTTGCAACCACTGGTTCGATCAAGTCAATGGACATCATGGGATCCAACTCAACCGA
CTGGATGCCAATGGTGAGAAACTGGGGTGCAAACTGGCACTCACTATCGTATCTCACAGG
GCAGACGCTCTCCTTCAGGGTGACTAACATGGATGGCCAGACGCTTGTCTTCAAGAACAT
TGTGCCATCTGGATGGAAATTTGGGCAAACATTTACAAGCAAGCTGCAATTCAAGTAA

CDS
>13102.m01877
ATGGCTTCATCGCCTCGAGCTTTCGCGTTGGTGTTTTTCGCAATCGCGGCCGTCGGCTGC
ACTCAGCTGACCACGGCCGACGACGCGGCTCCGCCCGTGTGGCAGAAGGCGCACGCGACG
TTCTACGGCGGCGCCGACGCGTCGGGCACCATGGGAGGCGGGTGCGGGTATGGCGACCTG
TACTCGCAGGGGTACGGCACGCGGAACGCGGCGCTGAGCACGGCGCTGTTCAACGACGGC
GCGTCGTGCGGGCAGTGCTACAAGATCGCCTGCGACCGGAAGAGGGCGCCGCAGTGGTGC
AAGCCCGGCGTCACGGTCACCATCACCGCCACCAACTTCTGCCCGCCCAACTGGGACCTG
CCCAGCGACAATGGTGGCTGGTGCAACCCGCCGCGGCCGCACTTCGACATGGCGCAGCCG
GCGTGGGAGAAGATTGGCATCTACAGCGCGGGCATCATCCCGGTTATCTACCAAAGGGTT
CCATGCATAAAGAAGGGTGGGGTGCGGTTTACCATTAACGGGCACGACTACTTCAATCTT
GTGCTCGTGACAAACGTTGCAACCACTGGTTCGATCAAGTCAATGGACATCATGGGATCC
AACTCAACCGACTGGATGCCAATGGTGAGAAACTGGGGTGCAAACTGGCACTCACTATCG
TATCTCACAGGGCAGACGCTCTCCTTCAGGGTGACTAACATGGATGGCCAGACGCTTGTC
TTCAAGAACATTGTGCCATCTGGATGGAAATTTGGGCAAACATTTACAAGCAAGCTGCAA
TTCAAGTAA

Protein
>13102.m01877
MASSPRAFALVFFAIAAVGCTQLTTADDAAPPVWQKAHATFYGGADASGTMGGGCGYGDL
YSQGYGTRNAALSTALFNDGASCGQCYKIACDRKRAPQWCKPGVTVTITATNFCPPNWDL
PSDNGGWCNPPRPHFDMAQPAWEKIGIYSAGIIPVIYQRVPCIKKGGVRFTINGHDYFNL
VLVTNVATTGSIKSMDIMGSNSTDWMPMVRNWGANWHSLSYLTGQTLSFRVTNMDGQTLV
FKNIVPSGWKFGQTFTSKLQFK*
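
The EXPA14 genomic sequence above begins at the ATG and ends at the TAA of the CDS, so the difference between the two lengths (898 - 789 = 109 nucleotides) is intron sequence. The sketch below shows one way to locate a single intron by matching the CDS against the genomic sequence from both ends. It assumes exactly one intron and that both strings start and end at the same bases; the function name is hypothetical. Because repeated bases at the junction leave several possible split points, it prefers the one giving a canonical GT...AG intron. The test data are an excerpt of the EXPA14 genomic and CDS lines spanning the intron.

```python
def find_single_intron(genomic: str, cds: str):
    """Locate one intron, assuming both strings start and end at the same bases.

    Returns (start, end) as a 0-based, half-open slice of `genomic`, or None.
    """
    gap = len(genomic) - len(cds)  # intron length under the assumption above
    if gap <= 0:
        return None
    # Every split point i where exon 1 = cds[:i] and exon 2 = cds[i:] both match.
    candidates = [
        (i, i + gap)
        for i in range(len(cds) + 1)
        if genomic[:i] == cds[:i] and genomic[i + gap:] == cds[i:]
    ]
    # Prefer the candidate whose intron starts with GT and ends with AG.
    for start, end in candidates:
        if genomic[start:start + 2] == "GT" and genomic[end - 2:end] == "AG":
            return start, end
    return candidates[0] if candidates else None


# Four consecutive genomic lines of EXPA14 that span the intron.
GENOMIC_EXCERPT = (
    "GCGTGGGAGAAGATTGGCATCTACAGCGCGGGCATCATCCCGGTTATCTACCAAAGGTAC"
    "ATAAAATATAATTTGACAGCAAAACAAAAATATCATTTTCGGACATATTGTACCTATGAC"
    "ATGACGATAGAACAGATTTGTAACAATTTTTTGTAATGTTTTCAGGGTTCCATGCATAAA"
    "GAAGGGTGGGGTGCGGTTTACCATTAACGGGCACGACTACTTCAATCTTGTGCTCGTGAC"
)
# The corresponding stretch of the EXPA14 CDS.
CDS_EXCERPT = (
    "GCGTGGGAGAAGATTGGCATCTACAGCGCGGGCATCATCCCGGTTATCTACCAAAGGGTT"
    "CCATGCATAAAGAAGGGTGGGGTGCGGTTTACCATTAACGGGCACGACTACTTCAATCTT"
    "GTGCTCGTGAC"
)

intron = find_single_intron(GENOMIC_EXCERPT, CDS_EXCERPT)
print(intron, intron[1] - intron[0])  # (56, 165) 109
```

The 109-nucleotide result agrees with the difference between the genomic and CDS lengths reported for this entry.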
 
EXPA15
EXPA16
EXPA17
EXPA18
EXPA19
EXPA20
EXPA21
EXPA22
EXPA23    LOC_Os02g16809.1      LOC_Os02g16839.1     Os02g0268050
Note the ambiguity in the MSU prediction.

Genomic sequence length: 975 nucleotides
CDS length: 804 nucleotides
Protein length: 267 amino acids    MW:  28573.4   pI: 9.246

Genomic Sequence
>13102.t01487
ATGGCCCCAGCTCGAGCGTTCGTGTTGGTGCTGCTCGCAGTCGCCAGTGCATCGACGGCC
GCGGCCAACACAGCGACGACGACGCCCACAAACCCGGTGGCTGCGCCGACCCAGTGGCAG
AAGGCGCACGCGACGTTCTACGGCGGCGCGGACGCGTCGGGCACCATGGGCGGGGCGTGC
GGATACGGCAACCTGTACTCGCAGGGGTACGGCACGCGGAACGCGGCGCTGAGCACGGCG
CTGTTCAACGACGGCGCGTCGTGCGGGCAGTGCTACAAGATCGCCTGCGACCGCAAGAGG
GCGCCGCAGTGGTGCAAGCCCGGCGTCACCGTTACCATCACCGCCACCAATTTCTGCCCG
CCCAACTGGAACCTTCCCAGTGATAATGGTGGCTGGTGCAACCCACCACGGCCGCACTTT
GACATGGCACAGCCGGCCTGGGAGAAGATCGGCGTCTACAGCGCAGGCATCATACCGGTC
ATCTATCAAAGGTACCAATACAAATACTCCAATTATTTTGGACATTATGGAGAGAACAGT
GCATATCCTGACCGCTGCTTCCATATGAAAATATCCTACTCTTGCGATGATTTCTACAGG
GTTCCTTGCGTGAAGAAGGGTGGCCTGCGGTTCACCATTAACGGTCACGACTACTTCCAG
CTAGTACTGGTGACCAACGTCGCGGCGGCAGGGTCAATCAAGTCCATGGAGGTTATGGGT
TCCAACACAGCGGATTGGATGCCGATGGCACGTAACTGGGGCGCCCAATGGCACTCACTG
GCCTACCTCACCGGTCAAGGTCTATCCTTTAGGGTCACCAACACAGATGACCAAACGCTC
GTCTTCACCAACGTCGTGCCACCAGGATGGAAGTTTGGCCAGACATTTGCAAGCAAGCTG
CAGTTCAAGTGAGAGGAGAAGCCTGAATTGATACCGGAGCGTTTCTTTTGGGAGTAACAT
CTCTGGTTGCCTAGC

CDS
>13102.m01880
ATGGCCCCAGCTCGAGCGTTCGTGTTGGTGCTGCTCGCAGTCGCCAGTGCATCGACGGCC
GCGGCCAACACAGCGACGACGACGCCCACAAACCCGGTGGCTGCGCCGACCCAGTGGCAG
AAGGCGCACGCGACGTTCTACGGCGGCGCGGACGCGTCGGGCACCATGGGCGGGGCGTGC
GGATACGGCAACCTGTACTCGCAGGGGTACGGCACGCGGAACGCGGCGCTGAGCACGGCG
CTGTTCAACGACGGCGCGTCGTGCGGGCAGTGCTACAAGATCGCCTGCGACCGCAAGAGG
GCGCCGCAGTGGTGCAAGCCCGGCGTCACCGTTACCATCACCGCCACCAATTTCTGCCCG
CCCAACTGGAACCTTCCCAGTGATAATGGTGGCTGGTGCAACCCACCACGGCCGCACTTT
GACATGGCACAGCCGGCCTGGGAGAAGATCGGCGTCTACAGCGCAGGCATCATACCGGTC
ATCTATCAAAGGGTTCCTTGCGTGAAGAAGGGTGGCCTGCGGTTCACCATTAACGGTCAC
GACTACTTCCAGCTAGTACTGGTGACCAACGTCGCGGCGGCAGGGTCAATCAAGTCCATG
GAGGTTATGGGTTCCAACACAGCGGATTGGATGCCGATGGCACGTAACTGGGGCGCCCAA
TGGCACTCACTGGCCTACCTCACCGGTCAAGGTCTATCCTTTAGGGTCACCAACACAGAT
GACCAAACGCTCGTCTTCACCAACGTCGTGCCACCAGGATGGAAGTTTGGCCAGACATTT
GCAAGCAAGCTGCAGTTCAAGTGA

Protein
>13102.m01880
MAPARAFVLVLLAVASASTAAANTATTTPTNPVAAPTQWQKAHATFYGGADASGTMGGAC
GYGNLYSQGYGTRNAALSTALFNDGASCGQCYKIACDRKRAPQWCKPGVTVTITATNFCP
PNWNLPSDNGGWCNPPRPHFDMAQPAWEKIGVYSAGIIPVIYQRVPCVKKGGLRFTINGH
DYFQLVLVTNVAAAGSIKSMEVMGSNTADWMPMARNWGAQWHSLAYLTGQGLSFRVTNTD
DQTLVFTNVVPPGWKFGQTFASKLQFK*
 
EXPA24    LOC_Os02g16800

Genomic sequence length: 956 nucleotides
CDS length: 837 nucleotides
Protein length: 278 amino acids      MW:  29974.3      pI: 8.892

Genomic Sequence
>13102.t01486
ACGTCCATGGCGGATATGGCTCCAGCTCGAGCACTCGCCTTGGTGTTGCTCGCAGTTGCA
GTCGGCAGCGCGTTGATGGCCGCGGCCCAGGATGCGCCGTCGCCACCGACACCGATGGCT
CCGTCTCCGTCTACCGATGAAACTCCGCCCGTGTGGCTGAAGGCGCACGCGACGTTCTAC
GGCGGGGCGGACGCGTCGGGCACCATGGGCGGGGCGTGCGGCTACGTCGACCTGTACTCG
CAGGGGTACGGGACGCGGAACGCGGCGCTGAGCACGGCGCTGTTCAACGACGGCGCGTCG
TGCGGGCAGTGCTACAAGATCGCCTGCGACCGCAAGAGGGCGCCGCAGTGGTGCAAGCCC
GGCGTCACGGTCACCGTCACCGCCACCAACTTCTGCCCGCCCAACTGGAACCTCCCCAGC
GACAACGGCGGCTGGTGCAACCCGCCGCGGCCGCACTTCGACATGGCGCAGCCTGCTTGG
GAGAAGATTGGCATCTACCGTGCTGGCATCATCCCGGTCATGTACCAAAGGTATATAATT
TAAACTGAGAAAAAAAAACATATTGTTTTTGACATATAGCACCTATGGCATATTTTAAGT
TGTAACATATTTGGTGATATTTTCAGGGTTCCGTGTGTGAAGAAGGGTGGGGTGCGGTTT
ACCATCAATGGGCATGACTACTTTAATCTTGTGCTCGTGACAAATGTTGCAACCACTGGT
TCGATCAAGTCGATGGACATCATGGGCTCCAACTCAACCGACTGGATGCCAATGGTGAGG
AACTGGGGTGCAAACTGGCACTCGCTGTCGTATCTCACCGGGCAGATGCTCTCCTTCAGG
GTGACGAACATGGATGGCCAGACACTAGTCTTTAGGAACATTGTGCCCTCTGGATGGAAG
TTCGGGCAAACATTTGCAAGCAAACTGCAGTTCAAGTAATTTAATTCCCGGAAAGA

CDS
>13102.m01879
ATGGCGGATATGGCTCCAGCTCGAGCACTCGCCTTGGTGTTGCTCGCAGTTGCAGTCGGC
AGCGCGTTGATGGCCGCGGCCCAGGATGCGCCGTCGCCACCGACACCGATGGCTCCGTCT
CCGTCTACCGATGAAACTCCGCCCGTGTGGCTGAAGGCGCACGCGACGTTCTACGGCGGG
GCGGACGCGTCGGGCACCATGGGCGGGGCGTGCGGCTACGTCGACCTGTACTCGCAGGGG
TACGGGACGCGGAACGCGGCGCTGAGCACGGCGCTGTTCAACGACGGCGCGTCGTGCGGG
CAGTGCTACAAGATCGCCTGCGACCGCAAGAGGGCGCCGCAGTGGTGCAAGCCCGGCGTC
ACGGTCACCGTCACCGCCACCAACTTCTGCCCGCCCAACTGGAACCTCCCCAGCGACAAC
GGCGGCTGGTGCAACCCGCCGCGGCCGCACTTCGACATGGCGCAGCCTGCTTGGGAGAAG
ATTGGCATCTACCGTGCTGGCATCATCCCGGTCATGTACCAAAGGGTTCCGTGTGTGAAG
AAGGGTGGGGTGCGGTTTACCATCAATGGGCATGACTACTTTAATCTTGTGCTCGTGACA
AATGTTGCAACCACTGGTTCGATCAAGTCGATGGACATCATGGGCTCCAACTCAACCGAC
TGGATGCCAATGGTGAGGAACTGGGGTGCAAACTGGCACTCGCTGTCGTATCTCACCGGG
CAGATGCTCTCCTTCAGGGTGACGAACATGGATGGCCAGACACTAGTCTTTAGGAACATT
GTGCCCTCTGGATGGAAGTTCGGGCAAACATTTGCAAGCAAACTGCAGTTCAAGTAA

Protein
>13102.m01879
MADMAPARALALVLLAVAVGSALMAAAQDAPSPPTPMAPSPSTDETPPVWLKAHATFYGG
ADASGTMGGACGYVDLYSQGYGTRNAALSTALFNDGASCGQCYKIACDRKRAPQWCKPGV
TVTVTATNFCPPNWNLPSDNGGWCNPPRPHFDMAQPAWEKIGIYRAGIIPVMYQRVPCVK
KGGVRFTINGHDYFNLVLVTNVATTGSIKSMDIMGSNSTDWMPMVRNWGANWHSLSYLTG
QMLSFRVTNMDGQTLVFRNIVPSGWKFGQTFASKLQFK*
 
EXPA25
EXPA26
EXPA27
EXPA28
EXPA29
EXPA30
EXPA31
EXPA32
EXPA33
 
EXPB1a
EXPB1b
EXPB2
EXPB3
EXPB4
EXPB5
EXPB6
EXPB7
EXPB8
EXPB9
EXPB10
EXPB11
EXPB12
EXPB13
EXPB14
EXPB15
EXPB16    LOC_Os02g42650   Os02g0639500 
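Note: the predicted start site differs from our analysis, resulting in a longer predicted signal peptide. The additional N-terminal residues from the predicted start appear in brackets in the protein sequence below.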

Genomic sequence length: 7174 nucleotides
CDS length: 819 nucleotides
Protein length: 272 amino acids

Genomic Sequence
>13102.t03879
CCCCTGTGTCCCGTTCGTGCCATTTGTGCAGCGAGCTAGCGTCCATCCTGAGCTCTCAGA
GCTTCGGGGACTGGGGCTTCTCGAAGGCTTAAAAGCGACCCACGCACAGGGGGAGTGAGA
GGAGTGAGAGTGAGCTGCTCCTGCACAAACAAGACGGCAGTAGCGACAGGATAGCTGCTT
CGATCAGCTAGCTCTTCGCTATGGCAGCCTTCTCCTCGAGCTCGTCTGCTCCCATGTTGA
TACGCTCCGTGCTCTTCGTGTCTCTCCTGTCCGCCGCGTTCGTCTTCGACTCCGGCGAGG
CTGGTGCGGCGCACAGGGTGGTCGACCCGGAGTGGCACCCGGCCACGGCCACCTGGTACG
GCAGCGCTGACGGCGACGGCAGCGACGGTGAGCTCTAGCTCCTACCACTTAATTAAACCC
TGCAGCTGCTACGACATCATTAACCTGATTCCTTTACCTTCCAAGAGAGTAGATAAGCTT
AATTAATTTACGTTCGCGAAAATTAAGCTTCAAGTCTTCAACAATGGCGGCCTTCGGTCC
CCTGTGCAGAATGCACGCATTGCGCGCTGCGACTGACGCATCAGGCCAGGTTTTCGTTCT
TTTCTCCTTTGATGATTATTCCAGATCACACTTGGTATTAACCGTCAGATTCGCATCCTA
TAGCTGCAGTGTTTCTAGCTTCCTAAGAAAATGGCGTACTTCTAGCCGTTGGCACATTTT
TGGTGCCTCTCCTCTCGGTAGTTGGCCCGGAATTTCGGCGGGCCTACGTGCATTCCCGCA
CTAGTGTCACGCTAGGACGGGCGCGTCCGGAGCCTTTAAGATTCGTGCATGTTTGTGATT
GTGAAAGATTATAAATTTATAATGGCCGGCACACGAGCGTGGAGCCTAGCTAGGGCGTGC
CCCTGTGTCGGGTCCTTTAGCTTTGCCGCATTTACGAGTAGCTAGGAACGTACCAACACG
GGAAACTGAGGAAAACCCAGCCGCTTTAATCCACCGACACTCCCGGTATATTTTGCGCCG
GACACAAGCAAACAAGTGAGTGGGCATACGGCCGGAGAAATTAACTGCTGACAAAATTTC
CACCGTTTGTTATCATGGGCGAACTGCATAAAGTGCACACACCTGAGACCTGACAATGCT
GAACTTACGTTAGGTTTGTTCTCTGTTACCACACTTAACGCAATAACGCATATTCACAAA
GTTCCTGATGTCTTCCTGTCCGTTATATATACAGCCATTCACATCTCGGCTAGCTACCCC
CCCCCCCCCCCCACCCCAACAAATGATCGCGTTGCCCCGTGGCCACTGGCCAGTATAAAT
TTTTACCAGATCTGAGTCACCCAGTGCCCACCATGAGGCCACGATGGATGTTTCCATGTG
TAAGCTTTCCCTTCCCGTTTTCTAGTATCTGGCACACTACGGCCCAAATTAACGAACTGT
TCATTGCGTGTCCATGCAAGCATCAAGCATGCATGCATGTCTTCTTCCTTGACTGTCCAT
GGGCCTGGTCGGCAGTAAACTTCTATACAACCATCGTCCAAAAATGTAGTAACTTAGATT
AGACCAATTTTAAATTACGAATCTCATTTTTAGATGACGGGTAATCACGTGAGCATTTCT
AATTTCAGAGTTGTAACCTGTTTGGCTGTATGTATGTACGTTTTTTTTTTTTGCAGGCGG
CGCGTGTGGATACGGGACGCTGGTGGACGTGGTGCCGATGAAGACGCGGGTGGGCGCGGT
GAGCCCCGTGCTGTTCAAGGGCGGCGAGGGGTGCGGCGCCTGCTACAAGGTGCGTTGCCT
CGACGCCAGCATCTGCTCGCGCCGCGCCGTCACGGTCATCGTCACCGACGAGTGCCCCGG
CGGCGTCTGCGCCTTCGGCCGCACGCACTTCGACCTCAGCGGCGCCGCCTTCGCCAGGCT
CGCCGTCGCCGGCCACGGCGGCCAGCTGCAGAACCGAGGCGAGATCTCGGTGGTGTACCG
CAGGTGAGCACCTAACAGTACATTTACTCAGCTTATTATACTACAGTATTGCATAGCCTG
GGGGGCAGAGGCAGATCGTGCACGGCGTACGTCGTGGTGGTCCGTATGTTGTTGTTGGCA
TGCCTGACGTTTACTGAGAGAACGCAGCCACTTGGAACTGAAAAGAAAGCCAGTTGGAAC
TCTGCCAAAAAGTACGTATCCTGCAGGCTGCAGCTGGTACTACCGCTGTATTTTTTCCCT
GCCCGGCAATTGTACAGCATGCTTAATTAATTAGAGTAGGCACGAAGGTTAAAAATGTCG
AGGCATAAATTAATGAGGAGGAAGACACTGTCAGATCTGAAGTACAGAGAGGACCATGGT
CCATGGTGAACATATGCCTCTGCGTCTGCGCTCGCAGTAGTTGGACTGTTGGAGTGTTGG
ATCTTGGAGGCGCCATGCGATGCAAGACTCCTCTCGTCACGACCTCCCGGTGTGGACGTG
TGGTCATCCATGCCTTCTCTCCATTCACAGATTCACTGCATGCTAGTCTGGATACCGTGA
AGAAATTAAAGAGGCCGGCCGGGCAGGGCACACCATGGAAAACGCAATGCTAGATGGAGT
AGCTGGTAGCTGCAGAGTTCACGCTTGTCATATTCATATCGTGCGAGGAAACAGTTTTTA
CGGTTTTACTACTAACGATAGCTGCATGCTCACGAGTTTGTCGTCGTTTTTCAGGACGGC
GTGCAAGTACGGGGGGAAGAACATTGCCTTCCACGTGAACGAGGGCTCGACGACCTTCTG
GCTCTCGCTTCTCGTCGAATTCGAGGATGGAGACGGCGACATTGGATCCATGCAGCTAAA
ACAGGTAAAAGAAATGGTCCAACTCGATTGCCGTCAATTTCAGGTCCTGGCTAGTAATAG
TACTGCTTCTGCCTTTGCATTCCCAATTGTCGCTCACTTGCATAGTTGCACGCACTCTAC
TCAACTTGCATAGTACACTTACCTGATAGCAGGGGGGAAATGGTGTTTTAGATGAGTAAA
TATATTGTACAGTGCTCTGCTTTCATGGCATCCATGCATGTGAGTCAGGGATGGATCACT
CTTCTCAGTTCTCACATGTGGTTGGGGGACACGGCTGATGATTTCCTGGACGAACAGGCA
GCAAAGGAGTAGGCCCTTTGCTTTGTTGCCCCCTTTTGTCCAGGGCAACAGATCTATAGT
TCTAGAATCGCATGTTGAAAGAACTGCGCCAAGCTAAAAAGAAACTACTATACCGATTAT
CTAATGATAACTTGTAGTACTAGTACTTGAATGTAAATAATTCAGTACTTGCTATATTTT
GAGATGGTGGGAGTAGCTCCTTGCAGTTTTTTCTTGCACGTTAAACATTTCTGCAGCCAT
ATGTTACCTACTACACCATTTGTTATACTGTACTAGTATCACCCTTCCAAACCCCTACTG
GTTTCAGCCAACTTTTTCCACCATGCATATTGAAACATATCCATTAATCCATCTATCCAC
CATGGGTGCCATTCTAGTTGACAGCCCAAAACAAGTGCCCTGCTTGGCCCCCTTTTAGGC
TTCTTGGCTGTGTTCGGCATCACCTTTTCCCAATCCTTCTCCCACATTTTCTGCGCGCAT
GTTTTTTAAACTGCTAAACGGTGTATTTTTTATAAAAAAAATTCTATATAAAAGTTGTTT
AAAAAATCAAATTAATCCATTTTTTAAAAAAACTAGGAAGGTGGCCCGCGCGCATGCGCG
GGCACTTATAATATTAAAAGGTAAGATTTTTATTGTTTGTTTAAAAATTTTGTCTCATAA
ATTAAGGGAAAACTAAAGTTTGGATACTTTTATTTTGCAATATTTTTTGTAATAATTTAA
GGGTTATATTTTTTAAGGTATCTTGAAAATCCATTCACAAACTTTTGAGTAGGAGATGGA
TTAAGTACTTACGACTTTTGAATCATGTTTTCTTTCTCGAGTAAATAATAAATTATTGCT
TGTAGTATGGTTACAAATGAAAAATACGGGAGCAAGATACTCAAAATTTTTGTGATTAAA
TCATCTCATGAAGACGCATGACATAATAAGAAAGGGAGGGAAGCATATACATGGAGAAAA
AAATAAAGGGAAAAATGAAAATGTGGAGGAGAAGCGTAGGTACCCACGTACGTAGGTGCG
TACCGAAGGTGGAGAGGTGGGACCTCGTAGTATTTAGTTTGTTATAAGATCAATTTAATC
TAATGGTTTATAATATTGGACCCACCGATTTAAGTAAAAATCAAGTAATACATACTTTGT
TTTTTTTCCCTTAGAATTTCTAATATTTTCTCTAATTTATTAGAGCAACACGTGGTAGCT
TAGGGGAATTTTAAGAAATTTTAATGGACTTACCACATGTGATTAGAATATTAACCATGT
AATATATATGTATGGGATATATGTTAATATTCGTTCCAGCTTCGTCCATGTTTATATACA
TTGCTAGCAAGCCATTAGAAGTCTAATTAATACTAGTACATATTTTGACACCTATTTACA
ATTGTTCATAGTAGCTCTTACAGATTCATCTTAGGCTCTTGATTATGCGTTGATCACCAA
TTTACTGATCACCAAGACTCGCATGAAAAAAAAAATACTACTTAGGGTTTAGGACAACTG
ACATTGATGAATTATCTGATTTATGTGATAGCCATTAGAAGTAGTACATATTTTGACACC
TATTTACAATTGTTCATAGTAGCTCTTACAAATTTATCTTACAATCTTGATTATGCGTTG
ATCACCAATTTACTGATCACCAAGACTCGCATGAAAAAAATATGTACTACTTAGGGTTTA
GAACAACTGAAATTAATGAATTATCTGATTTATGTGATAGCTATTTGAGATTATAAACTA
ATGAATTAACTTCTAAAAAGTTAAAAAGTTGTTTTAAAAAGCACCATTTAGGAACTTGGA
AAGCGTGGAGCGTGCAAACAAAAATCCAAAAGATGGAGTCGGAAAAAAAAATACGGCCTT
AAACTCCCTTATAACACTCTTTTCCTAGTAAGAGTTCTATTGATATTCACTAATATTATG
CTCATATTTATAACATAATAATTTCTTTCAACTGTAAGAACAAATAATTGAAAATCCAAA
TACGACATAACACAATACATTATAATTATTATGTAATCAACTCTTGGTTTCAGGTTAAAT
AGAGCTAAGTGCTAGAAAAACTCAACCCACGGTGCTTAATTCTTCTCTTATTGCATCACA
CCTTTAATTTTATTTTTATGTTATTAACAAAATTAATCATCTCCATATACAATTTGATAC
GTGGCAAAGCGGTAGCACTGCAGCAGTCGTGATCACCTTAGATGCCTCAGACACGATGCT
TTGTCACAAAGTGCGAAAGGCAAAGGTGCAAACGTGCATTGTGCAGCAAGCCAAAAAACG
AAAACTTTGGACACTTTATTTATTAACAATAGATCCGATGATGATTTAAATAATGGGTCC
ACCGGTTCTAGTGGAAATGTAAATTAGTTAATATAGATTTTAAATTGTTAATTTAATGGG
TGCACACTATAATGGTGTAGACTTTATTTTAAAACAGTGCATTATTTGAGAATAATAATA
CAAAGTAATGAGTACACCGATTTAAGTAAAAATTATCTTATATTATTAATTTAATGGGTA
AACATATAATGGTGTAAATTTTAGTTACCTGTGTTTATAAGAGTTATATGATGGTATATT
CTTTGTTTGTAAAATTATGATTATTTAATATATATATCAATTGTATAAATGGAAAAAAGA
AGAAAAAAGAAAACAAATGGAGGGAGCCGTACGTACTACATACCCACATGTACGTACGTG
ACCACGGGGGAAAGATGAGAGGTGGGACCCATAGTATTTTGTTTGTTTTATGATCAATTT
TATCTAACGGGGTATAATATTGGACCCACCAATTTAAATGAAAATTAAGGGCTAGATGTT
TTGCTTTTTTATTAGAATTTCTAGGAATTTCTCTAATTTATTAGAGCGCCACGTGGCAAC
TTGAGAGCGATTGTAGGAAGTTTAATGGACTTTTAGTATATAATAGATAGATAGATAGAA
GATTAACTAATACTTAATTAATCACGTGCTAATGGACCACTCCGTTTTTCGTACGGAGAC
AATAAGTTCCCAACCCACATATGAGAACACAGCCTAATTATGCCGCGCTTTTTTAACGTG
ATCTTTCCTCCATATGAGCACCAAAGGTAGCATATGCACTATCCTTTTTTGGTCCAAAAG
GTCCGTAATAAACTCCACTATCACCACTCACCGGCCATGCTAGCATCTCTGCTTGTTGGA
TATCAGACACACAGTACCCTACCACCCTTCATTCTAAAGAGCATTTGTTCTGATTTTCTT
AGTGAACACACCGCTACAAGAATTCAAAGATTCTACTCGTAGTAGCTGATATTTGGTTAA
TTACATTCTGGGGTACACTGAGTAATTTGCAATTTTGCCGTGGGTGCAGGCAAACTCGGC
ACAATGGCAGGACATGAAGCACATCTGGGGGGCCACCTGGAGCCTCACCCCGGGCCCACT
GGTGGGGCCCTTCTCGGTGAGGCTGACAACCCTGACCACCAGGCAGACCCTCTCGGCCCA
GGATGTCATCCCCAAGAACTGGACCCCCAAGGCCACCTACACCTCTCGCCTCAACTTCGC
CTAGAGGAGGCCCCTCCGGCCCATGTTTGATGTTTCGTTGGCTGGGCTCCCCAAGGAGGC
CCATACGGCGTGTTTACTTTCGGATGAATTGTGTCGTTCTTGCGTTGCAGATTGGAGTAA
CTTGTTTTGTGTAGCTATAGCTATTGATGATACCTGCCTAATAGTGCGTGGGACTGGCAT
GTGGGCCCAGGTAGCCCTCTCCGAAGCGGAGAGAGAGCGTGATTTGGTTGGTGTTTTGCT
TGCCTTCCTGGGAGGTTTGGGATCCCATGTGGTTAGAGGCCCCGGTGAAAACATCCTCAG
GGTCTATATAGTTAAGTGCATATATACTTGTATGTGTGTCAAGTAGATGAACTGTATTAT
GTTCCCTAGCTACTTCCCTGTGCGCCCTGAAGTTTGATCTTTGTGTTTGATTATTTTTCT
CGTTACCAACTAATGTATTAATGTTGGTCTTAAACTGATGTTTGCAGTTATTTGTTCTGA
AACTACGCATACAATCAATAATTGGTTTGCCTTC

CDS
>13102.m04731
ATGGCAGCCTTCTCCTCGAGCTCGTCTGCTCCCATGTTGATACGCTCCGTGCTCTTCGTG
TCTCTCCTGTCCGCCGCGTTCGTCTTCGACTCCGGCGAGGCTGGTGCGGCGCACAGGGTG
GTCGACCCGGAGTGGCACCCGGCCACGGCCACCTGGTACGGCAGCGCTGACGGCGACGGC
AGCGACGGCGGCGCGTGTGGATACGGGACGCTGGTGGACGTGGTGCCGATGAAGACGCGG
GTGGGCGCGGTGAGCCCCGTGCTGTTCAAGGGCGGCGAGGGGTGCGGCGCCTGCTACAAG
GTGCGTTGCCTCGACGCCAGCATCTGCTCGCGCCGCGCCGTCACGGTCATCGTCACCGAC
GAGTGCCCCGGCGGCGTCTGCGCCTTCGGCCGCACGCACTTCGACCTCAGCGGCGCCGCC
TTCGCCAGGCTCGCCGTCGCCGGCCACGGCGGCCAGCTGCAGAACCGAGGCGAGATCTCG
GTGGTGTACCGCAGGACGGCGTGCAAGTACGGGGGGAAGAACATTGCCTTCCACGTGAAC
GAGGGCTCGACGACCTTCTGGCTCTCGCTTCTCGTCGAATTCGAGGATGGAGACGGCGAC
ATTGGATCCATGCAGCTAAAACAGGCAAACTCGGCACAATGGCAGGACATGAAGCACATC
TGGGGGGCCACCTGGAGCCTCACCCCGGGCCCACTGGTGGGGCCCTTCTCGGTGAGGCTG
ACAACCCTGACCACCAGGCAGACCCTCTCGGCCCAGGATGTCATCCCCAAGAACTGGACC
CCCAAGGCCACCTACACCTCTCGCCTCAACTTCGCCTAG

Protein
>13102.m04731
[MAAFSSSSSAP]
MLIRSVLFVSLLSAAFVFDSGEAGAAHRVVDPEWHPATATWYGSADGDG
SDGGACGYGTLVDVVPMKTRVGAVSPVLFKGGEGCGACYKVRCLDASICSRRAVTVIVTD
ECPGGVCAFGRTHFDLSGAAFARLAVAGHGGQLQNRGEISVVYRRTACKYGGKNIAFHVN
EGSTTFWLSLLVEFEDGDGDIGSMQLKQANSAQWQDMKHIWGATWSLTPGPLVGPFSVRL
TTLTTRQTLSAQDVIPKNWTPKATYTSRLNFA*
EXPB17
EXPB18
 
EXLA1
EXLA2
EXLA3
EXLA4
 
EXLB1  LOC_Os07g31390 Os07g0496250

Genomic sequence length: 1450 nucleotides
CDS length: 771 nucleotides
Protein length: 256 amino acids   MW:   27576      pI: 5.6552
 

Genomic Sequence
>13107.t02817
CTATTGCATGCACCGCATCTCTTAAGGCTTTGTACGCGTCTCCACGGTCCAAACCACCAA
TGGCTCAGCTGCTTCGTCGGCATCTCCCAGTCATACTTTCTCTCATCTTGTTCCTCTCTA
AGGCTACTGCAGATGCAAACTTCACTGTGTCGAGAGCAGCATATTACCCCAACTCTGATA
TAAAAGGGACCGAAAGTAAGTACCTTTCAGATAAAATTACTTCAATTGGGTTAGCATACC
TCACATGAACCCAAAAAAAGGAAGTTAAATTATACTTAAACTCGGAAATGCATAAAGTTT
CTAAGCTAGCTCATAGAAATTGTACTTGTTTCTAATTGTACTTCTAGACGGTGCATGTGA
GTATGGCGCATTTGGGGCAACACTCAACAATGGTGATGTTTCAGCTTCAGCAAGCCTCTA
CAGGGATGGGGTAGGCTGTGGTGCATGCTACCAGGTAATCCATCTGTATGACTAGGCTGT
GTGAATCATTAAAAAAAAATCCTTTCATTTTTACCAACAATGCCAATTACCAATAGGTGA
GGTGCACGAATCCTTACTATTGCTCTCCAAATGGCGTCACAATTGTGATCACCGACTCTG
GGGCGAGTGATGGCACTGATTTCATCCTCAGCCAGCATGCTTTCACCAGGATGGCGCAGA
GTACAGATGCTGGTACAGCGCTGCTAACCCTTGGCGTGGTTGGGATTGAGTACAGGAGGT
CTGCGCATATAATTAATCTGCCAAAATTCTTGATTGAACGGCATAACTATTTGTTTTCTT
AGATGAAAAATAAAATGAAAACTACTATAGAAACAATATCTGCATAAGGAATTATGGACG
CATGTGCTGTTACTGACATGTCATGGTTCTGGTTTCCTCTACACTACAGGGTTTCTTGTA
CCTACCCAAACAAGAATATCGTTTTCAAGATTACTGAGAGCAGCAATTTCCCCAACTATC
TTGAATTTGAGATCTGGTACCAGCAAGGCAACCAGGACATCATTGCGGTCCAGCTTTGTG
AGGTAAAGTGAAACACTATACAGTAAACAACATAAATTTGTAATTGTAGATGTACAGATC
AAGCGCAAAAACATTCGTAGTACTAAATACCTGTTCATGACTTGATGCAGACTGTGAATT
TGACATGCCAACTTCTGAGCCGGACTCATGGTGCAGTGTGGGCTGCTGTCTCTCCACCAA
GTGGGCCTCTGTCTATAAGGATGTTATTTAGTAGCGGAGCTCCCCGTGGTGGTGACACAT
GGCTAGTTCCGACGAACATAGTACCCCAGAACTGGACGGCAGGGGCCACATATGATTCTG
GGGTCCAAGTACAGCTGCAGTAGCTATATAAAATTAAAAGAAAAATTCTACCAATGCAAA
TGTATTCATAATATTTCATTATTCCCTTGTTTTCTGGTAATGTATATTCAAAGTATTTGT
TGTAATTTGT

CDS
>13107.m03174
ATGGCTCAGCTGCTTCGTCGGCATCTCCCAGTCATACTTTCTCTCATCTTGTTCCTCTCT
AAGGCTACTGCAGATGCAAACTTCACTGTGTCGAGAGCAGCATATTACCCCAACTCTGAT
ATAAAAGGGACCGAAAACGGTGCATGTGAGTATGGCGCATTTGGGGCAACACTCAACAAT
GGTGATGTTTCAGCTTCAGCAAGCCTCTACAGGGATGGGGTAGGCTGTGGTGCATGCTAC
CAGGTGAGGTGCACGAATCCTTACTATTGCTCTCCAAATGGCGTCACAATTGTGATCACC
GACTCTGGGGCGAGTGATGGCACTGATTTCATCCTCAGCCAGCATGCTTTCACCAGGATG
GCGCAGAGTACAGATGCTGGTACAGCGCTGCTAACCCTTGGCGTGGTTGGGATTGAGTAC
AGGAGGGTTTCTTGTACCTACCCAAACAAGAATATCGTTTTCAAGATTACTGAGAGCAGC
AATTTCCCCAACTATCTTGAATTTGAGATCTGGTACCAGCAAGGCAACCAGGACATCATT
GCGGTCCAGCTTTGTGAGACTGTGAATTTGACATGCCAACTTCTGAGCCGGACTCATGGT
GCAGTGTGGGCTGCTGTCTCTCCACCAAGTGGGCCTCTGTCTATAAGGATGTTATTTAGT
AGCGGAGCTCCCCGTGGTGGTGACACATGGCTAGTTCCGACGAACATAGTACCCCAGAAC
TGGACGGCAGGGGCCACATATGATTCTGGGGTCCAAGTACAGCTGCAGTAG

Protein
>13107.m03174
MAQLLRRHLPVILSLILFLSKATADANFTVSRAAYYPNSDIKGTENGACEYGAFGATLNN
GDVSASASLYRDGVGCGACYQVRCTNPYYCSPNGVTIVITDSGASDGTDFILSQHAFTRM
AQSTDAGTALLTLGVVGIEYRRVSCTYPNKNIVFKITESSNFPNYLEFEIWYQQGNQDII
AVQLCETVNLTCQLLSRTHGAVWAAVSPPSGPLSIRMLFSSGAPRGGDTWLVPTNIVPQN
WTAGATYDSGVQVQLQ*
 

 

This page was last updated on 03/22/06.

Send comments to dcosgrove@psu.edu