| Database | ID | Description |
|---|---|---|
| Gene3D | G3DSA:1.20.120.520 | nmb1532 protein domain like |
| Pfam | PF01814 | Hemerythrin HHE cation binding domain |
| PANTHER | PTHR35585 | HHE DOMAIN PROTEIN (AFU_ORTHOLOGUE AFUA_4G00730) |
| MapolyID | Mapoly0112s0056 | - |
No gene symbols are registered for this gene.
Marchantia hemerythrin (Hr) gene used as a comparator ortholog/BLAST query to map Hr gene distribution across early land plants; two to three intact Hr copies found in M. polymorpha versus a degraded partial sequence in the moss P. patens.
0 core 1 peripheral
Sequences: |
Gene UTR + CDS + intron |
| >Mp4g09510.1 GAGCATTCCT TGAATCGTCA TCAGCTGCAC AAGACATTCC GACGTTCAGA TTCATTACTC AGTCGAATAC AATGGCGAGA GCAGTCATGG CAAGAGTTGG AGTTCCTGCA GTTTACTTCT CCGTCGCGAG GGCAGCACCA GCATTGTGTA ACAGGAGTTC AGTGGTACTC ATCTGATCTT GGTCTTCCGA TATTGCCTAT CGTTTCTTGT CTGAACTGAG AAGCTATGGG AGACATGAAC TGCTGTTGAG CTGATTATAT TCTTCGTTGT TCAAGCTTGG ACATGAGCTG CTTACGCTGA CTGGTACATG TGCATCTTGC AGATGTTCGC ACCTGCTTCC AGGTTCTCAG GACGGGCTCT GAACTCCACC AATTTGAGAC AAGCTGTGTC TTCTCGCACT TGGCAGGTAA GGCTGTTCCA GACCTCATCA AGGCTAAGAA TAAATACGAT TCACCCACAC CACTCAGCTT CAAATTGCCT ATAGTGTTCG TCACCTGCTT AGTTGTTGGA AGCTAGAGTT TGTAAGTTCT CATCACCCGC ACAGATTCCA TCATTCTCAG GACTACTGAA CTTACTTCAA TAATCATGAT TTTGACACTT AACGGGTTCC AGCACATTAC CATTGACTTT TAGAGGAAGA TTGTTTAGCA CAATTTTTGT CTGCTGAGGT CGAGCCCTTC CCAGTCAAAC TGTACGTGAT GTCGTGAACA GAACGTTATA TGGAGAACAA ACAAATCTGT CGTGATCTCT GTAAGCATGA ACTTGTAAGA ACTCTTCGGT GAATGTGGTG AATGTGACCG CACAGAGAAG CTAATGTTTG CTTGTGGAAT GTATACAGGG GGTTCAGGTC CGAGCCCGAA GCACAGCGCA GGAAGAAAAG GTGGCAGTAG GAGATGATAT CATAGACAAG ATCAATCACG ATCACCGCGA GTTGGAACAT TTCTTCCAAA AATATAAGGA TTCACACAAA CAGGGTGATG ATATGGAGGC AAGGAAATGG TTTAACCAGT TCGTGTGGGA AGTCTCTCGT CACGCAGTAT GTGAGGAACT TATCTTCTAT CCGATGCTAA CGAATATGGG AGAAGAGGGG AAGAAAATGG CTGATGAGGC CATTGAGTAT CACCACAAGA CGAAGGAGAT TCTTGCTGAC CTCCAGACTA CATTGGATAT GGACGAATTC GACCGAAAGA TGGACAAAAT GTGGGCGGAC CTGAAGGAAC ATATGAACAC GGAGGAAAGT GAGGATCTGC CGTTCCTCCG AGACAATACC GATGTCTCGA GTCGAGAGAA TGCTGGAAAG ACTTTCGCTC TGGGTAAAAA CCTCGTCCCA ACTCATCCCC ATCCCGCAGT TCCTGAAAGG CCCGTGGCCT TAGAAGCTGC TGTGGGACTA CTTATCACCC CGTTGGACAA ACTTCGTGAT ATCTTCACTC CTTTTCCATC ACAAAACAAA TCTGTAAACT AAATCCACGT AAGGTCGGTG GAGTCCGGGC TCGTTGAAAA TAGTCCATAG CTTCCGTGTA CATTCGATCA GTTGAGGTAG GCAATCTATG TCGTGTGCAG CTTATGCTCC GTCCTGTTAC TCTAGCTTAG TCTTAGTAGG ACTGGAAATG CTCTCACAGT AACGGGGTAT AATTACTCCC TAAGTTTGGG TGAGAGAAAC CTGTTCTCTA GTACATATGT GTAAGAATAA AGATGAATAC TTG |
mRNA UTR + CDS |
| >Mp4g09510.1 GAGCATTCCT TGAATCGTCA TCAGCTGCAC AAGACATTCC GACGTTCAGA TTCATTACTC AGTCGAATAC AATGGCGAGA GCAGTCATGG CAAGAGTTGG AGTTCCTGCA GTTTACTTCT CCGTCGCGAG GGCAGCACCA GCATTGTGTA ACAGGAGTTC AGTGATGTTC GCACCTGCTT CCAGGTTCTC AGGACGGGCT CTGAACTCCA CCAATTTGAG ACAAGCTGTG TCTTCTCGCA CTTGGCAGGG GGTTCAGGTC CGAGCCCGAA GCACAGCGCA GGAAGAAAAG GTGGCAGTAG GAGATGATAT CATAGACAAG ATCAATCACG ATCACCGCGA GTTGGAACAT TTCTTCCAAA AATATAAGGA TTCACACAAA CAGGGTGATG ATATGGAGGC AAGGAAATGG TTTAACCAGT TCGTGTGGGA AGTCTCTCGT CACGCAGTAT GTGAGGAACT TATCTTCTAT CCGATGCTAA CGAATATGGG AGAAGAGGGG AAGAAAATGG CTGATGAGGC CATTGAGTAT CACCACAAGA CGAAGGAGAT TCTTGCTGAC CTCCAGACTA CATTGGATAT GGACGAATTC GACCGAAAGA TGGACAAAAT GTGGGCGGAC CTGAAGGAAC ATATGAACAC GGAGGAAAGT GAGGATCTGC CGTTCCTCCG AGACAATACC GATGTCTCGA GTCGAGAGAA TGCTGGAAAG ACTTTCGCTC TGGGTAAAAA CCTCGTCCCA ACTCATCCCC ATCCCGCAGT TCCTGAAAGG CCCGTGGCCT TAGAAGCTGC TGTGGGACTA CTTATCACCC CGTTGGACAA ACTTCGTGAT ATCTTCACTC CTTTTCCATC ACAAAACAAA TCTGTAAACT AAATCCACGT AAGGTCGGTG GAGTCCGGGC TCGTTGAAAA TAGTCCATAG CTTCCGTGTA CATTCGATCA GTTGAGGTAG GCAATCTATG TCGTGTGCAG CTTATGCTCC GTCCTGTTAC TCTAGCTTAG TCTTAGTAGG ACTGGAAATG CTCTCACAGT AACGGGGTAT AATTACTCCC TAAGTTTGGG TGAGAGAAAC CTGTTCTCTA GTACATATGT GTAAGAATAA AGATGAATAC TTG |
CDS |
| >Mp4g09510.1 ATGGCGAGAG CAGTCATGGC AAGAGTTGGA GTTCCTGCAG TTTACTTCTC CGTCGCGAGG GCAGCACCAG CATTGTGTAA CAGGAGTTCA GTGATGTTCG CACCTGCTTC CAGGTTCTCA GGACGGGCTC TGAACTCCAC CAATTTGAGA CAAGCTGTGT CTTCTCGCAC TTGGCAGGGG GTTCAGGTCC GAGCCCGAAG CACAGCGCAG GAAGAAAAGG TGGCAGTAGG AGATGATATC ATAGACAAGA TCAATCACGA TCACCGCGAG TTGGAACATT TCTTCCAAAA ATATAAGGAT TCACACAAAC AGGGTGATGA TATGGAGGCA AGGAAATGGT TTAACCAGTT CGTGTGGGAA GTCTCTCGTC ACGCAGTATG TGAGGAACTT ATCTTCTATC CGATGCTAAC GAATATGGGA GAAGAGGGGA AGAAAATGGC TGATGAGGCC ATTGAGTATC ACCACAAGAC GAAGGAGATT CTTGCTGACC TCCAGACTAC ATTGGATATG GACGAATTCG ACCGAAAGAT GGACAAAATG TGGGCGGACC TGAAGGAACA TATGAACACG GAGGAAAGTG AGGATCTGCC GTTCCTCCGA GACAATACCG ATGTCTCGAG TCGAGAGAAT GCTGGAAAGA CTTTCGCTCT GGGTAAAAAC CTCGTCCCAA CTCATCCCCA TCCCGCAGTT CCTGAAAGGC CCGTGGCCTT AGAAGCTGCT GTGGGACTAC TTATCACCCC GTTGGACAAA CTTCGTGATA TCTTCACTCC TTTTCCATCA CAAAACAAAT CTGTAAACTA A |
Protein |
| >Mp4g09510.1 MARAVMARVG VPAVYFSVAR AAPALCNRSS VMFAPASRFS GRALNSTNLR QAVSSRTWQG VQVRARSTAQ EEKVAVGDDI IDKINHDHRE LEHFFQKYKD SHKQGDDMEA RKWFNQFVWE VSRHAVCEEL IFYPMLTNMG EEGKKMADEA IEYHHKTKEI LADLQTTLDM DEFDRKMDKM WADLKEHMNT EESEDLPFLR DNTDVSSREN AGKTFALGKN LVPTHPHPAV PERPVALEAA VGLLITPLDK LRDIFTPFPS QNKSVN |