| Database | ID | Description |
|---|---|---|
| Coils | Coil | Coil |
| Gene3D | G3DSA:1.20.120.520 | nmb1532 protein domain like |
| PANTHER | PTHR35585 | HHE DOMAIN PROTEIN (AFU_ORTHOLOGUE AFUA_4G00730) |
| Pfam | PF01814 | Hemerythrin HHE cation binding domain |
| MapolyID | Mapoly0042s0093 | - |
No gene symbols are registered for this gene.
This Marchantia hemerythrin (Hr) gene was used as a comparator ortholog and BLAST query to map the distribution of Hr genes across early land plants. Two to three intact Hr copies were found in M. polymorpha, versus a degraded partial sequence in the moss P. patens.
0 core 1 peripheral
Sequences:
Gene UTR + CDS + intron
| >Mp2g14710.1 GAACGTTCTG AGCGACGAGT CGAATCTACT GAGAAAGAAT AATCTGATTC AGAAAGGAAA AAGTTACTAT TCTAGCTCAG TTTGTTGACG ATGTCGAGAA CGATGGCACA AGTTGCAAGT TCAGTAATTT GCATGGCTTC GGCTCAGCAG GCAGCATCAA CATTGGGCAA CATAAAATCT GCGGTCTGTA CATCACACGA CCATTTACTC ATTCATGCTC ATCGATGGTT TGGAGTTATT GAATACACGT TCTCATATGA ATGAACAAAC CTGCTCGTTA CTTCTGCGAA ATATCAGGCT GACGTTCATT GGAACACTTT TTCAGAAGTT GGCACCGTAT TCGAGTCTTT CAGGAAGGGC TCGGAGAACA ACTGTCAGTG TCAGTTCGCC CGCTCTCGTG CACTCGTGGC AGGTAGTCTC ATTCTCTGCG TTTCTAAATC GTTCCCTGTT TCTGGCTAGA TGGAATAAGA TTTGAAACTA GAGTGCCTTT TTTCCTCAGT CTGAATAATG CTCTTCAGAC CAGTCTTCTT TCTTTTCCAG AAGCCCACGA GTAGCTGACA TTCTAGGAAA TACTATCAAA AGTAGTTCCC TGTGTTGACA GTTGTTTTGT TCTTTATATT CAATGCAATA TCTACTGATA AGTGTTCAGA GCTCCAGGCC GTGTTCAAAA GGTAAAGAAA ATGTTCATAT CAATGAAGAA CTGTCAGAAG GTTACTGACT GCCAATGGTC GAGAAATACA GGGGCGTCAA ATTCGAGCGA AAATCTCAGC CGAGAGAGGA GGAGAGGAGG AGGAGGAGGT GACAACAAAA GAAGATATCA TTTTAGAAGT GAAGCACGAC CACGCGGAGT TGGAAGAGTG CTTTCAGAGG TACAAGAAAG CACACTCTAA GGGTCAAGAC CACGAGGCCA GAAATTTGTT CAACCAATTC GTGTGGGAAA TTTCTCGTCA CTCGGTGTCG GAGGAGCTCA TCCTCTATCC CATGATGGAT CTACTAGGGG ATAGGGGCAA GGAGTTAGCC GACCAATCCC GGGAAGATCA CCACCGAACC AAAGAGATTC TGGCCGAGCT TCAGACCATT TCCGATCCAT CACTGTTCGA GAAAAGGCTA AACATCATGA TGGCAGAGTT GAGGGACCAT ATGAAGATGG AGGAGGAGGA AGACTTAGCT TATTTGGAGG CGAGAACCGA TCTCGCCACG CGAGTGACAG CAGGGAGGAC TTTTTCTTCG GGGAAGAAGA TAGTGCCAAC TCATCCCCAT CCCGAGATCC CAGACAATTT CGTGGCATTG GAAGTCGCTC TGCGACTACT GCGAAATCTC TTCACTCTTT CCCCTTCGAA GGGCGAGTCT AAGTGACAGT AAATCGTATA TCGTGATAGT CTAGTAAGTT TCTAGTCCAA AGGTTTGACC GTCATCACAA GTAAGGGGAA TATAGGTGAT GGTCAATACG TTAGATTCCC CAGCACAGAG ACCAGATTGC GTCCTCATGG CGCCTGAATC GATTTGGTTC AGTCTTGTTC GGTAGAGGCT TACCGCCCTC TTTCCCGATT GCTGTGGGCA AGGCCACACA GCTACGATGG GGGCTATCGA GGGTAAGGTC GGTCTTCTTC CCTTGTGCGT CCCAATCAGA GGGTAAAATG TCCTTGTACT TATTACTGCC CTGGCCTCGG GATCACAGTC CTCCGACAGC GTTCTAGTCT TTCGAGTCCT TCGAGTTCTG AGATGTTGGA ACTCGTCTGT TGCTTGAAAG ATGGCCATCG GAGCCTTAGA TTGTCTAGAA TGACTTCACC CACCGGCCCC 
CGGGTAGATC GATGACGGCA TTATCTGCCG GCGCTGTACG TGCTCGAGTG GTGGGTGTCG AGGCTCGAGG TCATCCTCTT CTGTCTACAG AATGGCAAAG TTGTGTACTA AAACTTGTAA TAACACATTT TTCAAGGCGA TTGTGTCGT |
mRNA UTR + CDS
| >Mp2g14710.1 GAACGTTCTG AGCGACGAGT CGAATCTACT GAGAAAGAAT AATCTGATTC AGAAAGGAAA AAGTTACTAT TCTAGCTCAG TTTGTTGACG ATGTCGAGAA CGATGGCACA AGTTGCAAGT TCAGTAATTT GCATGGCTTC GGCTCAGCAG GCAGCATCAA CATTGGGCAA CATAAAATCT GCGAAGTTGG CACCGTATTC GAGTCTTTCA GGAAGGGCTC GGAGAACAAC TGTCAGTGTC AGTTCGCCCG CTCTCGTGCA CTCGTGGCAG GGGCGTCAAA TTCGAGCGAA AATCTCAGCC GAGAGAGGAG GAGAGGAGGA GGAGGAGGTG ACAACAAAAG AAGATATCAT TTTAGAAGTG AAGCACGACC ACGCGGAGTT GGAAGAGTGC TTTCAGAGGT ACAAGAAAGC ACACTCTAAG GGTCAAGACC ACGAGGCCAG AAATTTGTTC AACCAATTCG TGTGGGAAAT TTCTCGTCAC TCGGTGTCGG AGGAGCTCAT CCTCTATCCC ATGATGGATC TACTAGGGGA TAGGGGCAAG GAGTTAGCCG ACCAATCCCG GGAAGATCAC CACCGAACCA AAGAGATTCT GGCCGAGCTT CAGACCATTT CCGATCCATC ACTGTTCGAG AAAAGGCTAA ACATCATGAT GGCAGAGTTG AGGGACCATA TGAAGATGGA GGAGGAGGAA GACTTAGCTT ATTTGGAGGC GAGAACCGAT CTCGCCACGC GAGTGACAGC AGGGAGGACT TTTTCTTCGG GGAAGAAGAT AGTGCCAACT CATCCCCATC CCGAGATCCC AGACAATTTC GTGGCATTGG AAGTCGCTCT GCGACTACTG CGAAATCTCT TCACTCTTTC CCCTTCGAAG GGCGAGTCTA AGTGACAGTA AATCGTATAT CGTGATAGTC TAGTAAGTTT CTAGTCCAAA GGTTTGACCG TCATCACAAG TAAGGGGAAT ATAGGTGATG GTCAATACGT TAGATTCCCC AGCACAGAGA CCAGATTGCG TCCTCATGGC GCCTGAATCG ATTTGGTTCA GTCTTGTTCG GTAGAGGCTT ACCGCCCTCT TTCCCGATTG CTGTGGGCAA GGCCACACAG CTACGATGGG GGCTATCGAG GGTAAGGTCG GTCTTCTTCC CTTGTGCGTC CCAATCAGAG GGTAAAATGT CCTTGTACTT ATTACTGCCC TGGCCTCGGG ATCACAGTCC TCCGACAGCG TTCTAGTCTT TCGAGTCCTT CGAGTTCTGA GATGTTGGAA CTCGTCTGTT GCTTGAAAGA TGGCCATCGG AGCCTTAGAT TGTCTAGAAT GACTTCACCC ACCGGCCCCC GGGTAGATCG ATGACGGCAT TATCTGCCGG CGCTGTACGT GCTCGAGTGG TGGGTGTCGA GGCTCGAGGT CATCCTCTTC TGTCTACAGA ATGGCAAAGT TGTGTACTAA AACTTGTAAT AACACATTTT TCAAGGCGAT TGTGTCGT |
CDS
| >Mp2g14710.1 ATGTCGAGAA CGATGGCACA AGTTGCAAGT TCAGTAATTT GCATGGCTTC GGCTCAGCAG GCAGCATCAA CATTGGGCAA CATAAAATCT GCGAAGTTGG CACCGTATTC GAGTCTTTCA GGAAGGGCTC GGAGAACAAC TGTCAGTGTC AGTTCGCCCG CTCTCGTGCA CTCGTGGCAG GGGCGTCAAA TTCGAGCGAA AATCTCAGCC GAGAGAGGAG GAGAGGAGGA GGAGGAGGTG ACAACAAAAG AAGATATCAT TTTAGAAGTG AAGCACGACC ACGCGGAGTT GGAAGAGTGC TTTCAGAGGT ACAAGAAAGC ACACTCTAAG GGTCAAGACC ACGAGGCCAG AAATTTGTTC AACCAATTCG TGTGGGAAAT TTCTCGTCAC TCGGTGTCGG AGGAGCTCAT CCTCTATCCC ATGATGGATC TACTAGGGGA TAGGGGCAAG GAGTTAGCCG ACCAATCCCG GGAAGATCAC CACCGAACCA AAGAGATTCT GGCCGAGCTT CAGACCATTT CCGATCCATC ACTGTTCGAG AAAAGGCTAA ACATCATGAT GGCAGAGTTG AGGGACCATA TGAAGATGGA GGAGGAGGAA GACTTAGCTT ATTTGGAGGC GAGAACCGAT CTCGCCACGC GAGTGACAGC AGGGAGGACT TTTTCTTCGG GGAAGAAGAT AGTGCCAACT CATCCCCATC CCGAGATCCC AGACAATTTC GTGGCATTGG AAGTCGCTCT GCGACTACTG CGAAATCTCT TCACTCTTTC CCCTTCGAAG GGCGAGTCTA AGTGA |
Protein
| >Mp2g14710.1 MSRTMAQVAS SVICMASAQQ AASTLGNIKS AKLAPYSSLS GRARRTTVSV SSPALVHSWQ GRQIRAKISA ERGGEEEEEV TTKEDIILEV KHDHAELEEC FQRYKKAHSK GQDHEARNLF NQFVWEISRH SVSEELILYP MMDLLGDRGK ELADQSREDH HRTKEILAEL QTISDPSLFE KRLNIMMAEL RDHMKMEEEE DLAYLEARTD LATRVTAGRT FSSGKKIVPT HPHPEIPDNF VALEVALRLL RNLFTLSPSK GESK |
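
The CDS and Protein entries above can be cross-checked by translating the coding sequence with the standard genetic code. The following is a minimal sketch, not part of the database record: the helper names `clean_sequence` and `translate` are hypothetical, and it assumes the space-grouped, pipe-delimited layout used on this page and the standard nuclear codon table.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(record: str) -> str:
    """Strip table pipes, the '>' header token, and the 10-base spacing.

    Hypothetical helper: assumes the record looks like
    '| >ID AAAAAAAAAA CCCCCCCCCC ... |' as on this page.
    """
    tokens = record.replace("|", " ").split()
    return "".join(t for t in tokens if not t.startswith(">")).upper()


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters outside T/C/A/G are reported as 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS entry, copied from the record above.
cds_start = "| >Mp2g14710.1 ATGTCGAGAA CGATGGCACA AGTTGCAAGT"
print(translate(clean_sequence(cds_start)))  # MSRTMAQVAS
```

Passing the full CDS row through the same two functions and comparing the result with the Protein row (after removing its spaces) gives a quick consistency check between the two entries.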