| Database | ID | Description |
|---|---|---|
| MobiDBLite | mobidb-lite | consensus disorder prediction |
| SMART | SM00526 | h15plus2 |
| Gene3D | G3DSA:1.10.10.10 | - |
| SUPERFAMILY | SSF46785 | "Winged helix" DNA-binding domain |
| ProSiteProfiles | PS51504 | Linker histone H1/H5 globular (H15) domain profile. |
| Pfam | PF00538 | linker histone H1 and H5 family |
| MapolyID | Mapoly0101s0021 | - |
| GO | GO:0006334 | nucleosome assembly |
| GO | GO:0000786 | nucleosome |
| GO | GO:0003677 | DNA binding |
No gene symbols are registered for this gene.
Target of MpmiR11796, encoding a linker histone H1/H5 family protein. Its mRNA is expressed exclusively in antheridiophores, where the targeting miRNA is lowest.
1 core 0 peripheral
Sequences: |
Gene UTR + CDS + intron |
| >Mp4g20750.1 GTAGACCCAG TCCTCGCTCC AACGCCGGCC GCTATGGTCC CATAGTCTGG ACGGCTCTCG CTGATGAAAG TTTCCTTAGA TTGATTTCCT TTTTTTCTTT CTTTCCTCAC CCTCCTCTCT ATCATCTCTC GCCCTCAATC CTCATCACCA ATTTCGGGAA TTCAGACATT CTCTCTCTGG AATATCAAAA TCGAGCTTTC AAATCACCGT CACGAGCAGC AAAATTGACG CAATTCGAAA GCGAAAGCGA CAATGAGAGG TCGCTCGAGA AGCTACAGCA GGTGTTCGAG CAGGGGGTCG GACGACGACA TCGACGACGA CGATGCGTAC TCGGACTATT CCGACACGTC GATGGCGTCG AGCAGGTCTA GATCTAGACG GAGCAGCCCG GGGGGGAGTG TGCTGAGCGA CTACGACAGC TACAACTCCT CCCGGGAGCC ATCGCGAGAA AGGCACGTAC CGACGGAAAC GCGCAGCCTG CGGGCGGACA TCAGGACCAA CAGCGGGCTG ATGCGGGCGG TGCAGTCGGC GGTGGCGAGG CTGAAGAAGA AGGGCGGGGC GCCGGTGAAG AGCATCAAGC AGGACATTGC GCACAAATTC GGAGTGACAT TGGCGGGGTC GCACCACACG CATCGCCTCC ATTCGGCGCT CAAGACGCTG CAGAAGGCAG GGAAGCTCGT GAAGACGCAG CAGGGCTACA AGCTCGTGCC GCATGGCGGC GGCCAGCAGG TGATGCGCAG GAGGAGGAGG CGGAGGCATG GGCGCAAGGG CAAAAAGCCC AAGCGCCATA GAAGAAGAAG AAGAAGAAGG CGCCGCGGGC GCCGCAAGGC GCACCGCGGG CGCAAGAAGA AGCGCGGCGG GCGGCGCAAG GGCAAACGGC GCCACAGAAG AAGAAGACGC GGCAAGAGCC ACTCCGGCGT GTCGTCGGCC TCCAAGCCGA ACACGCGGTC GCGCACCGCG GCCCTGGCCA AGCGCCGCGG CAAGGCTAAG GCCTCCGCGT CTCGCGCCTC GCGCGGCCGC CAGCGCGCCA TCGGCGTGCC CGGCGAGTCC ACGCGCGGCG ACTACACTCC CCCCGCCGCC CGCAGCTCGC CCCGCCTGCC CTCCCGGAAC CCGTCGCGCG TCCCGTCCCG CGTGCCGTCG CGGATGCCCT CTAGAATGCC GTCGCGCAGT CACTCTCCGG CGCGCGGCGG CTTTGACCCC CGCGAGACGC CCCCCAGCCG CGACTCGCCG GGCCCCCACG TCCGGAACTC TCTGGACGAC GACTTCGACC AGTGAACGTT AAACCTTCAA GACGAGCCCC GCGTCCATCA TCACCACCAC CACCCCGACC TTCATCCTCC TCCTCCTGCT CCTCATCATC ATCCCCATCA TTCATCGCGA CCCATTTTCT GTGGAGGCCT GGACGACGAG GAATATTCCA GGCTCGAACT CCCTCAAGAC ATGAATTCCA AGATCATGGC CGACGTTTGG CATCCCATTT GCGAGAGCTT TCACAAGGAA TATATTTGAT CCGGTCGGTC GCCCTTCCGC CCTCGCGGCC ATGAATGCCA TGGACGACGA CGACGACGAG CAGCGTCGAC CCGATCACAC CTGACACTAC GCGAGCGAGC GAGTGAGCGA GCGAGCGCAA TTTCGTTTCT TTCAGTTTTA ATAAGAGGCC CCTCTCTCTC AATCTCGACG AGAAATCCTG TTGGGAGCGA AGGGAGGAGA AGAAGAACAA CCGCGACGAT GGAGAACGGC GACAATCGCT GGCGATTCAT TTCGACCGAA GCCAGAACTT CCACATTCGT AGATAAAATC TGTTCGACTC CAAAAAAATT TGTCCTGTCG TAGTAGCACT GAAAGACTTT TCTTCAGACT GCTCTTGAAC ACCTTTAAGT GTAATGCTTG GCTGTACTAC GTGCGTTTTC GCGTTCTTTC CTTAGTCTAT TCAGATCGAT CAAATCGATT GATGGGACTT TCTACAAGAG CCTATGCGTC CTGTATATAT CGATTCATAC AGATATCGTC AGTCTGTTGC GGTGGCTGTA TGAAGGCTGC TTGGGCCGCC TCGCGCCGAG GAACTCTTTC CAAGGATATG CCGCTGCAGC TTCCTTTGGA AATCCATCTG CTGAGTCTCG AGAGTACGTC ACGGCTGCCA TTTAGTCCAA GAATACGTAG ACCTCTAGCT CGGGATTCGA TGCGGAGGCC GGGGCCAGAT TACCGTAGTT TGACTGTCCA AGAATTTGAT CAGACATGTA CATGAACACT GTTAATTTCA TATTCTTTTA TAAATTCTTG TATGCGAACA TTGAGGTCAA |
mRNA UTR + CDS |
| >Mp4g20750.1 GTAGACCCAG TCCTCGCTCC AACGCCGGCC GCTATGGTCC CATAGTCTGG ACGGCTCTCG CTGATGAAAG TTTCCTTAGA TTGATTTCCT TTTTTTCTTT CTTTCCTCAC CCTCCTCTCT ATCATCTCTC GCCCTCAATC CTCATCACCA ATTTCGGGAA TTCAGACATT CTCTCTCTGG AATATCAAAA TCGAGCTTTC AAATCACCGT CACGAGCAGC AAAATTGACG CAATTCGAAA GCGAAAGCGA CAATGAGAGG TCGCTCGAGA AGCTACAGCA GGTGTTCGAG CAGGGGGTCG GACGACGACA TCGACGACGA CGATGCGTAC TCGGACTATT CCGACACGTC GATGGCGTCG AGCAGGTCTA GATCTAGACG GAGCAGCCCG GGGGGGAGTG TGCTGAGCGA CTACGACAGC TACAACTCCT CCCGGGAGCC ATCGCGAGAA AGGCACGTAC CGACGGAAAC GCGCAGCCTG CGGGCGGACA TCAGGACCAA CAGCGGGCTG ATGCGGGCGG TGCAGTCGGC GGTGGCGAGG CTGAAGAAGA AGGGCGGGGC GCCGGTGAAG AGCATCAAGC AGGACATTGC GCACAAATTC GGAGTGACAT TGGCGGGGTC GCACCACACG CATCGCCTCC ATTCGGCGCT CAAGACGCTG CAGAAGGCAG GGAAGCTCGT GAAGACGCAG CAGGGCTACA AGCTCGTGCC GCATGGCGGC GGCCAGCAGG TGATGCGCAG GAGGAGGAGG CGGAGGCATG GGCGCAAGGG CAAAAAGCCC AAGCGCCATA GAAGAAGAAG AAGAAGAAGG CGCCGCGGGC GCCGCAAGGC GCACCGCGGG CGCAAGAAGA AGCGCGGCGG GCGGCGCAAG GGCAAACGGC GCCACAGAAG AAGAAGACGC GGCAAGAGCC ACTCCGGCGT GTCGTCGGCC TCCAAGCCGA ACACGCGGTC GCGCACCGCG GCCCTGGCCA AGCGCCGCGG CAAGGCTAAG GCCTCCGCGT CTCGCGCCTC GCGCGGCCGC CAGCGCGCCA TCGGCGTGCC CGGCGAGTCC ACGCGCGGCG ACTACACTCC CCCCGCCGCC CGCAGCTCGC CCCGCCTGCC CTCCCGGAAC CCGTCGCGCG TCCCGTCCCG CAATGCCGTC GCGCAGTCAC TCTCCGGCGC GCGGCGGCTT TGACCCCCGC GAGACGCCCC CCAGCCGCGA CTCGCCGGGC CCCCACGTCC GGAACTCTCT GGACGACGAC TTCGACCAGT GAACGTTAAA CCTTCAAGAC GAGCCCCGCG TCCATCATCA CCACCACCAC CCCGACCTTC ATCCTCCTCC TCCTGCTCCT CATCATCATC CCCATCATTC ATCGCGACCC ATTTTCTGTG GAGGCCTGGA CGACGAGGAA TATTCCAGGC TCGAACTCCC TCAAGACATG AATTCCAAGA TCATGGCCGA CGTTTGGCAT CCCATTTGCG AGAGCTTTCA CAAGGAATAT ATTTGATCCG GTCGGTCGCC CTTCCGCCCT CGCGGCCATG AATGCCATGG ACGACGACGA CGACGAGCAG CGTCGACCCG ATCACACCTG ACACTACGCG AGCGAGCGAG TGAGCGAGCG AGCGCAATTT CGTTTCTTTC AGTTTTAATA AGAGGCCCCT CTCTCTCAAT CTCGACGAGA AATCCTGTTG GGAGCGAAGG GAGGAGAAGA AGAACAACCG CGACGATGGA GAACGGCGAC AATCGCTGGC GATTCATTTC GACCGAAGCC AGAACTTCCA CATTCGTAGA TAAAATCTGT TCGACTCCAA AAAAATTTGT CCTGTCGTAG TAGCACTGAA AGACTTTTCT TCAGACTGCT CTTGAACACC TTTAAGTGTA ATGCTTGGCT GTACTACGTG CGTTTTCGCG TTCTTTCCTT AGTCTATTCA GATCGATCAA ATCGATTGAT GGGACTTTCT ACAAGAGCCT ATGCGTCCTG TATATATCGA TTCATACAGA TATCGTCAGT CTGTTGCGGT GGCTGTATGA AGGCTGCTTG GGCCGCCTCG CGCCGAGGAA CTCTTTCCAA GGATATGCCG CTGCAGCTTC CTTTGGAAAT CCATCTGCTG AGTCTCGAGA GTACGTCACG GCTGCCATTT AGTCCAAGAA TACGTAGACC TCTAGCTCGG GATTCGATGC GGAGGCCGGG GCCAGATTAC CGTAGTTTGA CTGTCCAAGA ATTTGATCAG ACATGTACAT GAACACTGTT AATTTCATAT TCTTTTATAA ATTCTTGTAT GCGAACATTG AGGTCAA |
CDS |
| >Mp4g20750.1 ATGAGAGGTC GCTCGAGAAG CTACAGCAGG TGTTCGAGCA GGGGGTCGGA CGACGACATC GACGACGACG ATGCGTACTC GGACTATTCC GACACGTCGA TGGCGTCGAG CAGGTCTAGA TCTAGACGGA GCAGCCCGGG GGGGAGTGTG CTGAGCGACT ACGACAGCTA CAACTCCTCC CGGGAGCCAT CGCGAGAAAG GCACGTACCG ACGGAAACGC GCAGCCTGCG GGCGGACATC AGGACCAACA GCGGGCTGAT GCGGGCGGTG CAGTCGGCGG TGGCGAGGCT GAAGAAGAAG GGCGGGGCGC CGGTGAAGAG CATCAAGCAG GACATTGCGC ACAAATTCGG AGTGACATTG GCGGGGTCGC ACCACACGCA TCGCCTCCAT TCGGCGCTCA AGACGCTGCA GAAGGCAGGG AAGCTCGTGA AGACGCAGCA GGGCTACAAG CTCGTGCCGC ATGGCGGCGG CCAGCAGGTG ATGCGCAGGA GGAGGAGGCG GAGGCATGGG CGCAAGGGCA AAAAGCCCAA GCGCCATAGA AGAAGAAGAA GAAGAAGGCG CCGCGGGCGC CGCAAGGCGC ACCGCGGGCG CAAGAAGAAG CGCGGCGGGC GGCGCAAGGG CAAACGGCGC CACAGAAGAA GAAGACGCGG CAAGAGCCAC TCCGGCGTGT CGTCGGCCTC CAAGCCGAAC ACGCGGTCGC GCACCGCGGC CCTGGCCAAG CGCCGCGGCA AGGCTAAGGC CTCCGCGTCT CGCGCCTCGC GCGGCCGCCA GCGCGCCATC GGCGTGCCCG GCGAGTCCAC GCGCGGCGAC TACACTCCCC CCGCCGCCCG CAGCTCGCCC CGCCTGCCCT CCCGGAACCC GTCGCGCGTC CCGTCCCGCA ATGCCGTCGC GCAGTCACTC TCCGGCGCGC GGCGGCTTTG A |
Protein |
| >Mp4g20750.1 MRGRSRSYSR CSSRGSDDDI DDDDAYSDYS DTSMASSRSR SRRSSPGGSV LSDYDSYNSS REPSRERHVP TETRSLRADI RTNSGLMRAV QSAVARLKKK GGAPVKSIKQ DIAHKFGVTL AGSHHTHRLH SALKTLQKAG KLVKTQQGYK LVPHGGGQQV MRRRRRRRHG RKGKKPKRHR RRRRRRRRGR RKAHRGRKKK RGGRRKGKRR HRRRRRGKSH SGVSSASKPN TRSRTAALAK RRGKAKASAS RASRGRQRAI GVPGESTRGD YTPPAARSSP RLPSRNPSRV PSRNAVAQSL SGARRL |