Showing Protein Cytosine-specific methyltransferase (HMDBP10631)
Identification | |
---|---|
HMDB Protein ID | HMDBP10631 |
Secondary Accession Numbers | |
Name | Cytosine-specific methyltransferase |
Synonyms | Not Available |
Gene Name | Not Available |
Protein Type | Enzyme |
Biological Properties | |
General Function | Involved in DNA binding |
Specific Function | S-adenosyl-L-methionine + DNA = S-adenosyl-L-homocysteine + DNA containing 5-methylcytosine |
Pathways | Not Available |
Reactions | Not Available |
GO Classification | |
Cellular Location | |
Gene Properties | |
Chromosome Location | Not Available |
Locus | Not Available |
SNPs | Not Available |
Gene Sequence |
>4822 bp GGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAG AGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCT GCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATT ATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGA GAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGC AAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTC CAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACC TAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAA GGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCAT CAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGT TGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAA GGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAAC ACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGA CGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCT AGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGA TGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAAC GGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTG CATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGA CGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGA GTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTA CTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACT CTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCGTCTCTTGAAGGTGGTGT TAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGA AAAGGCCCTCATCGGCTTCAGCACCTCATTTGCCGAATACATTCTGATGGATCCCAGTCC CGAGTATGCGCCCATATTTGGGCTGATGCAGGAGAAGATCTACATCAGCAAGATTGTGGT GGAGTTCCTGCAGAGCAATTCCGACTCGACCTATGAGGACCTGATCAACAAGATCGAGAC CACGGTTCCTCCTTCTGGCCTCAACTTGAACCGCTTCACAGAGGACTCCCTCCTGCGACA CGCGCAGTTTGTGGTGGAGCAGGTGGAGAGTTATGACGAGGCCGGGGACAGTGATGAGCA GCCCATCTTCCTGACGCCCTGCATGCGGGACCTGATCAAGCTGGCTGGGGTCACGCTGGG ACAGAGGCGAGCCCAGGCGAGGCGGCAGACCATCAGGCATTCTACCAGGGAGAAGGACAG GGGACCCACGAAAGCCACCACCACCAAGCTGGTCTACCAGATCTTCGATACTTTCTTCGC AGAGCAAATTGAAAAGGATGACAGAGAAGACAAGGAGAACGCCTTTAAGCGCCGGCGATG 
TGGCGTCTGTGAGGTGTGTCAGCAGCCTGAGTGTGGGAAATGTAAAGCCTGCAAGGACAT GGTTAAATTTGGTGGCAGTGGACGGAGCAAGCAGGCTTGCCAAGAGCGGAGGTGTCCCAA TATGGCCATGAAGGAGGCAGATGACGATGAGGAAGTCGATGATAACATCCCAGAGATGCC GTCACCCAAAAAAATGCACCAGGGGAAGAAGAAGAAACAGAACAAGAATCGCATCTCTTG GGTCGGAGAAGCCGTCAAGACTGATGGGAAGAAGAGTTACTATAAGAAGGTGTGCATTGA TGCGGAAACCCTGGAAGTGGGGGACTGTGTCTCTGTTATTCCAGATGATTCCTCAAAACC GCTGTATCTAGCAAGGGTCACGGCGCTGTGGGAGGACAGCAGCAACGGGCAGATGTTTCA CGCCCACTGGTTCTGCGCTGGGACAGACACAGTCCTCGGGGCCACGTCGGACCCTCTGGA GCTGTTCTTGGTGGATGAATGTGAGGACATGCAGCTTTCATATATCCACAGCAAAGTGAA AGTCATCTACAAAGCCCCCTCCGAAAACTGGGCCATGGAGGGAGGCATGGATCCCGAGTC CCTGCTGGAGGGGGACGACGGGAAGACCTACTTCTACCAGCTGTGGTATGATCAAGACTA CGCGAGATTCGAGTCCCCTCCAAAAACCCAGCCAACAGAGGACAACAAGTTCAAATTCTG TGTGAGCTGTGCCCGTCTGGCTGAGATGAGGCAAAAAGAAATCCCCAGGGTCCTGGAGCA GCTCGAGGACCTGGATAGCCGGGTCCTCTACTACTCAGCCACCAAGAACGGCATCCTGTA CCGAGTTGGTGATGGTGTGTACCTGCCCCCTGAGGCCTTCACGTTCAACATCAAGCTGTC CAGTCCCGTGAAACGCCCACGGAAGGAGCCCGTGGATGAGGACCTGTACCCAGAGCACTA CCGGAAATACTCCGACTACATCAAAGGCAGCAACCTGGATGCCCCTGAGCCCTACCGAAT TGGCCGGATCAAAGAGATCTTCTGTCCCAAGAAGAGCAACGGCAGGCCCAATGAGACTGA CATCAAAATCCGGGTCAACAAGTTCTACAGGCCTGAGAACACCCACAAGTCCACTCCAGC GAGCTACCACGCAGACATCAACCTGCTCTACTGGAGCGACGAGGAGGCCGTGGTGGACTT CAAGGCTGTGCAGGGCCGCTGCACCGTGGAGTATGGGGAGGACCTGCCCGAGTGCGTCCA GGTGTACTCCATGGGCGGCCCCAACCGCTTCTACTTCCTCGAGGCCTATAATGCAAAGAG CAAAAGCTTTGAAGATCCTCCCAACCATGCCCGTAGCCCTGGAAACAAAGGGAAGGGCAA GGGAAAAGGGAAGGGCAAGCCCAAGTCCCAAGCCTGTGAGCCGAGCGAGCCAGAGATAGA GATCAAGCTGCCCAAGCTGCGGACCCTGGATGTGTTTTCTGGCTGCGGGGGGTTGTCGGA GGGATTCCACCAAGCAGGCATCTCTGACACGCTGTGGGCCATCGAGATGTGGGACCCTGC GGCCCAGGCGTTCCGGCTGAACAACCCCGGCTCCACAGTGTTCACAGAGGACTGCAACAT CCTGCTGAAGCTGGTCATGGCTGGGGAGACCACCAACTCCCGCGGCCAGCGGCTGCCCCA GAAGGGAGACGTGGAGATGCTGTGCGGCGGGCCGCCCTGCCAGGGCTTCAGCGGCATGAA CCGCTTCAATTCGCGCACCTACTCCAAGTTCAAAAACTCTCTGGTGGTTTCCTTCCTCAG CTACTGCGACTACTACCGGCCCCGGTTCTTCCTCCTGGAGAATGTCAGGAACTTTGTCTC CTTCAAGCGCTCCATGGTCCTGAAGCTCACCCTCCGCTGCCTGGTCCGCATGGGCTATCA 
GTGCACCTTCGGCGTGCTGCAGGCCGGTCAGTACGGCGTGGCCCAGACTAGGAGGCGGGC CATCATCCTGGCCGCGGCCCCTGGAGAGAAGCTCCCTCTGTTCCCGGAGCCACTGCACGT GTTTGCTCCCCGGGCCTGCCAGCTGAGCGTGGTGGTGGATGACAAGAAGTTTGTGAGCAA CATAACCAGGTTGAGCTCGGGTCCTTTCCGGACCATCACGGTGCGAGACACGATGTCCGA CCTGCCGGAGGTGCGGAATGGAGCCTCGGCACTGGAGATCTCCTACAACGGGGAGCCTCA GTCCTGGTTCCAGAGGCAGCTCCGGGGCGCACAGTACCAGCCCATCCTCAGGGACCACAT CTGTAAGGACATGAGTGCATTGGTGGCTGCCCGCATGCGGCACATCCCCTTGGCCCCAGG GTCAGACTGGCGCGATCTGCCCAACATCGAGGTGCGGCTCTCAGACGGCACCATGGCCAG GAAGCTGCGGTATACCCACCATGACAGGAAGAACGGCCGCAGCAGCTCTGGGGCCCTCCG TGGGGTCTGCTCCTGCGTGGAAGGTGGGTCCTCCGGCAAAGCCTGCGACCCCGCAGCCAG GCAGTTCAACACCCTCATCCCCTGGTGCCTGCCCCACACCGGGAACCGGCACAACCACTG GGCTGGCCTCTATGGAAGGCTCGAGTGGGACGGCTTCTTCAGCACAACCGTCACCAACCC CGAGCCCATGGGCAAGCAGGGCCGCGTGCTCCACCCAGAGCAGCACCGTGTGGTGAGCGT GCGGGAGTGTGCCCGCTCCCAGGGCTTCCCTGACACCTACCGGCTCTTCGGCAACATCCT GGACAAGCACCGGCAGGTGGGCAATGCCGTGCCACCGCCCCTGGCCAAAGCCATTGGCTT GGAGATCAAGCTTTGTATGTTGGCCAAAGCCCGAGAGAGTGCCTCAGCTAAAATAAAGGA GGAGGAAGCTGCTAAGGACTAG |
Protein Properties | |
Number of Residues | 1606 |
Molecular Weight | 182018.2 |
Theoretical pI | 7.69 |
Pfam Domain Function | |
Signals | |
Transmembrane Regions | |
Protein Sequence |
>Cytosine-specific methyltransferase AVPAISLPDDVRRRLKDLERDSLTEKECVKEKLNLLHEFLQTEIKNQLCDLETKLRKEEL SEEGYLAKVKSLLNKDLSLENGAHAYNREVNGRLENGNQARSEARRVGMADANSPPKPLS KPRTPRRSKSDGEAKPEPSPSPRITRKSTRQTTITSHFAKGPAKRKPQEESERAKSDESI KEEDKDQDEKRRRVTSRERVARPLPAEEPERAKSGTRTEKEEERDEKEEKRLRSQTKEPT PKQKLKEEPDREARAGVQADEDEDGDEKDEKKHRSQPKDLAAKRRPEEKEPEKVNPQISD EKDEDEKEEKRRKTTPKEPTEKKMARAKTVMNSKTHPPKCIQCGQYLDDPDLKYGQHPPD AVDEPQMLTNEKLSIFDANESGFESYEALPQHKLTCFSVYCKHGHLCPIDTGLIEKNIEL FFSGSAKPIYDDDPSLEGGVNGKNLGPINEWWITGFDGGEKALIGFSTSFAEYILMDPSP EYAPIFGLMQEKIYISKIVVEFLQSNSDSTYEDLINKIETTVPPSGLNLNRFTEDSLLRH AQFVVEQVESYDEAGDSDEQPIFLTPCMRDLIKLAGVTLGQRRAQARRQTIRHSTREKDR GPTKATTTKLVYQIFDTFFAEQIEKDDREDKENAFKRRRCGVCEVCQQPECGKCKACKDM VKFGGSGRSKQACQERRCPNMAMKEADDDEEVDDNIPEMPSPKKMHQGKKKKQNKNRISW VGEAVKTDGKKSYYKKVCIDAETLEVGDCVSVIPDDSSKPLYLARVTALWEDSSNGQMFH AHWFCAGTDTVLGATSDPLELFLVDECEDMQLSYIHSKVKVIYKAPSENWAMEGGMDPES LLEGDDGKTYFYQLWYDQDYARFESPPKTQPTEDNKFKFCVSCARLAEMRQKEIPRVLEQ LEDLDSRVLYYSATKNGILYRVGDGVYLPPEAFTFNIKLSSPVKRPRKEPVDEDLYPEHY RKYSDYIKGSNLDAPEPYRIGRIKEIFCPKKSNGRPNETDIKIRVNKFYRPENTHKSTPA SYHADINLLYWSDEEAVVDFKAVQGRCTVEYGEDLPECVQVYSMGGPNRFYFLEAYNAKS KSFEDPPNHARSPGNKGKGKGKGKGKPKSQACEPSEPEIEIKLPKLRTLDVFSGCGGLSE GFHQAGISDTLWAIEMWDPAAQAFRLNNPGSTVFTEDCNILLKLVMAGETTNSRGQRLPQ KGDVEMLCGGPPCQGFSGMNRFNSRTYSKFKNSLVVSFLSYCDYYRPRFFLLENVRNFVS FKRSMVLKLTLRCLVRMGYQCTFGVLQAGQYGVAQTRRRAIILAAAPGEKLPLFPEPLHV FAPRACQLSVVVDDKKFVSNITRLSSGPFRTITVRDTMSDLPEVRNGASALEISYNGEPQ SWFQRQLRGAQYQPILRDHICKDMSALVAARMRHIPLAPGSDWRDLPNIEVRLSDGTMAR KLRYTHHDRKNGRSSSGALRGVCSCVEGGSSGKACDPAARQFNTLIPWCLPHTGNRHNHW AGLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRVVSVRECARSQGFPDTYRLFGNIL DKHRQVGNAVPPPLAKAIGLEIKLCMLAKARESASAKIKEEEAAKD |
External Links | |
GenBank ID Protein | 62088406 |
UniProtKB/Swiss-Prot ID | Q59FP7 |
UniProtKB/Swiss-Prot Entry Name | Q59FP7_HUMAN |
PDB IDs | Not Available |
GenBank Gene ID | AB209413 |
GeneCard ID | Not Available |
GenAtlas ID | Not Available |
HGNC ID | HGNC:2976 |
References | |
General References | Not Available |
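
The Gene Sequence and Protein Sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies; the names `CODON_TABLE`, `translate`, and `expected_length_bp` are hypothetical helpers written for this illustration and are not part of any HMDB tool. It translates the first and last few codons of the Gene Sequence, reading from the second nucleotide (frame offset 1), and checks that the listed 4822 bp is consistent with 1606 residues plus one stop codon.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate `dna` starting at `offset`, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 60-base blocks
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def expected_length_bp(n_residues, leading_bases=0):
    """Leading bases + three bases per residue + one stop codon."""
    return leading_bases + 3 * n_residues + 3


# First 25 and last 15 nucleotides of the Gene Sequence field above.
gene_start = "GGCCGTCCCGGCCATCTCGCTGCCC"
gene_end = "GCTGCTAAGGACTAG"

print(translate(gene_start, offset=1))            # AVPAISLP
print(translate(gene_end))                        # AAKD
print(expected_length_bp(1606, leading_bases=1))  # 4822
```

The two translated fragments match the start (`AVPAISLP`) and end (`AAKD`) of the Protein Sequence field, and the computed length matches the `>4822 bp` header. Passing the full Gene Sequence to `translate(..., offset=1)` would extend the same check to the whole record.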