Protein sequence for HAGRID 98 (HTT) for HAGRID 98 (HTT)


Gene information

HGNC symbolHTT
Common namehuntingtin

GO protein functions

GO:0002039p53 binding
GO:0005515protein binding
GO:0005522profilin binding
GO:0008134transcription factor binding
GO:0019900kinase binding
GO:0034452dynactin binding
GO:0042802identical protein binding
GO:0044325ion channel binding
GO:0045505dynein intermediate chain binding
GO:0048487beta-tubulin binding

Protein sequence for HAGRID 98 (HTT)

Protein sequence for HAGRID 98 (HTT) source (RefSeq)NP_002102
Sequence length3144 aa

Sequence

     1  MATLEKLMKA FESLKSFQQQ QQQQQQQQQQ QQQQQQQQQQ PPPPPPPPPP PQLPQPPPQA
    61  QPLLPQPQPP PPPPPPPPGP AVAEEPLHRP KKELSATKKD RVNHCLTICE NIVAQSVRNS
   121  PEFQKLLGIA MELFLLCSDD AESDVRMVAD ECLNKVIKAL MDSNLPRLQL ELYKEIKKNG
   181  APRSLRAALW RFAELAHLVR PQKCRPYLVN LLPCLTRTSK RPEESVQETL AAAVPKIMAS
   241  FGNFANDNEI KVLLKAFIAN LKSSSPTIRR TAAGSAVSIC QHSRRTQYFY SWLLNVLLGL
   301  LVPVEDEHST LLILGVLLTL RYLVPLLQQQ VKDTSLKGSF GVTRKEMEVS PSAEQLVQVY
   361  ELTLHHTQHQ DHNVVTGALE LLQQLFRTPP PELLQTLTAV GGIGQLTAAK EESGGRSRSG
   421  SIVELIAGGG SSCSPVLSRK QKGKVLLGEE EALEDDSESR SDVSSSALTA SVKDEISGEL
   481  AASSGVSTPG SAGHDIITEQ PRSQHTLQAD SVDLASCDLT SSATDGDEED ILSHSSSQVS
   541  AVPSDPAMDL NDGTQASSPI SDSSQTTTEG PDSAVTPSDS SEIVLDGTDN QYLGLQIGQP
   601  QDEDEEATGI LPDEASEAFR NSSMALQQAH LLKNMSHCRQ PSDSSVDKFV LRDEATEPGD
   661  QENKPCRIKG DIGQSTDDDS APLVHCVRLL SASFLLTGGK NVLVPDRDVR VSVKALALSC
   721  VGAAVALHPE SFFSKLYKVP LDTTEYPEEQ YVSDILNYID HGDPQVRGAT AILCGTLICS
   781  ILSRSRFHVG DWMGTIRTLT GNTFSLADCI PLLRKTLKDE SSVTCKLACT AVRNCVMSLC
   841  SSSYSELGLQ LIIDVLTLRN SSYWLVRTEL LETLAEIDFR LVSFLEAKAE NLHRGAHHYT
   901  GLLKLQERVL NNVVIHLLGD EDPRVRHVAA ASLIRLVPKL FYKCDQGQAD PVVAVARDQS
   961  SVYLKLLMHE TQPPSHFSVS TITRIYRGYN LLPSITDVTM ENNLSRVIAA VSHELITSTT
  1021  RALTFGCCEA LCLLSTAFPV CIWSLGWHCG VPPLSASDES RKSCTVGMAT MILTLLSSAW
  1081  FPLDLSAHQD ALILAGNLLA ASAPKSLRSS WASEEEANPA ATKQEEVWPA LGDRALVPMV
  1141  EQLFSHLLKV INICAHVLDD VAPGPAIKAA LPSLTNPPSL SPIRRKGKEK EPGEQASVPL
  1201  SPKKGSEASA ASRQSDTSGP VTTSKSSSLG SFYHLPSYLK LHDVLKATHA NYKVTLDLQN
  1261  STEKFGGFLR SALDVLSQIL ELATLQDIGK CVEEILGYLK SCFSREPMMA TVCVQQLLKT
  1321  LFGTNLASQF DGLSSNPSKS QGRAQRLGSS SVRPGLYHYC FMAPYTHFTQ ALADASLRNM
  1381  VQAEQENDTS GWFDVLQKVS TQLKTNLTSV TKNRADKNAI HNHIRLFEPL VIKALKQYTT
  1441  TTCVQLQKQV LDLLAQLVQL RVNYCLLDSD QVFIGFVLKQ FEYIEVGQFR ESEAIIPNIF
  1501  FFLVLLSYER YHSKQIIGIP KIIQLCDGIM ASGRKAVTHA IPALQPIVHD LFVLRGTNKA
  1561  DAGKELETQK EVVVSMLLRL IQYHQVLEMF ILVLQQCHKE NEDKWKRLSR QIADIILPML
  1621  AKQQMHIDSH EALGVLNTLF EILAPSSLRP VDMLLRSMFV TPNTMASVST VQLWISGILA
  1681  ILRVLISQST EDIVLSRIQE LSFSPYLISC TVINRLRDGD STSTLEEHSE GKQIKNLPEE
  1741  TFSRFLLQLV GILLEDIVTK QLKVEMSEQQ HTFYCQELGT LLMCLIHIFK SGMFRRITAA
  1801  ATRLFRSDGC GGSFYTLDSL NLRARSMITT HPALVLLWCQ ILLLVNHTDY RWWAEVQQTP
  1861  KRHSLSSTKL LSPQMSGEEE DSDLAAKLGM CNREIVRRGA LILFCDYVCQ NLHDSEHLTW
  1921  LIVNHIQDLI SLSHEPPVQD FISAVHRNSA ASGLFIQAIQ SRCENLSTPT MLKKTLQCLE
  1981  GIHLSQSGAV LTLYVDRLLC TPFRVLARMV DILACRRVEM LLAANLQSSM AQLPMEELNR
  2041  IQEYLQSSGL AQRHQRLYSL LDRFRLSTMQ DSLSPSPPVS SHPLDGDGHV SLETVSPDKD
  2101  WYVHLVKSQC WTRSDSALLE GAELVNRIPA EDMNAFMMNS EFNLSLLAPC LSLGMSEISG
  2161  GQKSALFEAA REVTLARVSG TVQQLPAVHH VFQPELPAEP AAYWSKLNDL FGDAALYQSL
  2221  PTLARALAQY LVVVSKLPSH LHLPPEKEKD IVKFVVATLE ALSWHLIHEQ IPLSLDLQAG
  2281  LDCCCLALQL PGLWSVVSST EFVTHACSLI YCVHFILEAV AVQPGEQLLS PERRTNTPKA
  2341  ISEEEEEVDP NTQNPKYITA ACEMVAEMVE SLQSVLALGH KRNSGVPAFL TPLLRNIIIS
  2401  LARLPLVNSY TRVPPLVWKL GWSPKPGGDF GTAFPEIPVE FLQEKEVFKE FIYRINTLGW
  2461  TSRTQFEETW ATLLGVLVTQ PLVMEQEESP PEEDTERTQI NVLAVQAITS LVLSAMTVPV
  2521  AGNPAVSCLE QQPRNKPLKA LDTRFGRKLS IIRGIVEQEI QAMVSKRENI ATHHLYQAWD
  2581  PVPSLSPATT GALISHEKLL LQINPERELG SMSYKLGQVS IHSVWLGNSI TPLREEEWDE
  2641  EEEEEADAPA PSSPPTSPVN SRKHRAGVDI HSCSQFLLEL YSRWILPSSS ARRTPAILIS
  2701  EVVRSLLVVS DLFTERNQFE LMYVTLTELR RVHPSEDEIL AQYLVPATCK AAAVLGMDKA
  2761  VAEPVSRLLE STLRSSHLPS RVGALHGVLY VLECDLLDDT AKQLIPVISD YLLSNLKGIA
  2821  HCVNIHSQQH VLVMCATAFY LIENYPLDVG PEFSASIIQM CGVMLSGSEE STPSIIYHCA
  2881  LRGLERLLLS EQLSRLDAES LVKLSVDRVN VHSPHRAMAA LGLMLTCMYT GKEKVSPGRT
  2941  SDPNPAAPDS ESVIVAMERV SVLFDRIRKG FPCEARVVAR ILPQFLDDFF PPQDIMNKVI
  3001  GEFLSNQQPY PQFMATVVYK VFQTLHSTGQ SSMVRDWVML SLSNFTQRAP VAMATWSLSC
  3061  FFVSASTSPW VAAILPHVIS RMGKLEQVDV NLFCLVATDF YRHQIEEELD RRAFQSVLEV
  3121  VAAPGSPYHR LLTCLRNVHK VTTC   
Download sequence as FASTA