ID HMCU_DROME STANDARD; PRT; 2175 AA. AC P10180; DT 01-MAR-1989 (REL. 10, CREATED) DT 01-MAR-1989 (REL. 10, LAST SEQUENCE UPDATE) DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE) DE HOMEOBOX PROTEIN CUT. GN CT. OS DROSOPHILA MELANOGASTER (FRUIT FLY). OC EUKARYOTA; METAZOA; ARTHROPODA; INSECTA; DIPTERA. RN [1] RP SEQUENCE FROM N.A. RM 88232956 RA BLOCHLINGER K., BODMER R., JACK J., JAN L.Y., JAN Y.N.; RL NATURE 333:629-635(1988). CC -!- FUNCTION: CUT IS INVOLVED IN SPECIFYING SENSORY ORGAN IDENTITY IN CC FRUIT FLY. IN ABSENCE OF CUT GENE EXTERNAL SENSORY ORGANS ARE CC TRANSFORMED INTO CHORDOTONAL ORGANS. DR EMBL; X07985; DMCUT. DR PIR; S03170; S03170. DR TFD; P00024; RELEASE 3.0. DR TFD; P00025; RELEASE 3.0. DR TFD; P00026; RELEASE 3.0. DR FLYBASE; 04198; RELEASE 9206. DR PROSITE; PS00027; HOMEOBOX. KW HOMEOBOX; DNA-BINDING; DEVELOPMENTAL PROTEIN; NUCLEAR PROTEIN; KW REPEAT. FT DOMAIN 194 210 ALA/GLN-RICH. FT DOMAIN 235 243 ALA-RICH. FT DOMAIN 271 293 ASP/GLU-RICH (ACIDIC). FT DOMAIN 384 428 ASN-RICH. FT DOMAIN 547 554 ASP/GLU-RICH (ACIDIC). FT DOMAIN 574 584 ASP/GLU-RICH (ACIDIC). FT DOMAIN 616 630 ALA-RICH. FT DOMAIN 665 699 HIS/GLN-RICH. FT REPEAT 886 945 'CUT'-REPEAT. FT REPEAT 1339 1398 'CUT'-REPEAT. FT REPEAT 1617 1676 'CUT'-REPEAT. FT DNA_BIND 1745 1804 HOMEOBOX. FT DOMAIN 2004 2014 ALA-RICH. FT DOMAIN 2071 2077 ASP/GLU-RICH (ACIDIC). FT DOMAIN 2124 2136 ALA/PRO-RICH. SQ SEQUENCE 2175 AA; 233628 MW; 1.697006E+07 CN; 1 MQPTLPQAAG TADMDLTAVQ SINDWFFKKE QIYLLAQFWQ QRATLAEKEV NTLKEQLSTG 61 NPDSNLNSEN SDTAAAAATA AAVAAVVAGA TATNDIEDEQ QQQLQQTASG GILESDSDKL 121 LNSSIVAAAI TLQQQNGSNL LANTNTPSPS PPLLSAEQQQ QLQSSLQQSG GVGGACLNPK 181 LFFNHAQQMM MMEAAAAAAA AALQQQQQQQ SPLHSPANEV AIPTEQPAAT VATGAAAAAA 241 AAATPIATGN VKSGSTTSNA NHTNSNNSHQ DEEELDDEEE DEEEDEDEDD EEENASMQSN 301 ADDMELDAQQ ETRTEPSATT QQQHQQQDTE DLEENKDAGE ASLNVSNNHN TTDSNNSCSR 361 KNNNGGNESE QHVASSAEDD DCANNNTNTS NNNNTSNTAT SNTNNNNNNN SSSGNSEKRK 421 KKNNNNNNGQ PAVLLAAKDK EIKALLDELQ RLRAQEQTHL VQIQRLEEHL EVKRQHIIRL 481 EARLDKQQIN EALAEATALS AAASTNNNNN SQSSDNNKKL NTAAERPMDA SSNADLPEST 541 KAPVPAEDDE EDEDQAMLVD SEEAEDKPED SHHDDDEDED EDREAVNATT TDSNELKIKK 601 EQHSPLDLNV LSPNSAIAAA AAAAAAAACA NDPNKFQALL IERTKALAAE ALKNGASDAL 661 SEDAHHQQQQ HHQQQHQHQQ QHHQQQHLHQ QHHHHLQQQP NSGSNSNPAS NDHHHGHHLH 721 GHGLLHPSSA HHLHHQTTES NSNSSTPTAA GNNNGSNNSS SNTNANSTAQ LAASLASTLN 781 GTKSLMQEDS NGLAAVAMAA HAQHAAALGP GFLPGLPAFQ FAAAQVAAGG DGRGHYRFAD 841 SELQLPPGAS MAGRLGESLI PKGDPMEAKL QEMLRYNMDK YANQALDTLH ISRRVRELLS 901 VHNIGQRLFA KYILGLSQGT VSELLSKPKP WDKLTEKGRD SYRKMHAWAC DDNAVMLLKS 961 LIPKKDSGLP QYAGRGAGGA GGDDSMSEDR IAHILSEASS LMKQSSVAQH REQERRSHGG 1021 EDSHSNEDSK SPPQSCTSPF FKVENQLKQH QHLNPEQAAA QQREREREQR EREQQQRLRH 1081 DDQDKMARLY QELIARTPRE TAFPSFLFSP SLFGGAAGMP GAASNAFPAM ADENMRHVFE 1141 REIAKLQQHQ QQQQAAQAQA QFPNFSSLMA LQQQVLNGAQ DLSLAAAAAK DIKLNGQRSS 1201 LEHSAGSSSC SKDGERDDAY PSSLHGRKSE GGGTPAPPAP PSGPGTGAGA PPTAAPPTGG 1261 ASSNSAAPSP LSNSILPPAL SSQGEEFAAT ASPLQRMASI TNSLITQPPV TPHHSTPQRP 1321 TKAVLPPITQ QQFDMFNNLN TEDIVRRVKE ALSQYSISQR LFGESVLGLS QGSVSDLLAR 1381 PKPWHMLTQK GREPFIRMKM FLEDENAVHK LVASQYKIAP EKLMRTGSYS GSPQMPQGLA 1441 SKMQAASLPM QKMMSELKLQ EPAQAQHLMQ QMQAAAMSAA MQQQQVAQAQ QQAQQAQQAQ 1501 QHLQQQAQQH LQQQQHLAQQ QHPHQQHHQA AAAAAALHHQ SMLLTSPGLP PQHAISLPPS 1561 AGGAQPGGPG GNQGSSNPSN SEKKPMLMPV HGTNAMRSLH QHMSPTVYEM AALTQDLDTH 1621 DITTKIKEAL LANNIGQKIF GEAVLGLSQG SVSELLSKPK PWHMLSIKGR EPFIRMQLWL 1681 SDANNVERLQ LLKNERREAS KRRRSTGPNQ QDNSSDTSSN DTNDFYTSSP GPGSVGSGVG 1741 GAPPSKKQRV LFSEEQKEAL RLAFALDPYP NVGTIEFLAN ELGLATRTIT NWFHNHRMRL 1801 KQQVPHGPAG QDNPIPSRES TSATPFDPVQ FRILLQQRLL ELHKERMGMS GAPIPYPPYF 1861 AAAAILGRSL AGIPGAAAAA GAAAAAAAVG ASGGDELQAL NQAFKEQMSG LDLSMPTLKR 1921 ERSDDYQDDL ELEGGGHNLS DNESLEGQEP EDKTTDYEKV LHKSALAAAA AYMSNAVRSS 1981 RRKPAAPQWV NPAGAVTNPS AVVAAVAAAA AAAADNERII NGVCVMQASE YGRDDTDSNK 2041 PTDGGNDSDH EHAQLEIDQR FMEPEVHIKQ EEDDDEEQSG SVNLDNEDNA TSEQKLKVIN 2101 EEKLRMVRVR RLSSTGGGSS EEMPAPLAPP PPPPAASSSI VSGESTTSSS SSSNTSSSTP 2161 AVTTAAATAA AGWNY //