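Patent title: Citrinin biosynthetic gene cluster
Technical field:
The present invention relates to the biosynthesis of citrinin and belongs to the field of microbial gene technology; in particular, it relates to citrinin biosynthesis genes.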
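Background art:
Monascus (also known as red koji mold) has been used in food and traditional Chinese medicine for a thousand years, and modern research has found that Monascus produces a variety of physiologically active substances. These are mainly metabolites of Monascus and include antibacterial substances (e.g. monascorubrin), antihypertensive substances (e.g. γ-aminobutyric acid and acetylcholine), hypoglycemic substances, anticancer substances, antioxidants, ergosterol, long-chain fatty acids, and cholesterol synthesis inhibitors (e.g. monacolin). The research, development, and application of Monascus have therefore received considerable attention in recent years.
However, in addition to these beneficial substances, Monascus also produces a mycotoxin, citrinin. Citrinin is nephrotoxic and is also referred to as a nephrotoxin; in experimental animals it can cause kidney enlargement, increased urine output, renal tubular dilation, and degeneration and necrosis of epithelial cells, and its toxicity is classed as moderate to high. Citrinin was first used as an antibiotic because of its antibacterial activity and was later banned once its toxicity was recognized. In 1981, Wang et al. isolated an antibacterial substance from Monascus and named it Monascidin A (Wang Hinchuang, et al., 1981). In 1995, Blanc et al. showed through structure determination and qualitative analysis that Monascidin A is in fact citrinin, and concluded that citrinin contamination poses a hazard to consumer health (Blanc et al., 1995). Subsequent studies showed that citrinin production is a common problem across species of the genus Monascus.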
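Citrinin is a polyketide. Polyketides are a large family of secondary metabolites found widely in bacteria, fungi, and plants, and include many antibiotics, mycotoxins, and some natural pigments (Varga J, et al., 2003). Polyketide synthase (PKS) is the key enzyme catalyzing polyketide biosynthesis. It has a modular organization, and its catalytic process resembles fatty acid synthesis: one starter unit and several extender units are repeatedly condensed and elongated. The carbonyl group formed after each extender unit is incorporated may undergo ketoreduction, dehydration, and enoyl reduction, after which the chain is cyclized and released to give the initial polyketide backbone (an intermediate metabolite containing multiple keto groups). The initial backbone then undergoes steps such as hydration, dehydration, oxidation, reduction, decarboxylation, methylation, glycosylation, prenylation, and ring formation to yield the mature polyketide (Shen B, et al., 2003). These steps are critical to the biological activity of most end products.
The modular organization of PKSs is to some extent plastic: new polyketide derivatives can be obtained by genetic engineering operations such as changing the number of modules, altering the specificity of extender modules, or inserting or inactivating domains, and the resulting novel polyketides can be screened for physiologically active substances.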
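At present, a small number of research reports and published patents concern citrinin biosynthesis genes in Monascus.
Japanese patent JP2004321176 provides a polyketide synthase gene for citrinin biosynthesis and a method for preparing citrinin-deficient strains of Monascus purpureus. Chinese patent CN200510039138.3 provides a method for constructing a citrinin-free genetically engineered Monascus strain by deleting the citrinin polyketide synthase gene of Monascus CICC5006. Shimizu, Kinoshita, and colleagues at Osaka University, Japan, combined the LC series primers with new KA series primers, designed from conserved regions of the KS (ketosynthase) and AT (acyl transferase) units, to clone the PKS gene that catalyzes citrinin synthesis from Monascus purpureus. The sequence of this citrinin polyketide synthase gene, pksCT, was registered and released in GenBank on 24 September 2004, and its function was verified by gene knockout (Shimizu T, et al., 2005).
However, the genes involved in citrinin biosynthesis that have been publicly reported so far are incomplete. Citrinin is known to be synthesized via a polyketide pathway, and the initial polyketide backbone requires a series of modifications before citrinin is obtained. These steps are also critical to the structure and properties of citrinin.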
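Summary of the invention
The object of the invention is to provide genes related to the biosynthesis of citrinin in Monascus and to its regulation, together with related industrial applications.
The genes provided by the invention are located within the nucleotide sequences given as Sequence 1 and Sequence 2.
The invention provides the nucleotide sequences (Sequence 1 and Sequence 2) of 10 genes related to citrinin biosynthesis, or their complementary sequences, or nucleotide sequences that hybridize with them under stringent conditions. One gene, ctnR1, is involved in regulating citrinin biosynthesis. Six genes, namely ctnD, ctnE, ctnF, ctnG, ctnH, and ctnI, encode proteins involved in the modification steps of citrinin biosynthesis and are responsible for catalyzing dehydration, oxidation, reduction, cyclization, and related structural changes of the polyketide chain. In addition, the citrinin biosynthetic gene cluster contains three genes, orf2, orf3, and orf4, each of which encodes a protein involved in citrinin biosynthetic modification, although the specific catalytic functions of these three proteins are unknown. These nucleotide sequences are selected from ctnD, ctnE, and orf2 in Sequence 1 and from ctnF, ctnR1, ctnG, ctnH, ctnI, orf3, and orf4 in Sequence 2. The gene organization is shown in Figure 1.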
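The invention also provides a nucleotide sequence encoding an oxidoreductase, consisting of the nucleotide sequence in Sequence 3 and named ctnD. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 4, and ctnD is located at 396 bp to 2312 bp in Sequence 1.
The invention also provides a nucleotide sequence encoding a dehydrogenase, consisting of the nucleotide sequence in Sequence 5 and named ctnE. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 6, and ctnE is located at 2522 bp to 3464 bp in Sequence 1.
The invention also provides a nucleotide sequence encoding a mutase, consisting of the nucleotide sequence in Sequence 9 and named ctnF. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 10, and ctnF is located at 1 bp to 1451 bp in Sequence 2.
The invention also provides a nucleotide sequence encoding a carbonic anhydrase, consisting of the nucleotide sequence in Sequence 17 and named ctnG. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 18, and ctnG is located at 8147 bp to 12633 bp in Sequence 2.
The invention also provides a nucleotide sequence encoding a short-chain dehydrogenase, consisting of the nucleotide sequence in Sequence 19 and named ctnH. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 20, and ctnH is located at 13379 bp to 14698 bp in Sequence 2.
The invention also provides a nucleotide sequence encoding an acyl-CoA synthetase, consisting of the nucleotide sequence in Sequence 21 and named ctnI. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 22, and ctnI is located at 14993 bp to 18255 bp in Sequence 2.
The invention also provides a nucleotide sequence encoding a WD-repeat protein, consisting of the nucleotide sequence in Sequence 13 and named ctnR1. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 14, and ctnR1 is located at 4710 bp to 6952 bp in Sequence 2.
The invention also provides a nucleotide sequence encoding a protein involved in citrinin biosynthesis, consisting of the nucleotide sequence in Sequence 7 and named orf2. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 8, and orf2 is located at 3985 bp to 4493 bp in Sequence 1.
The invention also provides a nucleotide sequence encoding a protein involved in citrinin biosynthesis, consisting of the nucleotide sequence in Sequence 11 and named orf3. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 12, and orf3 is located at 1987 bp to 4274 bp in Sequence 2.
The invention also provides a nucleotide sequence encoding a protein involved in citrinin biosynthesis, consisting of the nucleotide sequence in Sequence 15 and named orf4. The amino acid sequence of the protein consists of the amino acid sequence in Sequence 16, and orf4 is located at 7088 bp to 7951 bp in Sequence 2.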
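The gene positions stated above can be collected into a small lookup table for working with the sequence listing. The following is a minimal sketch, not part of the patent: the names `GENE_COORDS`, `clean_listing`, `gene_length`, `extract_region`, and `has_overlaps` are hypothetical, and the positions are assumed to be 1-based and inclusive. The extracted region is the genomic span as stated, which is not necessarily the CDS, since it may contain introns or lie on the complementary strand.

```python
# Gene positions as stated in the text: (sequence number, start bp, end bp).
# Assumption: coordinates are 1-based and inclusive.
GENE_COORDS = {
    "ctnD": (1, 396, 2312),
    "ctnE": (1, 2522, 3464),
    "orf2": (1, 3985, 4493),
    "ctnF": (2, 1, 1451),
    "orf3": (2, 1987, 4274),
    "ctnR1": (2, 4710, 6952),
    "orf4": (2, 7088, 7951),
    "ctnG": (2, 8147, 12633),
    "ctnH": (2, 13379, 14698),
    "ctnI": (2, 14993, 18255),
}


def clean_listing(text):
    """Keep only nucleotide letters, dropping spaces, newlines and position counters."""
    return "".join(ch for ch in text.lower() if ch in "acgtn")


def gene_length(gene):
    """Length in bp of the stated genomic span of a gene."""
    _, start, end = GENE_COORDS[gene]
    return end - start + 1


def extract_region(sequence, gene):
    """Slice the stated span of a gene out of its full-length parent sequence."""
    _, start, end = GENE_COORDS[gene]
    if end > len(sequence):
        raise ValueError("sequence is shorter than the stated end position")
    return sequence[start - 1:end]


def has_overlaps(sequence_number):
    """Report whether any two stated spans on one sequence overlap."""
    spans = sorted(
        (start, end)
        for seq_no, start, end in GENE_COORDS.values()
        if seq_no == sequence_number
    )
    return any(prev[1] >= nxt[0] for prev, nxt in zip(spans, spans[1:]))
```

For example, `gene_length("ctnD")` gives the size of the stated ctnD span, and `clean_listing` can be applied to the raw listing text before slicing with `extract_region`.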
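The invention also provides methods for blocking citrinin biosynthesis in genetically engineered microorganisms. Safe industrial Monascus strains that do not produce citrinin can be constructed by deleting or replacing part or all of Sequence 1 and/or Sequence 2. They can also be obtained by using antisense or RNA interference techniques to inactivate the genes described in the invention (ctnD, ctnE, ctnF, ctnG, ctnH, ctnI, ctnR1, orf2, orf3, and orf4). Specific experimental methods can be found in the relevant laboratory manuals (Hannon, G.J., RNAi: A Guide to Gene Silencing. New York: Cold Spring Harbor Laboratory Press, 2003; Martin J. Tymms and Ismail Kola, Gene Knockout Protocols. Totowa: Humana Press, 2001). The Monascus referred to above includes all species of the genus Monascus.
The invention also provides a route to recombinant DNA vectors containing at least part of the DNA sequence of Sequence 1 and/or Sequence 2.
The invention also provides a route to microorganisms in which citrinin biosynthesis genes have been disrupted or duplicated, where the disrupted or duplicated genes contain at least part of the nucleotide sequences of Sequence 1 and/or Sequence 2.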
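The complementary sequences of Sequence 1 and Sequence 2 can be obtained at any time according to the DNA base-pairing rules. The nucleotide sequences of Sequence 1 and Sequence 2, or parts of them, can be obtained by polymerase chain reaction (PCR), by digesting the corresponding DNA with suitable restriction endonucleases, or by other suitable techniques. Using the nucleotide sequences or partial sequences provided by the invention, genes similar to the citrinin biosynthesis genes can be obtained from other organisms, either by PCR or by hybridization with DNA containing the sequences of the invention as a probe. The same sequences can also be used to establish suitable methods, such as PCR, biochips, or molecular hybridization, for detecting whether a microorganism contains the nucleotide sequences or partial sequences provided by this patent.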
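Deriving a complementary strand by the base-pairing rules (A with T, G with C) is mechanical. The following is a minimal sketch of that step, with a simple in-silico check of whether a probe sequence occurs on either strand of a target. The function names are hypothetical, and the exact-match search is only an illustration: it does not model hybridization under stringent conditions, which tolerates mismatches.

```python
# Watson-Crick pairing for DNA; 'n' (unknown base) pairs with 'n'.
COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def reverse_complement(seq):
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def contains_either_strand(target, probe):
    """Exact-match search for a probe on the given strand or its complement."""
    target = target.lower()
    probe = probe.lower()
    return probe in target or reverse_complement(probe) in target
```

Applying `reverse_complement` twice returns the original sequence, which is a quick sanity check when preparing complementary sequences from a listing.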
For the nucleotide sequences that hybridize under stringent conditions as described in the invention, the stringent conditions are those given in Sambrook J, Fritsch EF, Maniatis T. Molecular Cloning: A Laboratory Manual. 2nd edition, New York: Cold Spring Harbor Laboratory Press, 1989, and in Nicholson, T.P., B.A. Rudd, et al. (2001). "Design and utility of oligonucleotide gene probes for fungal polyketide synthases." Chem Biol 8(2): 157-78.
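Cloned genes or DNA fragments comprising the nucleotide sequences provided by the invention, or at least part of them, can be used to obtain new citrinin derivatives by interrupting one or more steps of citrinin biosynthesis. DNA fragments or genes provided by the invention can also be used to increase the yield of citrinin or its derivatives.
Cloned DNA comprising the nucleotide sequences provided by the invention, or at least part of them, can be used to locate further library plasmids in a Monascus genomic library. These library plasmids contain at least part of the sequences of the invention, as well as uncloned DNA from adjacent regions of the Monascus genome.
The nucleotide sequences provided by the invention can be modified or mutated. The routes for doing so include insertion or substitution, polymerase chain reaction, error-prone polymerase chain reaction, site-specific mutagenesis, religation of different sequences, or treatment with ultraviolet light or chemical agents.
The nucleotide sequences provided by the invention can be subjected to directed evolution (DNA shuffling) using different parts of the sequences or homologous sequences from other sources.
Cloned genes comprising the nucleotide sequences provided by the invention, or at least part of them, can be expressed in suitable expression systems to obtain the corresponding enzymes or citrinin derivatives, or to increase their yield. These expression systems include bacteria, yeasts, filamentous fungi, animal cells, insect cells, plant cells, and cell-free expression systems.
Fragments or genes comprising the nucleotide sequences provided by the invention, or at least part of them, can be used to construct derivative libraries or combinatorial libraries.
The nucleotide sequences of the citrinin biosynthetic modification genes provide a route to citrinin derivatives through deletion or alteration of these modification genes.
Genes or gene clusters containing the nucleotide sequences of the invention, or at least part of them, can be expressed in heterologous hosts, and their function in the host's metabolic network can be studied by DNA chip technology or other suitable techniques.
Polypeptides comprising the amino acid sequences of the invention, or at least part of them, may retain biological activity, or even acquire new biological activity, after one or more amino acids are removed or substituted; alternatively, yield may be increased, protein kinetics optimized, or other desired properties obtained.
New proteins or enzymes, and hence new related products, can be obtained by deleting or joining amino acid sequences of the invention using suitable techniques.
The amino acid sequences provided by the invention can be used to isolate the desired proteins and to prepare antibodies.
The invention has substantive features and represents notable progress: the genes, proteins, and antibodies it provides can be used to find and develop compounds or proteins for medicine, industry, and agriculture.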
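Figure 1 shows the organization of the genes described in the invention.
(a) shows the organization of genes ctnD, ctnE, and orf2. As shown in Figure 1(a), ctnD and ctnE are adjacent in the genome and are transcribed in the same direction, and orf2 lies at the 3' end of the sequence. Arrows indicate the direction of transcription.
(b) shows the organization of genes ctnF, orf3, orf4, ctnG, ctnH, ctnI, and ctnR1. As shown in Figure 1(b), the regulatory gene ctnR1 lies between the modification genes orf3 and orf4. Arrows indicate the direction of transcription.
Figure 2 is a schematic of the recombination of a large DNA fragment using the Rec-ET system. In Figure 2, sm denotes the selectable marker gene, and A and B denote two sequences homologous to the target nucleotide sequence. After the two nucleotide fragments are co-transformed, the desired recombinant DNA can be obtained by selecting for the marker gene.
Figure 3 is a schematic of the recombinant plasmid pUC18-G/hph, the replacement-type targeting vector for the ctnD gene.
Figure 4 is a schematic of the recombinant plasmid pUC18-A/hph, the replacement-type targeting vector for the ctnI gene.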
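Detailed description of the embodiments
The invention is further illustrated below through gene isolation, sequencing, analysis, and application examples.
I. Gene isolation, sequencing, and analysis.
PCR primers were designed using the published sequences of the Monascus purpureus citrinin biosynthesis genes ctnA and ctnC as templates and were used to amplify genomic DNA of Monascus aurantiacus AS3.4384. The purified PCR products were used as probes to hybridize against a total-DNA genomic library of Monascus aurantiacus, yielding positive fosmids containing the homologous sequences. Two of these positive fosmids, Q11J22 and Q16A1, were sequenced by the shotgun method. The DNA was sheared by sonication with a 550 Sonic Dismembrator (Fisher Scientific), and fragments of 1.6-2.0 kb were recovered from a 0.7% low-melting-point agarose gel, purified with the Geneclean II reagent kit (Bio 101, Inc.), and cloned into the SmaI site of pUC18 (dephosphorylated beforehand) to construct a series of sequencing subclones. Plasmid DNA from the sequencing subclones was prepared with the Prep 96 Plasmid Kit (Qiagen). Sequencing was carried out automatically on 337 DNA Sequencers (PE/ABD) using BigDye Terminator Cycle Sequencing Kits (Applied Biosystems Division, Perkin Elmer), with the universal sequencing primers 5' GTA AAA CGA CGG CCA GT 3' (forward) and 5' GCG GAT AAC AAT TTC ACA CAGG 3' (reverse). ORF analysis was performed with the online software Eukaryotic GeneMark.hmm version 3.3. Sequence homology comparisons were performed with the PSI-BLAST software provided by the online server of the US National Center for Biotechnology Information.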
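The invention comprises 10 genes related to citrinin biosynthesis in Monascus, specifically:
(1) the citrinin modification genes ctnD, ctnE, ctnF, ctnG, ctnH, ctnI, orf2, orf3, and orf4, 9 genes in total;
(2) the citrinin regulatory gene ctnR1.
Citrinin modification genes
The following are the genomic sequences or complementary sequences, CDS sequences, and corresponding amino acid sequences of the 9 genes involved in citrinin biosynthesis, namely ctnD, ctnE, ctnF, ctnG, ctnH, ctnI, orf2, orf3, and orf4. Sequence 1 contains three modification genes (ctnD, ctnE, and orf2), and Sequence 2 contains six modification genes (ctnF, orf3, orf4, ctnG, ctnH, and ctnI). Their genomic sequences, CDS sequences, amino acid sequences, and functions are shown in Table 1.
Table 1. Nucleotide sequences, CDS sequences, amino acid sequences, and functions of the citrinin modification genes
Citrinin regulatory gene
The following are the nucleotide sequence and corresponding amino acid sequence of the one gene involved in regulating citrinin biosynthesis, ctnR1.
ctnR1 is located in Sequence 2 and is involved in regulating citrinin biosynthesis; its CDS sequence and amino acid sequence are Sequence 13 and Sequence 14, respectively.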
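II. The following application examples further illustrate the invention. They describe only preferred routes for obtaining and using the sequences and elements provided by the invention; they are for illustration and do not limit the scope of application of the invention.
Application example 1: Extraction of total RNA from Monascus and RT-PCR.
About 0.2 g of dried Monascus mycelium is placed in a mortar, liquid nitrogen is added, and the mycelium is ground to a powder. The powder is then extracted with TRIzol reagent (Invitrogen) and chloroform. The resulting RNA is dissolved in RNase-free water and used for reverse transcription (Promega, ImProm-II Reverse Transcription System). The resulting cDNA serves as the template for PCR with primers chosen according to the purpose of the experiment. For example, for protein expression, the coding sequence of the desired protein can be amplified with suitable specific primers (designed from the sequence information provided in this patent), ligated into an expression vector, and expressed in the corresponding host.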
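Application example 2: Recombination of a large DNA fragment using the Rec-ET system.
A pair of primers is synthesized based on the sequence of the hygromycin B resistance gene expression cassette, with 55 bp short sequences homologous to Sequence 1 or Sequence 2 designed onto the 5' end and the 3' end of the cassette, respectively. Using plasmid pUC18-hph, which contains the hygromycin B resistance gene expression cassette, as the template, PCR yields a hygromycin B resistance cassette flanked at both ends by 55 bp sequences homologous to Sequence 1 or Sequence 2. This product is co-transformed, together with a plasmid containing the corresponding homologous fragments, into Escherichia coli host strain YZ2005, which carries the RecET recombinase genes. Through RecET-mediated homologous recombination, plasmids in which the sequence between the two homologous fragments has been replaced by the hygromycin B resistance cassette can be selected, as shown in Figure 2.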
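The primer design described in this example, a cassette-priming region extended by a 55 bp homology arm, can be sketched in a few lines. This is an illustrative sketch only: the function names, the index convention, and the 20 nt priming length are assumptions not stated in the text, and real primer design would also consider melting temperature and secondary structure.

```python
ARM_LENGTH = 55  # homology arm length stated in the text

COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq):
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def design_recet_primers(target, left_end, right_start, cassette, priming_len=20):
    """Build primers that add 55 bp homology arms to a resistance cassette.

    target      -- sequence carrying the region to be replaced
    left_end    -- 0-based index where the replaced region begins
    right_start -- 0-based index where the retained downstream region begins
    cassette    -- top strand of the resistance cassette
    priming_len -- assumed length of the cassette-annealing part of each primer
    """
    if left_end < ARM_LENGTH or right_start + ARM_LENGTH > len(target):
        raise ValueError("not enough flanking sequence for 55 bp arms")
    if right_start < left_end or len(cassette) < priming_len:
        raise ValueError("inconsistent coordinates or cassette too short")
    left_arm = target[left_end - ARM_LENGTH:left_end]
    right_arm = target[right_start:right_start + ARM_LENGTH]
    forward = left_arm + cassette[:priming_len]
    # The reverse primer anneals to the top strand, so it is the reverse
    # complement of the cassette's 3' end followed by the right arm.
    reverse = reverse_complement(cassette[-priming_len:] + right_arm)
    return forward, reverse


def expected_product(target, left_end, right_start, cassette):
    """Top strand of the PCR product: left arm + cassette + right arm."""
    left_arm = target[left_end - ARM_LENGTH:left_end]
    right_arm = target[right_start:right_start + ARM_LENGTH]
    return left_arm + cassette + right_arm
```

With the default priming length, each primer is 75 nt long (55 nt of homology plus 20 nt that anneal to the cassette).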
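Application example 3: Deletion of the ctnD gene in the citrinin biosynthetic gene cluster.
A pair of specific primers is designed from the sequence of ctnD in Sequence 1, each carrying the recognition sequence of the restriction endonuclease KpnI and protective bases at its 5' end. Using genomic DNA of Monascus AS3.4384 as the template, a PCR product of about 2.5 kb is amplified and TA-cloned to give the recombinant plasmid pUC18-G. This plasmid is digested with the restriction endonuclease ApaI, blunt-ended, and ligated to the hygromycin resistance gene expression cassette to give the recombinant plasmid pUC18-G-hph (Figure 3). After digestion with KpnI, pUC18-G-hph is used to transform protoplasts of Monascus AS3.4384, and the gene disruption strain GHD-2 is selected. The gene replacement strain is verified by molecular hybridization and PCR, and high-performance liquid chromatography shows that citrinin biosynthesis is blocked in it. This disruption demonstrates that ctnD is required for citrinin biosynthesis.
This provides a route for obtaining new substances with correspondingly altered structures by modifying the genes for modification enzymes in Monascus.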
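The primer tails used in this example, a KpnI recognition sequence preceded by protective bases, can be illustrated as follows. This is a hedged sketch: the helper names and the protective bases "CGG" are hypothetical, since the patent does not give the actual primer sequences; only the KpnI recognition sequence GGTACC is a known fact. The site counter is included on the assumption that one would want to confirm the amplified fragment and cassette contain no internal KpnI site before the KpnI digestion that precedes transformation.

```python
KPNI_SITE = "GGTACC"  # KpnI recognition sequence (palindromic)


def add_kpni_tail(gene_specific, protective_bases="CGG"):
    """Prefix a gene-specific primer with protective bases and a KpnI site.

    The protective bases are a placeholder; the patent does not state them.
    """
    return protective_bases + KPNI_SITE + gene_specific.upper()


def count_sites(sequence, site=KPNI_SITE):
    """Count occurrences of a recognition site, allowing overlapping matches.

    Searching one strand is sufficient for a palindromic site such as KpnI.
    """
    sequence = sequence.upper()
    count = 0
    position = sequence.find(site)
    while position != -1:
        count += 1
        position = sequence.find(site, position + 1)
    return count
```

A fragment for which `count_sites` returns 0 would be released intact when the flanking KpnI sites introduced by the primers are cut.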
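Application example 4: Deletion of the ctnI gene in the citrinin biosynthetic gene cluster.
A pair of specific primers is designed from the sequence of ctnI in Sequence 2, each carrying the recognition sequence of the restriction endonuclease KpnI and protective bases at its 5' end. Using genomic DNA of Monascus AS3.4384 as the template, a PCR product of about 2.5 kb is amplified and TA-cloned to give the recombinant plasmid pUC18-A. This plasmid is digested with the restriction endonuclease ClaI and ligated to the hygromycin resistance gene expression cassette to give the recombinant plasmid pUC18-A-hph (Figure 4). After digestion with KpnI, pUC18-A-hph is used to transform protoplasts of Monascus AS3.4384, and the gene disruption strain AHD-1 is selected. The gene replacement strain is verified by molecular hybridization and PCR, and high-performance liquid chromatography shows that citrinin biosynthesis is blocked in it. This disruption demonstrates that ctnI is required for citrinin biosynthesis.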
SEQUENCE LISTING
<110>Nanchang University
<120>Citrinin biosynthetic gene cluster
<160>22
<170>PatentIn version 3.4
<210>1
<211>4493
<212>DNA
<213>Monascus aurantiacus AS3.4384
<400>1
gttgcatcgt cggagtgaag aagaaaccaa ttgggacaaa agagcatggt tcattttgat 60
tcttcagcca ccaggatgat gctgaatcct ttttaccgga tcttgcttac taccggcgca120
acttttttgc aggtggtttc ggacggaata tctgcttcgg attatttctt gctttggacg180
ctagcatatc atccgtgcat gtttgaccaa gtgaataaat aatacagcct gaatgaactg240
cccgagcttc tacggcggac aatggcaggg ccatcgttgc gactacatca caaaggtagc300
ttcagacatc cagcttgttg atctagctag gagtcagcta gttgtctatg atatgctaac360
taggaagaag ctgaaccatg ggaaccacca acggcttata tatgagcacg gagtcgacgg420
ccaaacccac agtccttctt gatgatatcc gaagcccgtt cggcgatggc atacacagta480
gccatgatcg cggcgctgac ctgcatcggc atgacactgg catcaacaac acgaagacgc540
cgaacccctt tcactcgcag cttctcatct accacctgtc ccatcgcgca ggtccccaaa600
gcatggtgat agggcacgat gtgattgcga acgaactggc gagcctcgtc catattcgaa660
aggtcgagtc cagcaggcgg actcacccgc ctacgaactt tcccgttgag caacgtcgag720
cggaagactc gatctgcaaa gactatgcca gcagcaagaa catcgacatc caccggatgg780
gaaagaaatc ccggatcgat cgccggcgcg tccattggat tcgaggtccg cacgtgcaca840
ctcccgcgcg atagaggata catgttgctg acgacgatgg agtagcatgc gctgtagccg900
acggggggac ccgacatgat cttggcgcag cttgcgtaac ctgctgcggt attgaaatac960
gcaggggtgc cgatgaactg gatgtctgcg gaccagcggt tttgcatccg tgcggcgaca 1020
gcctcctgct gccggcgttg gtagctcgcg ttctgctgag acagtctgcc gctcacacta 1080
ggagcgtcga agatacgtgc catggtggcg tcgacctgag tctcggtgga tagtgacgag 1140
tatggagtaa agcccatcag actgaccgat ccagacaggg ccccggaatg gtttttggcg 1200
tagaggctct gatgctcttc tagcagagcg ggatccttga acaaagagtc caccgacatg1260
attccgtctg cacattcata agacacagct gacatggtgt gttcttgtaa attgctgccc1320
acgtcggtgt tggccactct acaagcaatc ccagcgcctt cgaggacgct cggatctcca1380
attccagaaa gctccaggag ctggggactc tggacagatc cagcgctgag aatcacttct1440
cttctagctg acacggcgta gcgggctccc gcgtggtgca gttcaactcc ttccgcgacg1500
ggtattccgt ctgaagcatc ggagagtaag atgcggcaca cctgtgcgtt ttctaggacc1560
ttcagatttg ggcgtcccat aacgggggcg tagtagccgc tgaccgcata gctccggact1620
ggcgtgctgg ttctgtctaa tgtgaacaag gacctataga atcccagatg ggcaccgcta1680
tagggctccg ccggccgctg cagacgagcg gcctcatcga acgcagccaa gagtgattct1740
tctatgggcg cttgccaggg acctatcgaa gtatggattg gcccgccggt gccatgaact1800
ttcgggctga ctggacaact tgcactgggc tcgatgggct ccaagttctc gcttcttttg1860
aagtacggta gcagttcaga ccatgtccat cctttgacac ccagcttgtt ggcccagtca1920
tcaatgtctt cggccgatgg tcggttgtaa gacatgaagt tgattccact cgagccgccc1980
agcattctgc ctcgtgggat atgatacgct ttcgcattgg cgccagcctg ggagagatca2040
ataccggctc atcatgacat cgtgaggtct cgcacctgtg ggatgctctc gaaattccag2100
tcatagccgg gatcgcccag catctgaccc ggtccggtag gaaggtcgac cttgggatcc2160
cccagcctaa gagacccagc ttcgattacc ccaacttgga tgcctggctc ctcggaaaga2220
cgagcggcga ggacgagacc ggcagttcca ccgccgacga taagaaagtc aaatggggtt2280
tggatgaaat cgatgacctt gactgtggcc atgatttgca agggcttgtc tgtcagatat2340
tcgattccgg agaagtacag agttcggctg atggctctat gtgacggaga cagtcctggt2400
tcttatacga aacgacgtac aaacgtccgg tctggacaag gctatacatt tattccatcg2460
gagtgaatat ttccccggca tgctaacatg gaggaacatg actacggtgg ttctgatcta2520
tctacagaac caacttgacc ttgagcttgt cgccctggag aatctcctcc ttccgagcca2580
tcaactctgc tacatcccag ttggcactca aatagcgtcc ggccagccag gactgtttct2640
cctgcgtcaa ccagacaatc gtgtcggcgc acaattccgg cgaatccacc aactttgcct2700
tggtgtcttc gggcaagttg gacgccaact cggtatccac ggcgcctggg tgaatggcaa2760
aggccaaaac gccctgggct gcgtactcca cacaagtgaa ctgcgtcaag cgcagcatcg2820
ccaatttccc ggtttgatag gccgaggcac caggccgggt caggtgggcg ccgatggagt2880
tcatattgac aatggtcttt tcgccgccct tcaaaagaag aggcagcatg gccctcgtca2940
tgaggtacgt gcccttgagg ttgacctccc acgtcgccca ccaggacttg ggatccgtct3000
ccgcaagcgg gacccatttt tcgacccggc cggcattatt caccaggatg tcgagacggc3060
cgaacgcccg ttcgaccctg gcggctgcat cggccacact ctgctcatcc gcgacatcca3120
gtgtcagctt gaggacctgc ggcggcggat ggcccgcgga cttcgcggcg tcgagcaccg3180
ccgtctcagc ggcgtcgagg gaggatcgtg ctccgagcgc aagggagggc gcgccggctt3240
gagcgaatgc aaccgctgtc gcgcggccga tgcccttcga tgcgccggtc acgaagacgg3300
ctcgtccatg ctgcttacac ttggccgctg tgatggttgg ataggtgtca ttgtgagtct3360
tggagatcca agtaaatcct atttgtgtcg aacgttaacg atctgttccc tggtacgcac 3420
tggacaagga ggcgaggtct accagcggac ggtggaaagg ccatgatgct cgttgggttg 3480
caatgtgggg tggatggtca ttggcatggc acgcatgttt ccatgtgtct tatataaagc 3540
cgcttacggt gcataggtcg tggttcctgc tggaaacaac cgaatatttc cgtctggaat 3600
atcttgacgt ggggaggaaa tatatttcgg agaataccat gatggtcgtg tcaacttcat 3660
gccatttcta ggccatggag gattcggata aaagatatcc gagtgagata ttccgccatc 3720
acactcggag aaactctggc ctggatggaa taaagaaaca cgacgcaaga ccgcccattt 3780
gaatggagcc aattgatagg ttcgtccacc atggaacgac tgggtacatc gaatgattgc 3840
gatggcacaa agcgcggggg ataaagaggt tcgttgggac acgggcatat aaaaagctgc 3900
ctcaagcaga tggacctggc tcgatcctca gtacaatcat cgttcaacca ccaccccgat 3960
ttttaacact ccattctcgg catcatgtct gctatccctc ctgctggcac ccctgtaggg 4020
ctcgagatcc ccgccaagga cgtggctcgc ggtaagtcgt cccggctgca tatgttaatc 4080
aattatccga caacgcaacc taagacgagc gccccgcagg atctgcattc tacaaggagg 4140
tcttcaactg gacttttgct ccctccactc tgggttttcc cgcgcacaag cttcaaacct 4200
tcgaagtccc cggcggcgtt ttccccatcg gaggcgccat gcgtctggcg gaagaaatcc 4260
cggccggtac gggcgccacc aagctctacc tctacgtgaa cgacatcggt gctgcgatgg 4320
aggtgggtgg cccatgccct gacacgagag gctgatcggc taatgaaggg taggccattg 4380
aaaagcacgg cggcaagaag gcgagcgatg ttatccccga aggcaacaag gggctgttcc 4440
agtatttcga ggacagcgag ggcaacaact atgcaatcta cacttacaag tga 4493
<210>2
<211>18809
<212>DNA
<213>Monascus aurantiacus AS3.4384
<400>2
atgcctccta tcatccattg cgtccgccac gcccaggtat gaagcaacat taattgtcaa60
tcatatcgaa aacgagagac ttaatatatt ctatcaatcc aggcaatcca caatctctct 120
gtcgcaaacc acgttatccc cgatcccatc ctcacagatc tgggcaacga acaatgccgc 180
aagctccgtg agaagtttcc ttaccattcc gacgtggaat tggtcgtctc ctcgcccctg 240
cgtcgcacga tcgccacaag cctccagggc ttcgagcccg tcttccagtc gcgggagggg 300
ctgaagttga tcgttcatcc ggatctccag gagacgagcg atgttccctg tgatacgggg 360
agtaatccgg aggttttgag ggaggagatt gagaagggtg ggcttccggt tgatttgggg 420
ttgctgtttg atgggtggaa cagtaaggtt ggttttcccc cgcttcccct ggccctggtg 480
gctatcattc taacctgaca tagaaaggac cgtatgcgcc taccaacaag gagatcaaga 540
atcgagcccg tgctgcccgt cggtggctga aggcacggcc ggaaaaggtg attgttgtcg 600
ttacccatgg tggattcttg cactatttca ccgaggactg ggaggatagc agtgaatacc 660
agggtatggt cgtccatcct ttacatggtc ctactacgac tttgtttact cacactcgtt 720
cgaagggacc ggctgggcca ataccgaatt ccgcacgttc gagtttgccg atgtcgagca 780
taaagatgat ctggaaggct acgggttgga cggtgacaac gctacgttga tcgagacggt 840
tgaatctcgc cgacgtcgtg ggaaggatgg cacaacaccc agccgtgagc agcagaaggt 900
tctgtacaag cttggggttc agggctggga taaccagggc ctcgcgctta gtgtagctga 960
gagagagaag accaaggtgc ctcaaggcga ggaggttggt ggacagcgga tatgaagaac1020
gttccataga ctggttctat gccttcctct ttctccttct gtgtatgact tggatttcat1080
ttggataaac tgacagcttc gcagagtgaa gaactagaat agatttgcct gtgaatccga1140
ggctgagaat ctcctgctca aaggatcatg tagatctcca tctaggtaac tatcaattca1200
gctgtaggat cgttaggagt aacttgaaac gacattgata gacgacgact aatgcagcgc1260
ggcactccag gcaataataa tatctgtagg caacgatatt atattgttac cattaagaca1320
gccgtggctc acatctgcgc tgcatgttgt cgcgttgttg attgcctcca tctcgatgac1380
agtaacaggc acaaagcaag cagtcagcaa tgcgtaatct cgatgctgga ttctccattg1440
aaaaagaata gcagaggacg tacagcgtaa atgctaggaa gaactcctga atcctgatgc1500
ctgaattgag aaaaagtcta gtcccctgca gtgctgcacc gccataccga gatccgctgc1560
aactcctgtt ttaaaacccc tccggataaa gcaattgacg aataatgttt cgcagaacac1620
tgcagaagga tggatgctgt gcacgaggga gatttggacg ggtttggatc gtcgctatgg1680
agtactccag gaccttatag taatatactc cgctttccgc aggcgtgcat agtacataca1740
tcctgcgcac tgcgttgctc ggcttactcg ctatttttag acagactccg gcggaaccag1800
agcagagaag caacgctgcc ctcgttgtgt tacttaagtc attgcctttc ctcacacaga1860
cgggtttcga tggaggttta ggtattttca tgacggctca aatgtcaaca tcgattgctc1920
gagactgatt caagactgtt accgtgcgtt cactgttcgt ctgctgctgt tgtcgcaaga1980
tgatcgatga gaatctaccc agtacgtcat cgccgtccgt cttggagaga atacgaagtc2040
ccgaattcac acgggtcaaa cgtcgagtac ttgctaacat caacaaccgc agccttctac2100
ctaaacaaag ccaacaagcg tcgggagaat aacaccatct acttcaccta ccgtgggagc2160
gaccccgaac ccgcctattc cttgcgctac cccgacctct cgtccccgca gtccaagaac2220
cgctatgccg ccgccctgtt cgatccctat gtccccagta tcgtctacgg agaggtcctg2280
ctgatcccgg aatggactcg accgaccttg tccgctgaag caatccgcca gaatggaggc2340
atcccgcccc cgccggaacc catcctccca tcccagttta ccatccagct ctacaatcct2400
gaccagcagg taacagtgca ttacaagccc aagtcgtgga attcgcctgc aacctgggct2460
ttcgagatgc cccagcgctc attccgtcag ccgtcgagct cgaagctgga tcgcacgcaa2520
agcgaccctg ccgtgtccga gtacacgccg aagttgaaat tcagctggcg tagggatggg2580
aagttgacca aggatctctc ctgtctcctg tcagggatga caacaacctc ccttgtagag2640
ccgaagacga agaggaagga gcctgatatc acgatctcgt tcttccggag tctgcgggag2700
atcacgctgt atgagcctaa tctctaccgg gtggagatgg aggacttcaa gggactggaa2760
ttggttctta tgctgggcgc agtcgtcatc cgggacgtct atttcagtcc gctcagagag2820
gcctttaatg tttctgatcc gccgaccggt gctggcaagg tgaaggacgc tgctgcaaag2880
ccagcagcga ccagccctac aggatcatct ccactggacg gaccagtggc gtctggcgca2940
ttgaatggag gcccttcccc caaaccggat cgtccgcaga gacctcatat cacaatccca3000
caggagaaac cacaacgacc gccgtcgccc gtggacatac ggtcgcagga ggaaatccaa3060
gccgagaaag tccgtatgca acagcgaagg gaatgggcgg cgcaggagga acagcgtcgc3120
acgcgaaaat tgctagaagc ggaagagaag gccagacgcc ggcgacaggt ggaagtcgac3180
aaggagacga aacgactgca gaagctctac ggcgaggaag agcggagagt cctggagcag3240
cagcgactac agcattcatc accaggaaaa ccggctactc cgccgcgcag cagtcagaac3300
aactgtctcc agcagcctca gcctcagcat cagatatacc ggcaccacaa ctcagcgtcc3360
gtagcgcatc tcaactcttc gagtccctat ctgcaagggc cgttatcgca ctcctccgtc3420
cagctcttcc aaccgccaca cttggctgcg tcgatgcaca gcaacggcaa tggattgacg3480
ccgcagatga aaaagaaaag cagtttcttt ggatttcgga agtctcccga cgaggctaaa3540
cttagcaaaa agcggagttc catgttttga tgatcattta gtttgttttc ttcattgcgg3600
ctggtgtatg gtgtttacgg tgtatgcagt gggttaacaa tacccaaccg ctctgggagc3660
atggaatgtg gtacgaagcc atgatggcct tccctttaat ggaagaacag cagatatata3720
cattgcacag agtagaccga tccagaagga tgcaggggct ggccatcagc cctgcaagct3780
gggcatgcaa cccttccaac ccggtaacct aaccttcgag gcagaagacc aaattattat3840
tcggcatgtc tgcatgcacg ctcaaatgcc taataataaa cccccgcatc catgcacagt3900
cacatggctc cagcagctgc agggtggctg agtccaacac gatgctcctt gcagcccaat3960
ccaatgcgtg ggatgcacat acagcgttcc agcgtgagcc atatcgtcac caccatcatc4020
tgtgcccgct tggcgcagtg ggtctgcact gttcaagcca ggtatatcca tctcgaaagt4080
cggcatgtgc ttccaggtcc tgcttccagg tccggctcct tgggcatccg gatgcagctt4140
tgcatgatta tctggtggag agaagagaag ggaagttgat catcgtccct gaaagaaaag4200
aactcacact agagcacctg ctgtaccgaa taacgaatgg agaaggaaag gggagacgcg4260
agttgcttca atagtacgga attaggctgc cgtctgagcg tggaaagttt ccctcaggca4320
gacacaacag ctccttcaat ccagatctca tagtacttgc gcgttatact ctgtgtggca4380
gcattcataa agtaaatata gactcgcctc gcattccccg tttcccctcg gaccagtcaa4440
accagaccag gccggatcgg gagagacaca gaagccacag ctatgcggcg tagcgctggc4500
gaaactgccg atctgacggc tgctctaggg tgaatacgaa taacttataa tctcattatt4560
actaactccc gccttatgag gtcatctcgt cccgcatgct tttcccttca actcctctcc4620
tctcccctct gcttgagcgg aatgcagact gacaacaact cgtcgctcca tcgacatccc4680
tcatcccatc gtccttcttc ctgcgagcca tgttggctca tcccatcagc catgtgacag4740
cagcaacaac atccgcatca ccatcaccat cgcaccccag cccggtccgc gagacttctt4800
ctgaacctga ctccgactcc gactccgact ccgggccatc gcgcacgtca ggagttgccg4860
tcatcgctgc cgccgcctca tcctcgacag tggagatgga tgcctcggat tccgaaatga4920
cagatgcgcc ggaggatgaa gggggcgtcg ctatcggccc ctacttggac ccgcactatc4980
acagcatgga ggccatcatg cagcagctgg gtgatgcaac ggggcacaca tccatcgatt5040
caactgcgtc ggacaacaat gacgatgacg acgacgacga cgacatggca gatcagcatc5100
gccgccatgt atactcgggt gaggaactgc cggtatctgg tgttgaatca tcgtctgctc5160
catggggcct gcatgatatc ctcgatggct atggtcacgt ccactcgaac ttgtctgatg5220
tcgatgtcga tgatttctac cgagccttta actacgacgc cttgccccag cttccgggca5280
cctctgcgac acctggtcac gacgatcatg cttactccta cgatgatgac tcgaccgctg5340
gtctgggcgc agatgaccac tttcctgctt ctgtcgtaca tggaacaagt atgtttattc5400
cctcatgcgt atcgtcgtat tgatcaatag actgacgtgg gtggtctagc gtctgaacga5460
aatctcacgg tcgaccagtt tattgaccaa tggcttatcc aatcatcctc cacacacctt5520
cggtactttc ttccacctaa accatccatt ccgcaacgct acggcttggc ccttttgaac5580
tgggcgcctc ctccgaggat acttcggccc agtcggtacc ccgatgcatc ctacgatatc5640
caacagatcc cgtggtggga agttatgcgc gtcaagagac cccaggtccg ggccttgcgc5700
gacgcctgtt atacgtcata tcataatctg gagtactccc ctggagtaag acgtgtctcg5760
taccttggtc aaatggctcc gcctggtccg ctgacgatga cttgtctagt ttgcgcaacg5820
actcccagac gacgagactt tctttcgggg caagtccatg cacacgaagc acagggcaac5880
catcgaacac tttcagcttc ggaatctcat gtcggtcgtc tcacacaaca ccgtggaatt5940
tgctcatgag tcaaaactgt attcctgggt ccccggctac gatgatctgg tctgcttgat6000
tgatctttcc aaaccatccg tggaatcctg cttccagtgc cccgtcaaga tctccacaat6060
gagctcccgc cacggcgtct cgattgctgg tggcttctgc ggcgaatatg cattgcgtgc6120
agcaggcacg gacgagccag ctgcggaggg ctacgtgacg aaggatttca atggcatcac6180
aacccacatc gacatcgtca aacaccggac cagtcggtct cctacggcga tcgttgcttc6240
caatgacaag cacctgcggg ttctggactg cgagtccaac cagtttgtgg cggattatgc6300
cctcttctgc gcggtcaact gcaccgctac gtcacccgac ggtcgccttc gtgcagtcgt6360
cggcgactcc ccggacgcat gggtcatcga agccgacacc ggacgacctg tccatccgct6420
ccgtggccac cgtgatttcg gattcgcttg tgcctggtcc ccggacatgc gtcacatcgc6480
gaccggtaat caggacaaaa ccgtgatcat ctgggatgcg cgcacctggc gcatcctcaa6540
gacgatcgaa tccgacgtgg caggctaccg atccctccga ttctcccccg tcggtggggg6600
tcctccgacg ttactgttgt gcgaacccgc ggaccggatc gccattgtcg acgcacagac6660
gtaccaaagt cgacaggtcc atgatttctt cggggagatc gggggcgccg attacacgcc6720
cgatggcggg gctatctggg tggccaacac cgatcagcat ttcggggggt tcatggagta6780
tgaacgatgg cagtggggga agagacttgg tctcagtgat ctgccgaatg agtggttacc6840
ggagtcggag ttggagaatg atgagcgatg cgttctgagc gcacgagaga gacagatgag6900
attctgcagg aatctaaccg atgaagaaca tgatgagttg ttgctagcct gacattaccc6960
tttttcctct tcttttcttc ttttcacccc ctacttatcg attacctatt gatctcttat7020
tcgtgtcatt ttgtcatctc atacattgca cattgcatgt ttcccgtact gtacatattt7080
cggatcgatg gcgtttacat cagcgacaga tatcatattg ggcgtgggtg catcatccgg7140
ggtaacaagt ggtctgtctg tgtatcagct ttcatgtatt tatttttatt attttttttt7200
tctcgattca ctcatcttct ggtgccttaa ctatatagcg tgatgcagtg aaacatacat7260
atgtatagat gggtactcat tgtacctagt tcttaaactc gatctctcta tacttgatca7320
actcgaagct tttcctttct tgaatatatg ctgtacttaa agtgttagct actttagctt7380
atatatacca tgtaaaaaga gataaaggac atctttaaat catgatgaat gtctttttag7440
atagatttta tttctagaaa agatctctga acattacact gattttaccc ttaacaattt7500
ctatatccca taatcaataa aaagaaattg cgaccactgg agatacatgc caaaaacaag7560
taataagaaa tgtacaagtc agcgacatag tacttttttg ggcccctaat aacaatatct7620
atattttaga atgcattcac atccacacct acctgaacca acgcctcaag aatccgatgg7680
ttccctcttt ccgcagcaac ctgtaacgga gccatccctc tcttatcctt cagtgtcgga7740
ctcgcccccg agttccccga agccgcagcg agatgtaagg cggatttgga cgagagggag7800
gaaccgcacg gggagtggga gtgaccttgg gattgggact ctgctggagg agaaatgtag7860
gattgcagga gactgcaatt atgtggtgtc gcgctatctg gggcatccag ggagggttgc7920
ggtctacatt tgttgtgaga gcgatggatg attctggggg tattctgcag tcatggtgac7980
aacttctcgc tcatagtaaa gaaatagctg caccatctct gcaatgtctg ctctttcttt8040
gatagtcgtc caagatctca gcagagccgt tgagctatat ttacatctgc tctatgctct8100
agaagtagct gtactacttc tacaataccc ttccaagagg aaattgtcag ggcattgcca8160
taatatgcag actatgggta tttatgtctg ctccatggtc aagaagtagt tgcaccattc8220
ttctgtttcc tggagaggag gcagcagcaa caaaggcacc cccatatttg tcaaaaaacg8280
gactttctga aatccgagca ttgacatctg caccattgtc aagaagaagt tgtgctgctt8340
ctatatttcc tgccttagca gcagcggcca gtgcgcttcc acaattctgg ccattgacgt8400
cagctccatg attaagaagc atctgtatta cttccgtaat gcgagcacct gtggattccc8460
tgttcgggtt agttttgccg ctttcggtat ggtcaccccg tacgccgtcc cgtaaacatc8520
gccctggacc tggtggtctg tttacctagg ggcagaataa catgggtccg tggtcatgtg8580
atgatgagta gggcggtgaa tagtggcaga atgaagtttg tagactgcaa cttctaattg8640
ctcgctgttg atgctggctg aggctatccg aagccctata tcgggcctag gccgtcacag8700
agcccatcag agaggggagg tagtgagtgt agaggtagga gggctgcaca cttacagcag8760
gactgtagct tagccaacac ctaggcaaat gccatcgcac ataatatcaa gaccttgaac8820
gaaagaactt caaacaccaa aaattcttgt tgagccatct aactacacgg gtctaacagt8880
ctatttcacg tcagcagagc tgatgctgaa gtctgaactt tcccggggtt cccccagtac8940
agtagtagta ctacacaggt gctgggcagt taaacagcag cttgcttctg gaagttacag9000
gtggaaaggt ggccgtttct ggcctcagtt gtggtcattt ttagcatcgt tttcgcaggt9060
cctcataatt tgatagtttg ttatgtggcc tgcatgcact tacattatca ttaatgttaa9120
ctaattatcc ttgtgcatca gtcagttatt ccctgctgca tgcatatcca gtcagctgat9180
tctacattgt acattattat taatgcataa tactagaaca aaaaaaagta aataaaacta9240
taccgaatat ggtatagtgt taatcccaac cggccgaata tgtgcatcgg atccgagatt9300
ctaccgtctt cggccggaat ctaaccgcat caggtagtat tgtcgtcgtt ttgcgtggga9360
tcaggttgtt taaccgattg ccaagtggag gttgcgttca ttgattattc aattagatag9420
atgactagtt tttcttcttt tttgttttcc tgtttttttt ctctttcttc ctatctctat9480
tttttcccct ttgctttctg gttgtgtttg ctggtttttc tggtttcaac tcttgcattg9540
ctgtcgggtg tctcttcttt gccctgttgg ggatacactg gtacagagtc tgacttcgtc9600
agcccctttt atggaatact cttgagaaaa attatttaaa tcatggaatc ggattcaatc9660
gatactatga tgaaatttca aactcgcgca ctgacttggc cgttatattg accaccctcg9720
agaaacacct tgctcaaaca tgaataccga tgaaaatatc agaaatgaaa acaaaaggat9780
aacaaacagg atatctatgc cactaatttg aacaattcca gtcgcgcctt gttgacctcg9840
tccggctctc ccgtctccaa ctcccgcagc atcccactag ccacatcata tatcagacca9900
tgcacgcaca gaccgcgctc ttgaatagca tccaacacgg cgctcttctg cttcaatacc9960
ttgaccccag ccaatacgtt cagttcggtc agcttcagtg cccgctcctc agttgaaaga 10020
gactggagag tgtccagatg ttgttcacga atctgtcgca gtgggagcag ccatgggtcc 10080
agaatcccaa gctgcttatt tcctagtgca gccgccacgc ctccgcagct agtgtgtccg 10140
cacagaacga tgtgcttgac tttaagttgt tttacggcat attcaatgac ggcagacgag 10200
ctgagatcgg ttgggtgaag gatgttggcg atattcctgt ggacgaagac atcgccaggc 10260
ttcaggccaa ggagcgtcgt ctcagggcat cgtgagtcgg agcatccgat ccacacaatt 10320
tcaggacttt ggccggtcgc aagggttggg aagagatctg gatgctcttg ggctgttttt 10380
gtcgcccact ccttgttctg agtcagaaca gtcgccaacc gatctgggtg gtaagaattc 10440
gaagaagcga atggagacat gatggtcgcg cctgcagatg atgcagcggt tgatgaaagg 10500
gggcggcaag aagttgtgaa attccctgaa aggttgtggt gatggtgaca attattgtgg 10560
ctcgcctgaa gccgttgtgg agagtgggtt gcgatagccg aggaaatgcg atgatgacaa 10620
tcgcggaatt ctcttgtgga atgaaaagag cgactgaact tctgaatcac taccctcgca 10680
aagttcatgc caaattacac aaatggtttg cgatgcgtct agtatgccgg tgtaacggag 10740
gaatggttga gtcagaaaga aaaaagaacg aaagggtttt ctgcggtctg tgtgggacag 10800
ccaaggaagc agaagaagga gggtgcaatg taagcaaggc tcgcacttga gaccagcaca 10860
acaggattgt cttctgttgg ttgtttcaag atctgatgtg gatggataga tgatggtttg 10920
gggtgaacag tggaagtggt gctaacatgc agagtcaaca aaatcaattt ttggcgggag 10980
gccctttgcc aagccaaccg gaagcggaac cagctccgtt gaacatcaga aaatgacaca 11040
tcgaaatggt ttcaatgact ggtatattct atgtctgcaa gaaaagaaaa agcagacatc 11100
tggctgttat tgatctgatt tctctgaggg tccaggctga aaagaaagag actagccaag 11160
cagtgcagta cagcgtaaat atccaagggg agcttaattc cgtgtaattc ttttcctgat 11220
aattctagca tattaaacta ttctatccct gcgagaaaaa acgaaaaata tgtacttcag 11280
gaaagaggaa tttaattctc cacaatattc atgcaagatg tagatatgta attacagcgt 11340
agatgccaga ttcgcggtgt gcgcatatgg cccgttacgc cctccgttgt tcagaggtag 11400
agaacttcac gggctaaagg atcatttgtt gatttctctg cataccaagg aacactcctc11460
agtcaatcct tgactatact ttccgctatt gtgcggagaa aaggatatgc gtggatgtct11520
aattcaagct gctacatgtt catgtcgtct aggtgcgcca gcatgtgata aaagccttga11580
agtgatgcat gcaaggcaat ttccaaatct atcaagcatg tactccgtag tagtcgtagt11640
aagagtacaa atggcagaag catatggaat cgattcatcc ctcccaacaa gcagcagacc11700
gtgacacctg tggctccata ccgattaacc cgagtcaaag tagatatgcg tacatggctc11760
atgagaaaac aaagacgtat cataatgcag acagtagaga gtggagagaa cagacaactg11820
caggacgaac caaagaaaca aaattcatta gcaagaaaag tacttgacca gaacagaaca11880
agcttcccag caacttgtgt atccattatt acggagtaaa atcattcaat tcagaacctg11940
tccccccaga gcccgacaat cgccatggca agggcagtcc ccgtaagcca cgatcttgtt12000
ctctgaagta gacgttcctt ctccatttcc ttctgtacgc tctcaccgtt aaccttatcc12060
tccgcctcga cgacaacgat gtcatcttct ttcttcggag aggtggtctt cgccggaaga12120
cagccgctag gaagcagctg ggtatcccgg accaagctga gaaaccactc tttcaacccg12180
atgtcccggt ggaaccagaa gtcgacgccg tacgagccca aggtcgagac catggacacc12240
cagagcaggt agggatgttt gcggtgggct ggggagacga agaaagcgat caggacgcag12300
ccgttggcga tgttggcgag ccggagggcg tgtttgcggc ttaaccgctt gatctcgtcg12360
aggcagcggg cggcggtgga ggaggttggg aggtgttgca gggagggaat ggtgatgttg12420
gcggaggagt aggaaaggcc ctatagtcga aaagttggtt agccatgaac aaaagaggga12480
gagagctatt cgtctcatct cgcatgtatg gccccggata ggaatagaaa gctgaactca12540
gacccagcca tgacttcgtc aaataatgca taccgtcaag aggccgagag acactgtccc12600
gacgaacttt gtaagtgtaa ttgggcacac cattgtgggt gatggacttt ttctggacct12660
tttctggact ttttctgaag gagcaaagac tgtagcggga cgtcggatac agtccacgcg12720
aaggcagtcc agtccggatc agtgattgca gatacggaat cccggcgaga ttgaacgtgt12780
tgatgtgcga aaggtgaagg acggacagac ggcgcgaatc ggaagatggc agggatggaa12840
gtaatcggtc gactggggaa gtcaacggaa gtgacgtcga attgaaactg actcggctac12900
cgaccttagt tatcccttca ttgattaagt catggattat tactgctact ctcggttcga12960
ggaacatgga tgaggattcc gctattcttg cctgcattct tgcttagtat tcaggaatat13020
gccggtcgcg acaggtttct gtttctttta tccacacgga gtcgagtgaa ctccagaata13080
ctccgtacca gcaggggatt cattctctgc aagagcacaa tataaaagga tataggccgt13140
gtcattcaag tcctgattga ggctgagaac acttttttcc atgactacag gcgttgtctc13200
gtgaagatat catttcccga ggctgtttct gccgcggccg aaatccgtca tggcgtcgat13260
tgctttatgg gacgaccgcc ctccatcaac acaaacagcc acggtagctt tataaaggcc13320
aatctcaaac aggtccattc tcttctcgac ctgctgcctc ttcggaaatt tatccaggat13380
gtccaccccc aagtcagatt acgtcagtga cgactggaag gacggcttgt tcagtgagtc13440
ttcgatcttt tattctcctc aagcaaagcg aattctaatt gccaatctcg tcgagttaga13500
aaataaggtc gtcttctgta ccggtggtgc gggcaccatc tgcagcgctc aagtccgtgc13560
cctggtccat ctaggtgcag atgcctgcat cgtggggaga aacgtcgaga agacagaacg 13620
tgctgccaag gacattgcga gtgttagagc cggcgcgaga gtcatcggta ttggcgcggt 13680
ggatgtgcgg aaatatgaca gtctgaagga cgctgctgag cgctgcatta aggagttggg 13740
tggcattgac tttgtgatgt gtgtgctgtt ttctcttctg aatcgtttcg cccatttgga 13800
cagggatggt ctagattgtt aacgtccacg ttcacgccca cagtgcaggc gcggcaggaa 13860
acttccttgc gtcgatcaac caactttccg tgaacgcctt caagtcagtg atggacatcg 13920
atgtcctggg ctcctacaac accgtcaagg ccaccattcc gtacctcgtt gaatcagcga 13980
agaaacacaa agttgattcc aaaacccgtg agcttctccc atctccttga ttgtcaatgg 14040
tatttcttaa cggaactcca tttctacttc cagtccagcc ttcccctgcc ggcacaggtg 14100
gaagaatcat cttcgtcagt gccacgctcc actacagagg atcacccttc cagacgcatg 14160
tggctgtcgc caaagccgga gtggatgcgc tgtcgaacaa tgtggctatc gaatttggcc 14220
ctctgggagt aacttccaat gtcattgctc ctggaccgat tgcacaaacg gaggttagcc 14280
tttctatccc tctttccccc tcctcccttt cgatgctatt tcaaatggtg aactgcgtcg 14340
gtagcgtgct aattcacacg acaaaagggt ctcgaacgtc tcctcccgcc ggatgtcaaa 14400
gaaatgtaca ccaaatcgca accacttggt cggctgggat ctgtcagaga catcgccgat 14460
gcgacggtat atctcttgtc gaacactgga agctatgtaa atggacaatt attagttggt 14520
atgttcaggg tggatcatat tgtttatgct ccctgcagcc gacggacgac taactggaat 14580
gattaatcct acagttgacg gtggctcctg gcgcaccagc ggcgatttct cgtacccgga 14640
cttcttgttg gcaggaggag aatttgaagg agtgaagggg aagaaatcga aactttgact 14700
gactctttgc tggccgattg atcatatacc ccgggcttga tctcacctct tgaggtacca 14760
actctaaatt tacattcctt atcttatatt aattctcata gtaaatccca tgtcagaatt 14820
aagagtcgac caattattat agagtagaac tggagaactg ggtatttcca ctctgcgaca 14880
ctggactgcg gggatagacc aaggctgtga taccttctga agagacccag gagacgcatg 14940
tgatgcaata ggagaacgca ttattattat tgattgacat gaatcgaccc atatgggagg 15000
ggaacacaga gagtccagat gtctgtgcca agtgctaagc tctgcaacta ctccgcatcc 15060
ccagacaatc ttcggcgaca gacgtgctct gaaccagact gaaccagacg ctgggttggc 15120
cctcactttg gctgtaatgc atgttgatgc atttgttgag attgcggtca atctatatta 15180
ttatgctacg atgttgtcct cttcatgtgc tgcgagcatg ttgtcgggta aacctctgtc 15240
acgcagagca ctgcagctgc tcaaccgagg catccaaggc ccctgtgatt gttgaacgct 15300
ggatcataat taatagactg cgtctcctcc agcggcctta taccgggtct cttctggtgt 15360
cgtcctgttt ggctcgattt ccccataatc gccctcttcc gcccttccga agctcttttc 15420
ttgacatcat ttcccggttc caggaacacc acccatcaaa gcttgcctcg cggcgcgcgc 15480
gtccttcctg gcacgtttcc tttgtcgtgc acatttgcct ggcttgagaa gaccgttacg 15540
aggaacattg tcgtatataa tatccggcgt tacatgatgc cgcctttgac ggctttggct 15600
ggtaacgttt gattgaaaca ccgcccatag ctccccagtg atcattcttt ctaactttgg 15660
tcctttgcgt cccaggtatc cccctcaaag tcgccattcc cgctgctgca acgactctag 15720
cctacctgaa cgccagatgg tccgtgtcgc acgatgtggc tctcggtcgg gccttcgctc 15780
atacagtctt gcagccggcg ttggccgaac gtaatgatcg cctgaacctc ttttacctcc 15840
tggagtacta cgctctcagc ccgaagctag ccaacaacac ctggattgta tacaatggcc 15900
gcagttggac gttccacgag gggtacgaga tggtattacg ctatggccac tggctcaaga 15960
cggttcacgg ggttaagccc aaggagatcg tggccatgga ctttatgaac tcgtcgacct 16020
tcatcttctt gatgttcggt ctctggagta tcggtgccgt gcccgccttt atcaactaca 16080
acttgagtgg caagcctctg acgcactcgg tgaaggcgtc caccgcaagg ttgctgttcg 16140
tcgatgagga tgttcgggag tgcttcccgc aggaacagct ggatatcttt acgtctccgg 16200
atttccatga ggacaagggt ccgatgactg tcgtcttctt taccccagat ctggaggcgc 16260
agattctgca gacgcagccg gtccgtgagg atgacagagc gcgacagggc gtcatccgcc 16320
gcgacatggc catcttgatc tacaccagcg gcacaacggg cttgcccaaa cccgctattg 16380
tcagctgggc caagtgttac gccggcggtt atttttccgg ctcttatatg ggtttgaaac 16440
agtccgatcg attctacacg gtaagctccg ttcatcatct cttcgtttcc tacgttcatt 16500
gctgacgatt cgaggacagt gtatgcctct ctaccattca tccgccacgc tcatgtgctt 16560
ctgtgcttgt ttaacggtcg gatgcacatg tatcatcggc cggaaatttt ctgcccgtaa 16620
tttctggaaa gaggtccgcg agaatgatgc gactgcggtt caatacgtcg gtgaaaccat 16680
gcgctatctt cttgccgtcc ctcccgaggt cgatcccgtg acgggcgagg atcttgacaa 16740
gaagcacaat gtacgaatta tctttggcaa cggactacgc ccggacgtct ggaataaggt 16800
caaggagcgt ttcaatatcc caactgtatg cgaattctac gcatctacag agggtagctc 16860
ggccacgtgg aacttatcat cgaacagcca cagtgcgggt gctattggta ggaatggcgc 16920
catcgccaga tttgtcttcg aacgtcgcca tgccattgtt gctgtagacc atgagagtca 16980
gcagccctgg cgggatccca agacggggct ttgcaaggcg gtgcctcggg gagaaccggg 17040
cgagctgctg cttgctctcg atgccaagga cactgaagcc atgttccagg gttacttcaa 17100
gaacaacaag gcgacagaag acaagatcat ccgcgatgta ttaaccaagg gtgacgccta 17160
tttccgcact ggcgatatga ttcggtggga cagcaatggt ctgtggtact tctctgatcg 17220
catgggcgat accttccggt ggaggagtga gaatgtgtaa gtggctataa cctgtcggcc 17280
tgttgcatct atactaatgg caataatttc tagttccacg agtgaagttg ctgaagtact 17340
gggagcgcac cctgaagtgc acgaggccaa cgtctacggc gttgccttac ctcaccacga 17400
cgggcgtgct ggatgcgctg ccatcgtctt tagacatcag gcccagaata cagacccttc 17460
gtcaggggtc attgacccgt caccccaggt gcttggtgac gttgcatcct acgcattgaa 17520
gaacctgccc aaatacgcgg tgcccatctt cctgcgcgtg acgccagaga tgcaggcgac 17580
ggggaataac aagcaacaga agcatgtcct gcaaaaggaa ggcgtggatc cttccaaggt 17640
gaatgccaaa gacaagctat attggcttcg gggtgctacg tatgtgccat tccagcagaa 17700
ggactgggag aggttgaatg ccgggcaggt caggctttga ttcctatctg tttgaaaatt 17760
gtgtctttct tcttcccctt ccttttgtcc tagaaccaac cctgttcctg aacgcaactg 17820
cgtgtaatac cccgtattat tcgtagatct cttgttatat aataactata gaagagtgga 17880
cattgctctc tgcgcatgtt gcttttgctt aaagtatctg ccctgtgtat attcatggga 17940
tatgattgac ccatctgatt gccagcgagg taagatgcaa ctttggcagg caatgtagag 18000
ggagatatcc atgtatctat tctcattatg attcaaaaaa actgaatgca gctccattca 18060
tgtcttggca tgtcttgatg cacttcctcc ctcaacattt gcttccatgg acttcagata 18120
agttttcgat aaattccatc tggctttctc ttgttataat tacccgcatt gtcttgttct 18180
gtcgaatacc atgctcagtc agtcaatcac aatgagtcgc tgcaaagctt tgtcacgaga 18240
atcagcacaa tctgatctgt ttcttccttt gttcgagttc gcaatgcacg caataagatg 18300
aaagaagtaa agaaagagag atagggccat gctgggtaac cgttgtttat tatgtccttc 18360
caaaacaatg ggtggatcgg ggtatggata tataaatatg ggatggtatc ttctcgtcca 18420
attcttctgg gtaagtattt tgattcctcg aaaaccatac tatttaatta ttctgattta 18480
tttctcttta ttctcatctc atctagcctc gctttattca tagttgtcta gttttcattg 18540
cgcatgcgtc gatctaatcg acttccaaga actaccggta ggattatatt tccggttgtt 18600
atattaatac actctgccag atcgaaaata agtaatcata accatggctc gccacttgaa 18660
cagccttcca agcatggata gggaaggaga ttcgaatatg gactcatctc agatgtccca 18720
attgagcctg gaagatctat cggaacaggt aagaatatac ctcacgtaca gcaaacccca 18780
gtaaggaaaa gaaagggaaa ttgactgac 18809
<210>3
<211>1869
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(1869)
<400>3
atg gcc aca gtc aag gtc atc gat ttc atc caa acc cca ttt gac ttt 48
Met Ala Thr Val Lys Val Ile Asp Phe Ile Gln Thr Pro Phe Asp Phe
1 5 10 15
ctt atc gtc ggc ggt gga act gcc ggt ctc gtc ctc gcc gct cgt ctt 96
Leu Ile Val Gly Gly Gly Thr Ala Gly Leu Val Leu Ala Ala Arg Leu
20 25 30
tcc gag gag cca ggc atc caa gtt ggg gta atc gaa gct ggg tct ctt 144
Ser Glu Glu Pro Gly Ile Gln Val Gly Val Ile Glu Ala Gly Ser Leu
35 40 45
agg ctg ggg gat ccc aag gtc gac ctt cct acc gga ccg ggt cag atg 192
Arg Leu Gly Asp Pro Lys Val Asp Leu Pro Thr Gly Pro Gly Gln Met
50 55 60
ctg ggc gat ccc ggc tat gac tgg aat ttc gag agc atc cca cag gct 240
Leu Gly Asp Pro Gly Tyr Asp Trp Asn Phe Glu Ser Ile Pro Gln Ala
65 70 75 80
ggc gcc aat gcg aaa gcg tat cat atc cca cga ggc aga atg ctg ggc 288
Gly Ala Asn Ala Lys Ala Tyr His Ile Pro Arg Gly Arg Met Leu Gly
85 90 95
ggc tcg agt gga atc aac ttc atg tct tac aac cga cca tcg gcc gaa 336
Gly Ser Ser Gly Ile Asn Phe Met Ser Tyr Asn Arg Pro Ser Ala Glu
100 105 110
gac att gat gac tgg gcc aac aag ctg ggt gtc aaa gga tgg aca tgg 384
Asp Ile Asp Asp Trp Ala Asn Lys Leu Gly Val Lys Gly Trp Thr Trp
115 120 125
tct gaa ctg cta ccg tac ttc aaa aga agc gag aac ttg gag ccc atc 432
Ser Glu Leu Leu Pro Tyr Phe Lys Arg Ser Glu Asn Leu Glu Pro Ile
130 135 140
gag ccc agt gca agt tgt cca gtc agc ccg aaa gtt cat ggc acc ggc 480
Glu Pro Ser Ala Ser Cys Pro Val Ser Pro Lys Val His Gly Thr Gly
145 150 155 160
ggg cca atc cat act tcg ata ggt ccc tgg caa gcg ccc ata gaa gaa 528
Gly Pro Ile His Thr Ser Ile Gly Pro Trp Gln Ala Pro Ile Glu Glu
165 170 175
tca ctc ttg gct gcg ttc gat gag gcc gct cgt ctg cag cgg ccg gcg 576
Ser Leu Leu Ala Ala Phe Asp Glu Ala Ala Arg Leu Gln Arg Pro Ala
180 185 190
gag ccc tat agc ggt gcc cat ctg gga ttc tat agg tcc ttg ttc aca 624
Glu Pro Tyr Ser Gly Ala His Leu Gly Phe Tyr Arg Ser Leu Phe Thr
195 200 205
tta gac aga acc agc acg cca gtc cgg agc tat gcg gtc agc ggc tac 672
Leu Asp Arg Thr Ser Thr Pro Val Arg Ser Tyr Ala Val Ser Gly Tyr
210 215 220
tac gcc ccc gtt atg gga cgc cca aat ctg aag gtc cta gaa aac gca 720
Tyr Ala Pro Val Met Gly Arg Pro Asn Leu Lys Val Leu Glu Asn Ala
225 230 235 240
cag gtg tgc cgc atc tta ctc tcc gat gct tca gac gga ata ccc gtc 768
Gln Val Cys Arg Ile Leu Leu Ser Asp Ala Ser Asp Gly Ile Pro Val
245 250 255
gcg gaa gga gtt gaa ctg cac cac gcg gga gcc cgc tac gcc gtg tca 816
Ala Glu Gly Val Glu Leu His His Ala Gly Ala Arg Tyr Ala Val Ser
260 265 270
gct aga aga gaa gtg att ctc agc gct gga tct gtc cag agt ccc cag 864
Ala Arg Arg Glu Val Ile Leu Ser Ala Gly Ser Val Gln Ser Pro Gln
275 280 285
ctc ctg gag ctt tct gga att gga gat ccg agc gtc ctc gaa ggc gct 912
Leu Leu Glu Leu Ser Gly Ile Gly Asp Pro Ser Val Leu Glu Gly Ala
290 295 300
ggg att gct tgt aga gtg gcc aac acc gac gtg ggc agc aat tta caa 960
Gly Ile Ala Cys Arg Val Ala Asn Thr Asp Val Gly Ser Asn Leu Gln
305 310 315 320
gaa cac acc atg tca gct gtg tct tat gaa tgt gca gac gga atc atg 1008
Glu His Thr Met Ser Ala Val Ser Tyr Glu Cys Ala Asp Gly Ile Met
325 330 335
tcg gtg gac tct ttg ttc aag gat ccc gct ctg cta gaa gag cat cag 1056
Ser Val Asp Ser Leu Phe Lys Asp Pro Ala Leu Leu Glu Glu His Gln
340 345 350
agc ctc tac gcc aaa aac cat tcc ggg gcc ctg tct gga tcg gtc agt 1104
Ser Leu Tyr Ala Lys Asn His Ser Gly Ala Leu Ser Gly Ser Val Ser
355 360 365
ctg atg ggc ttt act cca tac tcg tca cta tcc acc gag act cag gtc 1152
Leu Met Gly Phe Thr Pro Tyr Ser Ser Leu Ser Thr Glu Thr Gln Val
370 375 380
gac gcc acc atg gca cgt atc ttc gac gct cct agt gtg agc ggc aga 1200
Asp Ala Thr Met Ala Arg Ile Phe Asp Ala Pro Ser Val Ser Gly Arg
385 390 395 400
ctg tct cag cag aac gcg agc tac caa cgc cgg cag cag gag gct gtc 1248
Leu Ser Gln Gln Asn Ala Ser Tyr Gln Arg Arg Gln Gln Glu Ala Val
405 410 415
gcc gca cgg atg caa aac cgc tgg tcc gca gac atc cag ttc atc ggc 1296
Ala Ala Arg Met Gln Asn Arg Trp Ser Ala Asp Ile Gln Phe Ile Gly
420 425 430
acc cct gcg tat ttc aat acc gca gca ggt tac gca agc tgc gcc aag 1344
Thr Pro Ala Tyr Phe Asn Thr Ala Ala Gly Tyr Ala Ser Cys Ala Lys
435 440 445
atc atg tcg ggt ccc ccc gtc ggc tac agc gca tgc tac tcc atc gtc 1392
Ile Met Ser Gly Pro Pro Val Gly Tyr Ser Ala Cys Tyr Ser Ile Val
450 455 460
gtc agc aac atg tat cct cta tcg cgc ggg agt gtg cac gtg cgg acc 1440
Val Ser Asn Met Tyr Pro Leu Ser Arg Gly Ser Val His Val Arg Thr
465 470 475 480
tcg aat cca atg gac gcg ccg gcg atc gat ccg gga ttt ctt tcc cat 1488
Ser Asn Pro Met Asp Ala Pro Ala Ile Asp Pro Gly Phe Leu Ser His
485 490 495
ccg gtg gat gtc gat gtt ctt gct gct ggc ata gtc ttt gca gat cga 1536
Pro Val Asp Val Asp Val Leu Ala Ala Gly Ile Val Phe Ala Asp Arg
500 505 510
gtc ttc cgc tcg acg ttg ctc aac ggg aaa gtt cgt agg cgg gtg agt 1584
Val Phe Arg Ser Thr Leu Leu Asn Gly Lys Val Arg Arg Arg Val Ser
515 520 525
ccg cct gct gga ctc gac ctt tcg aat atg gac gag gct cgc cag ttc 1632
Pro Pro Ala Gly Leu Asp Leu Ser Asn Met Asp Glu Ala Arg Gln Phe
530 535 540
gtt cgc aat cac atc gtg ccc tat cac cat gct ttg ggg acc tgc gcg 1680
Val Arg Asn His Ile Val Pro Tyr His His Ala Leu Gly Thr Cys Ala
545 550 555 560
atg gga cag gtg gta gat gag aag ctg cga gtg aaa ggg gtt cgg cgt 1728
Met Gly Gln Val Val Asp Glu Lys Leu Arg Val Lys Gly Val Arg Arg
565 570 575
ctt cgt gtt gtt gat gcc agt gtc atg ccg atg cag gtc agc gcc gcg 1776
Leu Arg Val Val Asp Ala Ser Val Met Pro Met Gln Val Ser Ala Ala
580 585 590
atc atg gct act gtg tat gcc atc gcc gaa cgg gct tcg gat atc atc 1824
Ile Met Ala Thr Val Tyr Ala Ile Ala Glu Arg Ala Ser Asp Ile Ile
595 600 605
aag aag gac tgt ggg ttt ggc cgt cga ctc cgt gct cat ata taa 1869
Lys Lys Asp Cys Gly Phe Gly Arg Arg Leu Arg Ala His Ile
610 615 620
<210>4
<211>622
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>4
Met Ala Thr Val Lys Val Ile Asp Phe Ile Gln Thr Pro Phe Asp Phe
1 5 10 15
Leu Ile Val Gly Gly Gly Thr Ala Gly Leu Val Leu Ala Ala Arg Leu
20 25 30
Ser Glu Glu Pro Gly Ile Gln Val Gly Val Ile Glu Ala Gly Ser Leu
35 40 45
Arg Leu Gly Asp Pro Lys Val Asp Leu Pro Thr Gly Pro Gly Gln Met
50 55 60
Leu Gly Asp Pro Gly Tyr Asp Trp Asn Phe Glu Ser Ile Pro Gln Ala
65 70 75 80
Gly Ala Asn Ala Lys Ala Tyr His Ile Pro Arg Gly Arg Met Leu Gly
85 90 95
Gly Ser Ser Gly Ile Asn Phe Met Ser Tyr Asn Arg Pro Ser Ala Glu
100 105 110
Asp Ile Asp Asp Trp Ala Asn Lys Leu Gly Val Lys Gly Trp Thr Trp
115 120 125
Ser Glu Leu Leu Pro Tyr Phe Lys Arg Ser Glu Asn Leu Glu Pro Ile
130 135 140
Glu Pro Ser Ala Ser Cys Pro Val Ser Pro Lys Val His Gly Thr Gly
145 150 155 160
Gly Pro Ile His Thr Ser Ile Gly Pro Trp Gln Ala Pro Ile Glu Glu
165 170 175
Ser Leu Leu Ala Ala Phe Asp Glu Ala Ala Arg Leu Gln Arg Pro Ala
180 185 190
Glu Pro Tyr Ser Gly Ala His Leu Gly Phe Tyr Arg Ser Leu Phe Thr
195 200 205
Leu Asp Arg Thr Ser Thr Pro Val Arg Ser Tyr Ala Val Ser Gly Tyr
210 215 220
Tyr Ala Pro Val Met Gly Arg Pro Asn Leu Lys Val Leu Glu Asn Ala
225 230 235 240
Gln Val Cys Arg Ile Leu Leu Ser Asp Ala Ser Asp Gly Ile Pro Val
245 250 255
Ala Glu Gly Val Glu Leu His His Ala Gly Ala Arg Tyr Ala Val Ser
260 265 270
Ala Arg Arg Glu Val Ile Leu Ser Ala Gly Ser Val Gln Ser Pro Gln
275 280 285
Leu Leu Glu Leu Ser Gly Ile Gly Asp Pro Ser Val Leu Glu Gly Ala
290 295 300
Gly Ile Ala Cys Arg Val Ala Asn Thr Asp Val Gly Ser Asn Leu Gln
305 310 315 320
Glu His Thr Met Ser Ala Val Ser Tyr Glu Cys Ala Asp Gly Ile Met
325 330 335
Ser Val Asp Ser Leu Phe Lys Asp Pro Ala Leu Leu Glu Glu His Gln
340 345 350
Ser Leu Tyr Ala Lys Asn His Ser Gly Ala Leu Ser Gly Ser Val Ser
355 360 365
Leu Met Gly Phe Thr Pro Tyr Ser Ser Leu Ser Thr Glu Thr Gln Val
370 375 380
Asp Ala Thr Met Ala Arg Ile Phe Asp Ala Pro Ser Val Ser Gly Arg
385 390 395 400
Leu Ser Gln Gln Asn Ala Ser Tyr Gln Arg Arg Gln Gln Glu Ala Val
405 410 415
Ala Ala Arg Met Gln Asn Arg Trp Ser Ala Asp Ile Gln Phe Ile Gly
420 425 430
Thr Pro Ala Tyr Phe Asn Thr Ala Ala Gly Tyr Ala Ser Cys Ala Lys
435 440 445
Ile Met Ser Gly Pro Pro Val Gly Tyr Ser Ala Cys Tyr Ser Ile Val
450 455 460
Val Ser Asn Met Tyr Pro Leu Ser Arg Gly Ser Val His Val Arg Thr
465 470 475 480
Ser Asn Pro Met Asp Ala Pro Ala Ile Asp Pro Gly Phe Leu Ser His
485 490 495
Pro Val Asp Val Asp Val Leu Ala Ala Gly Ile Val Phe Ala Asp Arg
500 505 510
Val Phe Arg Ser Thr Leu Leu Asn Gly Lys Val Arg Arg Arg Val Ser
515 520 525
Pro Pro Ala Gly Leu Asp Leu Ser Asn Met Asp Glu Ala Arg Gln Phe
530 535 540
Val Arg Asn His Ile Val Pro Tyr His His Ala Leu Gly Thr Cys Ala
545 550 555 560
Met Gly Gln Val Val Asp Glu Lys Leu Arg Val Lys Gly Val Arg Arg
565 570 575
Leu Arg Val Val Asp Ala Ser Val Met Pro Met Gln Val Ser Ala Ala
580 585 590
Ile Met Ala Thr Val Tyr Ala Ile Ala Glu Arg Ala Ser Asp Ile Ile
595 600 605
Lys Lys Asp Cys Gly Phe Gly Arg Arg Leu Arg Ala His Ile
610 615 620
<210>5
<211>828
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(828)
<400>5
atg gcc ttt cca ccg tcc gct gcg gcc aag tgt aag cag cat gga cga 48
Met Ala Phe Pro Pro Ser Ala Ala Ala Lys Cys Lys Gln His Gly Arg
1 5 10 15
gcc gtc ttc gtg acc ggc gca tcg aag ggc atc ggc cgc gcg aca gcg 96
Ala Val Phe Val Thr Gly Ala Ser Lys Gly Ile Gly Arg Ala Thr Ala
20 25 30
gtt gca ttc gct caa gcc ggc gcg ccc tcc ctt gcg ctc gga gca cga 144
Val Ala Phe Ala Gln Ala Gly Ala Pro Ser Leu Ala Leu Gly Ala Arg
35 40 45
tcc tcc ctc gac gcc gct gag acg gcg gtg ctc gac gcc gcg aag tcc 192
Ser Ser Leu Asp Ala Ala Glu Thr Ala Val Leu Asp Ala Ala Lys Ser
50 55 60
gcg ggc cat ccg ccg ccg cag gtc ctc aag ctg aca ctg gat gtc gcg 240
Ala Gly His Pro Pro Pro Gln Val Leu Lys Leu Thr Leu Asp Val Ala
65 70 75 80
gat gag cag agt gtg gcc gat gca gcc gcc agg gtc gaa cgg gcg ttc 288
Asp Glu Gln Ser Val Ala Asp Ala Ala Ala Arg Val Glu Arg Ala Phe
85 90 95
ggc cgt ctc gac atc ctg gtg aat aat gcc ggc cgg gtc gaa aaa tgg 336
Gly Arg Leu Asp Ile Leu Val Asn Asn Ala Gly Arg Val Glu Lys Trp
100 105 110
gtc ccg ctt gcg gag acg gat ccc aag tcc tgg tgg gcg acg tgg gag 384
Val Pro Leu Ala Glu Thr Asp Pro Lys Ser Trp Trp Ala Thr Trp Glu
115 120 125
gtc aac ctc aag ggc acg tac ctc atg acg agg gcc atg ctg cct ctt 432
Val Asn Leu Lys Gly Thr Tyr Leu Met Thr Arg Ala Met Leu Pro Leu
130 135 140
ctt ttg aag ggc ggc gaa aag acc att gtc aat atg aac tcc atc ggc 480
Leu Leu Lys Gly Gly Glu Lys Thr Ile Val Asn Met Asn Ser Ile Gly
145 150 155 160
gcc cac ctg acc cgg cct ggt gcc tcg gcc tat caa acc ggg aaa ttg 528
Ala His Leu Thr Arg Pro Gly Ala Ser Ala Tyr Gln Thr Gly Lys Leu
165 170 175
gcg atg ctg cgc ttg acg cag ttc act tgt gtg gag tac gca gcc cag 576
Ala Met Leu Arg Leu Thr Gln Phe Thr Cys Val Glu Tyr Ala Ala Gln
180 185 190
ggc gtt ttg gcc ttt gcc att cac cca ggc gcc gtg gat acc gag ttg 624
Gly Val Leu Ala Phe Ala Ile His Pro Gly Ala Val Asp Thr Glu Leu
195 200 205
gcg tcc aac ttg ccc gaa gac acc aag gca aag ttg gtg gat tcg ccg 672
Ala Ser Asn Leu Pro Glu Asp Thr Lys Ala Lys Leu Val Asp Ser Pro
210 215 220
gaa ttg tgc gcc gac acg att gtc tgg ttg acg cag gag aaa cag tcc 720
Glu Leu Cys Ala Asp Thr Ile Val Trp Leu Thr Gln Glu Lys Gln Ser
225 230 235 240
tgg ctg gcc gga cgc tat ttg agt gcc aac tgg gat gta gca gag ttg 768
Trp Leu Ala Gly Arg Tyr Leu Ser Ala Asn Trp Asp Val Ala Glu Leu
245 250 255
atg gct cgg aag gag gag att ctc cag ggc gac aag ctc aag gtc aag 816
Met Ala Arg Lys Glu Glu Ile Leu Gln Gly Asp Lys Leu Lys Val Lys
260 265 270
ttg gtt ctg tag 828
Leu Val Leu
275
<210>6
<211>275
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>6
Met Ala Phe Pro Pro Ser Ala Ala Ala Lys Cys Lys Gln His Gly Arg
1 5 10 15
Ala Val Phe Val Thr Gly Ala Ser Lys Gly Ile Gly Arg Ala Thr Ala
20 25 30
Val Ala Phe Ala Gln Ala Gly Ala Pro Ser Leu Ala Leu Gly Ala Arg
35 40 45
Ser Ser Leu Asp Ala Ala Glu Thr Ala Val Leu Asp Ala Ala Lys Ser
50 55 60
Ala Gly His Pro Pro Pro Gln Val Leu Lys Leu Thr Leu Asp Val Ala
65 70 75 80
Asp Glu Gln Ser Val Ala Asp Ala Ala Ala Arg Val Glu Arg Ala Phe
85 90 95
Gly Arg Leu Asp Ile Leu Val Asn Asn Ala Gly Arg Val Glu Lys Trp
100 105 110
Val Pro Leu Ala Glu Thr Asp Pro Lys Ser Trp Trp Ala Thr Trp Glu
115 120 125
Val Asn Leu Lys Gly Thr Tyr Leu Met Thr Arg Ala Met Leu Pro Leu
130 135 140
Leu Leu Lys Gly Gly Glu Lys Thr Ile Val Asn Met Asn Ser Ile Gly
145 150 155 160
Ala His Leu Thr Arg Pro Gly Ala Ser Ala Tyr Gln Thr Gly Lys Leu
165 170 175
Ala Met Leu Arg Leu Thr Gln Phe Thr Cys Val Glu Tyr Ala Ala Gln
180 185 190
Gly Val Leu Ala Phe Ala Ile His Pro Gly Ala Val Asp Thr Glu Leu
195 200 205
Ala Ser Asn Leu Pro Glu Asp Thr Lys Ala Lys Leu Val Asp Ser Pro
210 215 220
Glu Leu Cys Ala Asp Thr Ile Val Trp Leu Thr Gln Glu Lys Gln Ser
225 230 235 240
Trp Leu Ala Gly Arg Tyr Leu Ser Ala Asn Trp Asp Val Ala Glu Leu
245 250 255
Met Ala Arg Lys Glu Glu Ile Leu Gln Gly Asp Lys Leu Lys Val Lys
260 265 270
Leu Val Leu
275
<210>7
<211>390
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(390)
<400>7
atg tct gct atc cct cct gct ggc acc cct gta ggg ctc gag atc ccc 48
Met Ser Ala Ile Pro Pro Ala Gly Thr Pro Val Gly Leu Glu Ile Pro
1 5 10 15
gcc aag gac gtg gct cgc gga tct gca ttc tac aag gag gtc ttc aac 96
Ala Lys Asp Val Ala Arg Gly Ser Ala Phe Tyr Lys Glu Val Phe Asn
20 25 30
tgg act ttt gct ccc tcc act ctg ggt ttt ccc gcg cac aag ctt caa 144
Trp Thr Phe Ala Pro Ser Thr Leu Gly Phe Pro Ala His Lys Leu Gln
35 40 45
acc ttc gaa gtc ccc ggc ggc gtt ttc ccc atc gga ggc gcc atg cgt 192
Thr Phe Glu Val Pro Gly Gly Val Phe Pro Ile Gly Gly Ala Met Arg
50 55 60
ctg gcg gaa gaa atc ccg gcc ggt acg ggc gcc acc aag ctc tac ctc 240
Leu Ala Glu Glu Ile Pro Ala Gly Thr Gly Ala Thr Lys Leu Tyr Leu
65 70 75 80
tac gtg aac gac atc ggt gct gcg atg gag gcc att gaa aag cac ggc 288
Tyr Val Asn Asp Ile Gly Ala Ala Met Glu Ala Ile Glu Lys His Gly
85 90 95
ggc aag aag gcg agc gat gtt atc ccc gaa ggc aac aag ggg ctg ttc 336
Gly Lys Lys Ala Ser Asp Val Ile Pro Glu Gly Asn Lys Gly Leu Phe
100 105 110
cag tat ttc gag gac agc gag ggc aac aac tat gca atc tac act tac 384
Gln Tyr Phe Glu Asp Ser Glu Gly Asn Asn Tyr Ala Ile Tyr Thr Tyr
115 120 125
aag tga 390
Lys
<210>8
<211>129
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>8
Met Ser Ala Ile Pro Pro Ala Gly Thr Pro Val Gly Leu Glu Ile Pro
1 5 10 15
Ala Lys Asp Val Ala Arg Gly Ser Ala Phe Tyr Lys Glu Val Phe Asn
20 25 30
Trp Thr Phe Ala Pro Ser Thr Leu Gly Phe Pro Ala His Lys Leu Gln
35 40 45
Thr Phe Glu Val Pro Gly Gly Val Phe Pro Ile Gly Gly Ala Met Arg
50 55 60
Leu Ala Glu Glu Ile Pro Ala Gly Thr Gly Ala Thr Lys Leu Tyr Leu
65 70 75 80
Tyr Val Asn Asp Ile Gly Ala Ala Met Glu Ala Ile Glu Lys His Gly
85 90 95
Gly Lys Lys Ala Ser Asp Val Ile Pro Glu Gly Asn Lys Gly Leu Phe
100 105 110
Gln Tyr Phe Glu Asp Ser Glu Gly Asn Asn Tyr Ala Ile Tyr Thr Tyr
115 120 125
Lys
<210>9
<211>873
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(873)
<400>9
atg cct cct atc atc cat tgc gtc cgc cac gcc cag gca atc cac aat 48
Met Pro Pro Ile Ile His Cys Val Arg His Ala Gln Ala Ile His Asn
1 5 10 15
ctc tct gtc gca aac cac gtt atc ccc gat ccc atc ctc aca gat ctg 96
Leu Ser Val Ala Asn His Val Ile Pro Asp Pro Ile Leu Thr Asp Leu
20 25 30
ggc aac gaa caa tgc cgc aag ctc cgt gag aag ttt cct tac cat tcc 144
Gly Asn Glu Gln Cys Arg Lys Leu Arg Glu Lys Phe Pro Tyr His Ser
35 40 45
gac gtg gaa ttg gtc gtc tcc tcg ccc ctg cgt cgc acg atc gcc aca 192
Asp Val Glu Leu Val Val Ser Ser Pro Leu Arg Arg Thr Ile Ala Thr
50 55 60
agc ctc cag ggc ttc gag ccc gtc ttc cag tcg cgg gag ggg ctg aag 240
Ser Leu Gln Gly Phe Glu Pro Val Phe Gln Ser Arg Glu Gly Leu Lys
65 70 75 80
ttg atc gtt cat ccg gat ctc cag gag acg agc gat gtt ccc tgt gat 288
Leu Ile Val His Pro Asp Leu Gln Glu Thr Ser Asp Val Pro Cys Asp
85 90 95
acg ggg agt aat ccg gag gtt ttg agg gag gag att gag aag ggt ggg 336
Thr Gly Ser Asn Pro Glu Val Leu Arg Glu Glu Ile Glu Lys Gly Gly
100 105 110
ctt ccg gtt gat ttg ggg ttg ctg ttt gat ggg tgg aac agt aag aaa 384
Leu Pro Val Asp Leu Gly Leu Leu Phe Asp Gly Trp Asn Ser Lys Lys
115 120 125
gga ccg tat gcg cct acc aac aag gag atc aag aat cga gcc cgt gct 432
Gly Pro Tyr Ala Pro Thr Asn Lys Glu Ile Lys Asn Arg Ala Arg Ala
130 135 140
gcc cgt cgg tgg ctg aag gca cgg ccg gaa aag gtg att gtt gtc gtt 480
Ala Arg Arg Trp Leu Lys Ala Arg Pro Glu Lys Val Ile Val Val Val
145 150 155 160
acc cat ggt gga ttc ttg cac tat ttc acc gag gac tgg gag gat agc 528
Thr His Gly Gly Phe Leu His Tyr Phe Thr Glu Asp Trp Glu Asp Ser
165 170 175
agt gaa tac cag ggg acc ggc tgg gcc aat acc gaa ttc cgc acg ttc 576
Ser Glu Tyr Gln Gly Thr Gly Trp Ala Asn Thr Glu Phe Arg Thr Phe
180 185 190
gag ttt gcc gat gtc gag cat aaa gat gat ctg gaa ggc tac ggg ttg 624
Glu Phe Ala Asp Val Glu His Lys Asp Asp Leu Glu Gly Tyr Gly Leu
195 200 205
gac ggt gac aac gct acg ttg atc gag acg gtt gaa tct cgc cga cgt 672
Asp Gly Asp Asn Ala Thr Leu Ile Glu Thr Val Glu Ser Arg Arg Arg
210 215 220
cgt ggg aag gat ggc aca aca ccc agc cgt gag cag cag aag gtt ctg 720
Arg Gly Lys Asp Gly Thr Thr Pro Ser Arg Glu Gln Gln Lys Val Leu
225 230 235 240
tac aag ctt ggg gtt cag ggc tgg gat aac cag ggc ctc gcg ctt agt 768
Tyr Lys Leu Gly Val Gln Gly Trp Asp Asn Gln Gly Leu Ala Leu Ser
245 250 255
gta gct gag aga gag aag acc aag gtg cct caa ggc gag gag gca caa 816
Val Ala Glu Arg Glu Lys Thr Lys Val Pro Gln Gly Glu Glu Ala Gln
260 265 270
agc aag cag tca gca atg cgt aat ctc gat gct gga ttc tcc att gaa 864
Ser Lys Gln Ser Ala Met Arg Asn Leu Asp Ala Gly Phe Ser Ile Glu
275 280 285
aaa gaa tag 873
Lys Glu
290
<210>10
<211>290
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>10
Met Pro Pro Ile Ile His Cys Val Arg His Ala Gln Ala Ile His Asn
1 5 10 15
Leu Ser Val Ala Asn His Val Ile Pro Asp Pro Ile Leu Thr Asp Leu
20 25 30
Gly Asn Glu Gln Cys Arg Lys Leu Arg Glu Lys Phe Pro Tyr His Ser
35 40 45
Asp Val Glu Leu Val Val Ser Ser Pro Leu Arg Arg Thr Ile Ala Thr
50 55 60
Ser Leu Gln Gly Phe Glu Pro Val Phe Gln Ser Arg Glu Gly Leu Lys
65 70 75 80
Leu Ile Val His Pro Asp Leu Gln Glu Thr Ser Asp Val Pro Cys Asp
85 90 95
Thr Gly Ser Asn Pro Glu Val Leu Arg Glu Glu Ile Glu Lys Gly Gly
100 105 110
Leu Pro Val Asp Leu Gly Leu Leu Phe Asp Gly Trp Asn Ser Lys Lys
115 120 125
Gly Pro Tyr Ala Pro Thr Asn Lys Glu Ile Lys Asn Arg Ala Arg Ala
130 135 140
Ala Arg Arg Trp Leu Lys Ala Arg Pro Glu Lys Val Ile Val Val Val
145 150 155 160
Thr His Gly Gly Phe Leu His Tyr Phe Thr Glu Asp Trp Glu Asp Ser
165 170 175
Ser Glu Tyr Gln Gly Thr Gly Trp Ala Asn Thr Glu Phe Arg Thr Phe
180 185 190
Glu Phe Ala Asp Val Glu His Lys Asp Asp Leu Glu Gly Tyr Gly Leu
195 200 205
Asp Gly Asp Asn Ala Thr Leu Ile Glu Thr Val Glu Ser Arg Arg Arg
210 215 220
Arg Gly Lys Asp Gly Thr Thr Pro Ser Arg Glu Gln Gln Lys Val Leu
225 230 235 240
Tyr Lys Leu Gly Val Gln Gly Trp Asp Asn Gln Gly Leu Ala Leu Ser
245 250 255
Val Ala Glu Arg Glu Lys Thr Lys Val Pro Gln Gly Glu Glu Ala Gln
260 265 270
Ser Lys Gln Ser Ala Met Arg Asn Leu Asp Ala Gly Phe Ser Ile Glu
275 280 285
Lys Glu
290
<210>11
<211>2061
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(2061)
<400>11
atg aga atc tac cca gta cgt cat cgc cgt ccg tct tgg aga gaa tac 48
Met Arg Ile Tyr Pro Val Arg His Arg Arg Pro Ser Trp Arg Glu Tyr
1 5 10 15
gaa gtc ccg aat tca cac ggg tca aac gtc gag tac ttg cta aca tca 96
Glu Val Pro Asn Ser His Gly Ser Asn Val Glu Tyr Leu Leu Thr Ser
20 25 30
aca acc gca gcc ttc tac cta aac aaa gcc aac aag cgt cgg gag aat 144
Thr Thr Ala Ala Phe Tyr Leu Asn Lys Ala Asn Lys Arg Arg Glu Asn
35 40 45
aac acc atc tac ttc acc tac cgt ggg agc gac ccc gaa ccc gcc tat 192
Asn Thr Ile Tyr Phe Thr Tyr Arg Gly Ser Asp Pro Glu Pro Ala Tyr
50 55 60
tcc ttg cgc tac ccc gac ctc tcg tcc ccg cag tcc aag aac cgc tat 240
Ser Leu Arg Tyr Pro Asp Leu Ser Ser Pro Gln Ser Lys Asn Arg Tyr
65 70 75 80
gcc gcc gcc ctg ttc gat ccc tat gtc ccc agt atc gtc tac gga gag 288
Ala Ala Ala Leu Phe Asp Pro Tyr Val Pro Ser Ile Val Tyr Gly Glu
85 90 95
gtc ctg ctg atc ccg gaa tgg act cga ccg acc ttg tcc gct gaa gca 336
Val Leu Leu Ile Pro Glu Trp Thr Arg Pro Thr Leu Ser Ala Glu Ala
100 105 110
atc cgc cag aat gga ggc atc ccg ccc ccg ccg gaa ccc atc ctc cca 384
Ile Arg Gln Asn Gly Gly Ile Pro Pro Pro Pro Glu Pro Ile Leu Pro
115 120 125
tcc cag ttt acc atc cag ctc tac aat cct gac cag cag gta aca gtg 432
Ser Gln Phe Thr Ile Gln Leu Tyr Asn Pro Asp Gln Gln Val Thr Val
130 135 140
cat tac aag ccc aag tcg tgg aat tcg cct gca acc tgg gct ttc gag 480
His Tyr Lys Pro Lys Ser Trp Asn Ser Pro Ala Thr Trp Ala Phe Glu
145 150 155 160
atg ccc cag cgc tca ttc cgt cag ccg tcg agc tcg aag ctg gat cgc 528
Met Pro Gln Arg Ser Phe Arg Gln Pro Ser Ser Ser Lys Leu Asp Arg
165 170 175
acg caa agc gac cct gcc gtg tcc gag tac acg ccg aag ttg aaa ttc 576
Thr Gln Ser Asp Pro Ala Val Ser Glu Tyr Thr Pro Lys Leu Lys Phe
180 185 190
agc tgg cgt agg gat ggg aag ttg acc aag gat ctc tcc tgt ctc ctg 624
Ser Trp Arg Arg Asp Gly Lys Leu Thr Lys Asp Leu Ser Cys Leu Leu
195 200 205
tca ggg atg aca aca acc tcc ctt gta gag ccg aag acg aag agg aag 672
Ser Gly Met Thr Thr Thr Ser Leu Val Glu Pro Lys Thr Lys Arg Lys
210 215 220
gag cct gat atc acg atc tcg ttc ttc cgg agt ctg cgg gag atc acg 720
Glu Pro Asp Ile Thr Ile Ser Phe Phe Arg Ser Leu Arg Glu Ile Thr
225 230 235 240
ctg tat gag cct aat ctc tac cgg gtg gag atg gag gac ttc aag gga 768
Leu Tyr Glu Pro Asn Leu Tyr Arg Val Glu Met Glu Asp Phe Lys Gly
245 250 255
ctg gaa ttg gtt ctt atg ctg ggc gca gtc gtc atc cgg gac gtc tat 816
Leu Glu Leu Val Leu Met Leu Gly Ala Val Val Ile Arg Asp Val Tyr
260 265 270
ttc agt ccg ctc aga gag gcc ttt aat gtt tct gat ccg ccg acc ggt 864
Phe Ser Pro Leu Arg Glu Ala Phe Asn Val Ser Asp Pro Pro Thr Gly
275 280 285
gct ggc aag gtg aag gac gct gct gca aag cca gca gcg acc agc cct 912
Ala Gly Lys Val Lys Asp Ala Ala Ala Lys Pro Ala Ala Thr Ser Pro
290 295 300
aca gga tca tct cca ctg gac gga cca gtg gcg tct ggc gca ttg aat 960
Thr Gly Ser Ser Pro Leu Asp Gly Pro Val Ala Ser Gly Ala Leu Asn
305 310 315 320
gga ggc cct tcc ccc aaa ccg gat cgt ccg cag aga cct cat atc aca 1008
Gly Gly Pro Ser Pro Lys Pro Asp Arg Pro Gln Arg Pro His Ile Thr
325 330 335
atc cca cag gag aaa cca caa cga ccg ccg tcg ccc gtg gac ata cgg 1056
Ile Pro Gln Glu Lys Pro Gln Arg Pro Pro Ser Pro Val Asp Ile Arg
340 345 350
tcg cag gag gaa atc caa gcc gag aaa gtc cgt atg caa cag cga agg 1104
Ser Gln Glu Glu Ile Gln Ala Glu Lys Val Arg Met Gln Gln Arg Arg
355 360 365
gaa tgg gcg gcg cag gag gaa cag cgt cgc acg cga aaa ttg cta gaa 1152
Glu Trp Ala Ala Gln Glu Glu Gln Arg Arg Thr Arg Lys Leu Leu Glu
370 375 380
gcg gaa gag aag gcc aga cgc cgg cga cag gtg gaa gtc gac aag gag 1200
Ala Glu Glu Lys Ala Arg Arg Arg Arg Gln Val Glu Val Asp Lys Glu
385 390 395 400
acg aaa cga ctg cag aag ctc tac ggc gag gaa gag cgg aga gtc ctg 1248
Thr Lys Arg Leu Gln Lys Leu Tyr Gly Glu Glu Glu Arg Arg Val Leu
405 410 415
gag cag cag cga cta cag cat tca tca cca gga aaa ccg gct act ccg 1296
Glu Gln Gln Arg Leu Gln His Ser Ser Pro Gly Lys Pro Ala Thr Pro
420 425 430
ccg cgc agc agt cag aac aac tgt ctc cag cag cct cag cct cag cat 1344
Pro Arg Ser Ser Gln Asn Asn Cys Leu Gln Gln Pro Gln Pro Gln His
435 440 445
cag ata tac cgg cac cac aac tca gcg tcc gta gcg cat ctc aac tct 1392
Gln Ile Tyr Arg His His Asn Ser Ala Ser Val Ala His Leu Asn Ser
450 455 460
tcg agt ccc tat ctg caa ggg ccg tta tcg cac tcc tcc gtc cag ctc 1440
Ser Ser Pro Tyr Leu Gln Gly Pro Leu Ser His Ser Ser Val Gln Leu
465 470 475 480
ttc caa ccg cca cac ttg gct gcg tcg atg cac agc aac ggc aat gga 1488
Phe Gln Pro Pro His Leu Ala Ala Ser Met His Ser Asn Gly Asn Gly
485 490 495
ttg acg ccg cag atg aaa aag aaa agc agt ttc ttt gga ttt cgg aag 1536
Leu Thr Pro Gln Met Lys Lys Lys Ser Ser Phe Phe Gly Phe Arg Lys
500 505 510
tct ccc gac gag gct aaa ctt agc aaa aag cgg agt tcc att ggg tta 1584
Ser Pro Asp Glu Ala Lys Leu Ser Lys Lys Arg Ser Ser Ile Gly Leu
515 520 525
aca ata ccc aac cgc tct ggg agc atg gaa tgt gac cga tcc aga agg 1632
Thr Ile Pro Asn Arg Ser Gly Ser Met Glu Cys Asp Arg Ser Arg Arg
530 535 540
atg cag ggg ctg gcc atc agc cct gca agc tgg gca tgc aac cct tcc 1680
Met Gln Gly Leu Ala Ile Ser Pro Ala Ser Trp Ala Cys Asn Pro Ser
545 550 555 560
aac ccg tca cat ggc tcc agc agc tgc agg gtg gct gag tcc aac acg 1728
Asn Pro Ser His Gly Ser Ser Ser Cys Arg Val Ala Glu Ser Asn Thr
565 570 575
atg ctc ctt gca gcc caa tcc aat gcg tgg gat gca cat aca gcg ttc 1776
Met Leu Leu Ala Ala Gln Ser Asn Ala Trp Asp Ala His Thr Ala Phe
580 585 590
cag cgt gag cca tat cgt cac cac cat cat ctg tgc ccg ctt ggc gca 1824
Gln Arg Glu Pro Tyr Arg His His His His Leu Cys Pro Leu Gly Ala
595 600 605
gtg ggt ctg cac tgt tca agc cag gta tat cca tct cga aag tcg gca 1872
Val Gly Leu His Cys Ser Ser Gln Val Tyr Pro Ser Arg Lys Ser Ala
610 615 620
tgt gct tcc agg tcc tgc ttc cag gtc cgg ctc ctt ggg cat ccg gat 1920
Cys Ala Ser Arg Ser Cys Phe Gln Val Arg Leu Leu Gly His Pro Asp
625 630 635 640
gca gct ttg cat gat tat ctg gtg gag aga aga gaa ggg aag ttg atc 1968
Ala Ala Leu His Asp Tyr Leu Val Glu Arg Arg Glu Gly Lys Leu Ile
645 650 655
atc gtc cct gaa aga aaa gaa ctc aca cta gag cac ctg ctg tac cga 2016
Ile Val Pro Glu Arg Lys Glu Leu Thr Leu Glu His Leu Leu Tyr Arg
660 665 670
ata acg aat gga gaa gga aag ggg aga cgc gag ttg ctt caa tag 2061
Ile Thr Asn Gly Glu Gly Lys Gly Arg Arg Glu Leu Leu Gln
675 680 685
<210>12
<211>686
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>12
Met Arg Ile Tyr Pro Val Arg His Arg Arg Pro Ser Trp Arg Glu Tyr
1 5 10 15
Glu Val Pro Asn Ser His Gly Ser Asn Val Glu Tyr Leu Leu Thr Ser
20 25 30
Thr Thr Ala Ala Phe Tyr Leu Asn Lys Ala Asn Lys Arg Arg Glu Asn
35 40 45
Asn Thr Ile Tyr Phe Thr Tyr Arg Gly Ser Asp Pro Glu Pro Ala Tyr
50 55 60
Ser Leu Arg Tyr Pro Asp Leu Ser Ser Pro Gln Ser Lys Asn Arg Tyr
65 70 75 80
Ala Ala Ala Leu Phe Asp Pro Tyr Val Pro Ser Ile Val Tyr Gly Glu
85 90 95
Val Leu Leu Ile Pro Glu Trp Thr Arg Pro Thr Leu Ser Ala Glu Ala
100 105 110
Ile Arg Gln Asn Gly Gly Ile Pro Pro Pro Pro Glu Pro Ile Leu Pro
115 120 125
Ser Gln Phe Thr Ile Gln Leu Tyr Asn Pro Asp Gln Gln Val Thr Val
130 135 140
His Tyr Lys Pro Lys Ser Trp Asn Ser Pro Ala Thr Trp Ala Phe Glu
145 150 155 160
Met Pro Gln Arg Ser Phe Arg Gln Pro Ser Ser Ser Lys Leu Asp Arg
165 170 175
Thr Gln Ser Asp Pro Ala Val Ser Glu Tyr Thr Pro Lys Leu Lys Phe
180 185 190
Ser Trp Arg Arg Asp Gly Lys Leu Thr Lys Asp Leu Ser Cys Leu Leu
195 200 205
Ser Gly Met Thr Thr Thr Ser Leu Val Glu Pro Lys Thr Lys Arg Lys
210 215 220
Glu Pro Asp Ile Thr Ile Ser Phe Phe Arg Ser Leu Arg Glu Ile Thr
225 230 235 240
Leu Tyr Glu Pro Asn Leu Tyr Arg Val Glu Met Glu Asp Phe Lys Gly
245 250 255
Leu Glu Leu Val Leu Met Leu Gly Ala Val Val Ile Arg Asp Val Tyr
260 265 270
Phe Ser Pro Leu Arg Glu Ala Phe Asn Val Ser Asp Pro Pro Thr Gly
275 280 285
Ala Gly Lys Val Lys Asp Ala Ala Ala Lys Pro Ala Ala Thr Ser Pro
290 295 300
Thr Gly Ser Ser Pro Leu Asp Gly Pro Val Ala Ser Gly Ala Leu Asn
305 310 315 320
Gly Gly Pro Ser Pro Lys Pro Asp Arg Pro Gln Arg Pro His Ile Thr
325 330 335
Ile Pro Gln Glu Lys Pro Gln Arg Pro Pro Ser Pro Val Asp Ile Arg
340 345 350
Ser Gln Glu Glu Ile Gln Ala Glu Lys Val Arg Met Gln Gln Arg Arg
355 360 365
Glu Trp Ala Ala Gln Glu Glu Gln Arg Arg Thr Arg Lys Leu Leu Glu
370 375 380
Ala Glu Glu Lys Ala Arg Arg Arg Arg Gln Val Glu Val Asp Lys Glu
385 390 395 400
Thr Lys Arg Leu Gln Lys Leu Tyr Gly Glu Glu Glu Arg Arg Val Leu
405 410 415
Glu Gln Gln Arg Leu Gln His Ser Ser Pro Gly Lys Pro Ala Thr Pro
420 425 430
Pro Arg Ser Ser Gln Asn Asn Cys Leu Gln Gln Pro Gln Pro Gln His
435 440 445
Gln Ile Tyr Arg His His Asn Ser Ala Ser Val Ala His Leu Asn Ser
450 455 460
Ser Ser Pro Tyr Leu Gln Gly Pro Leu Ser His Ser Ser Val Gln Leu
465 470 475 480
Phe Gln Pro Pro His Leu Ala Ala Ser Met His Ser Asn Gly Asn Gly
485 490 495
Leu Thr Pro Gln Met Lys Lys Lys Ser Ser Phe Phe Gly Phe Arg Lys
500 505 510
Ser Pro Asp Glu Ala Lys Leu Ser Lys Lys Arg Ser Ser Ile Gly Leu
515 520 525
Thr Ile Pro Asn Arg Ser Gly Ser Met Glu Cys Asp Arg Ser Arg Arg
530 535 540
Met Gln Gly Leu Ala Ile Ser Pro Ala Ser Trp Ala Cys Asn Pro Ser
545 550 555 560
Asn Pro Ser His Gly Ser Ser Ser Cys Arg Val Ala Glu Ser Asn Thr
565 570 575
Met Leu Leu Ala Ala Gln Ser Asn Ala Trp Asp Ala His Thr Ala Phe
580 585 590
Gln Arg Glu Pro Tyr Arg His His His His Leu Cys Pro Leu Gly Ala
595 600 605
Val Gly Leu His Cys Ser Ser Gln Val Tyr Pro Ser Arg Lys Ser Ala
610 615 620
Cys Ala Ser Arg Ser Cys Phe Gln Val Arg Leu Leu Gly His Pro Asp
625 630 635 640
Ala Ala Leu His Asp Tyr Leu Val Glu Arg Arg Glu Gly Lys Leu Ile
645 650 655
Ile Val Pro Glu Arg Lys Glu Leu Thr Leu Glu His Leu Leu Tyr Arg
660 665 670
Ile Thr Asn Gly Glu Gly Lys Gly Arg Arg Glu Leu Leu Gln
675 680 685
<210>13
<211>2139
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(2139)
<400>13
atg ttg gct cat ccc atc agc cat gtg aca gca gca aca aca tcc gca 48
Met Leu Ala His Pro Ile Ser His Val Thr Ala Ala Thr Thr Ser Ala
1 5 10 15
tca cca tca cca tcg cac ccc agc ccg gtc cgc gag act tct tct gaa 96
Ser Pro Ser Pro Ser His Pro Ser Pro Val Arg Glu Thr Ser Ser Glu
20 25 30
cct gac tcc gac tcc gac tcc gac tcc ggg cca tcg cgc acg tca gga 144
Pro Asp Ser Asp Ser Asp Ser Asp Ser Gly Pro Ser Arg Thr Ser Gly
35 40 45
gtt gcc gtc atc gct gcc gcc gcc tca tcc tcg aca gtg gag atg gat 192
Val Ala Val Ile Ala Ala Ala Ala Ser Ser Ser Thr Val Glu Met Asp
50 55 60
gcc tcg gat tcc gaa atg aca gat gcg ccg gag gat gaa ggg ggc gtc 240
Ala Ser Asp Ser Glu Met Thr Asp Ala Pro Glu Asp Glu Gly Gly Val
65 70 75 80
gct atc ggc ccc tac ttg gac ccg cac tat cac agc atg gag gcc atc 288
Ala Ile Gly Pro Tyr Leu Asp Pro His Tyr His Ser Met Glu Ala Ile
85 90 95
atg cag cag ctg ggt gat gca acg ggg cac aca tcc atc gat tca act 336
Met Gln Gln Leu Gly Asp Ala Thr Gly His Thr Ser Ile Asp Ser Thr
100 105 110
gcg tcg gac aac aat gac gat gac gac gac gac gac gac atg gca gat 384
Ala Ser Asp Asn Asn Asp Asp Asp Asp Asp Asp Asp Asp Met Ala Asp
115 120 125
cag cat cgc cgc cat gta tac tcg ggt gag gaa ctg ccg gta tct ggt 432
Gln His Arg Arg His Val Tyr Ser Gly Glu Glu Leu Pro Val Ser Gly
130 135 140
gtt gaa tca tcg tct gct cca tgg ggc ctg cat gat atc ctc gat ggc 480
Val Glu Ser Ser Ser Ala Pro Trp Gly Leu His Asp Ile Leu Asp Gly
145 150 155 160
tat ggt cac gtc cac tcg aac ttg tct gat gtc gat gtc gat gat ttc 528
Tyr Gly His Val His Ser Asn Leu Ser Asp Val Asp Val Asp Asp Phe
165 170 175
tac cga gcc ttt aac tac gac gcc ttg ccc cag ctt ccg ggc acc tct 576
Tyr Arg Ala Phe Asn Tyr Asp Ala Leu Pro Gln Leu Pro Gly Thr Ser
180 185 190
gcg aca cct ggt cac gac gat cat gct tac tcc tac gat gat gac tcg 624
Ala Thr Pro Gly His Asp Asp His Ala Tyr Ser Tyr Asp Asp Asp Ser
195 200 205
acc gct ggt ctg ggc gca gat gac cac ttt cct gct tct gtc gta cat 672
Thr Ala Gly Leu Gly Ala Asp Asp His Phe Pro Ala Ser Val Val His
210 215 220
gga aca acg tct gaa cga aat ctc acg gtc gac cag ttt att gac caa 720
Gly Thr Thr Ser Glu Arg Asn Leu Thr Val Asp Gln Phe Ile Asp Gln
225 230 235 240
tgg ctt atc caa tca tcc tcc aca cac ctt cgg tac ttt ctt cca cct 768
Trp Leu Ile Gln Ser Ser Ser Thr His Leu Arg Tyr Phe Leu Pro Pro
245 250 255
aaa cca tcc att ccg caa cgc tac ggc ttg gcc ctt ttg aac tgg gcg 816
Lys Pro Ser Ile Pro Gln Arg Tyr Gly Leu Ala Leu Leu Asn Trp Ala
260 265 270
cct cct ccg agg ata ctt cgg ccc agt cgg tac ccc gat gca tcc tac 864
Pro Pro Pro Arg Ile Leu Arg Pro Ser Arg Tyr Pro Asp Ala Ser Tyr
275 280 285
gat atc caa cag atc ccg tgg tgg gaa gtt atg cgc gtc aag aga ccc 912
Asp Ile Gln Gln Ile Pro Trp Trp Glu Val Met Arg Val Lys Arg Pro
290 295 300
cag gtc cgg gcc ttg cgc gac gcc tgt tat acg tca tat cat aat ctg 960
Gln Val Arg Ala Leu Arg Asp Ala Cys Tyr Thr Ser Tyr His Asn Leu
305 310 315 320
gag tac tcc cct gga gta aga cgt gtc tcg tac ctt ggt caa atg gct 1008
Glu Tyr Ser Pro Gly Val Arg Arg Val Ser Tyr Leu Gly Gln Met Ala
325 330 335
ccg cct gac gac gag act ttc ttt cgg ggc aag tcc atg cac acg aag 1056
Pro Pro Asp Asp Glu Thr Phe Phe Arg Gly Lys Ser Met His Thr Lys
340 345 350
cac agg gca acc atc gaa cac ttt cag ctt cgg aat ctc atg tcg gtc 1104
His Arg Ala Thr Ile Glu His Phe Gln Leu Arg Asn Leu Met Ser Val
355 360 365
gtc tca cac aac acc gtg gaa ttt gct cat gag tca aaa ctg tat tcc 1152
Val Ser His Asn Thr Val Glu Phe Ala His Glu Ser Lys Leu Tyr Ser
370 375 380
tgg gtc ccc ggc tac gat gat ctg gtc tgc ttg att gat ctt tcc aaa 1200
Trp Val Pro Gly Tyr Asp Asp Leu Val Cys Leu Ile Asp Leu Ser Lys
385 390 395 400
cca tcc gtg gaa tcc tgc ttc cag tgc ccc gtc aag atc tcc aca atg 1248
Pro Ser Val Glu Ser Cys Phe Gln Cys Pro Val Lys Ile Ser Thr Met
405 410 415
agc tcc cgc cac ggc gtc tcg att gct ggt ggc ttc tgc ggc gaa tat 1296
Ser Ser Arg His Gly Val Ser Ile Ala Gly Gly Phe Cys Gly Glu Tyr
420 425 430
gca ttg cgt gca gca ggc acg gac gag cca gct gcg gag ggc tac gtg 1344
Ala Leu Arg Ala Ala Gly Thr Asp Glu Pro Ala Ala Glu Gly Tyr Val
435 440 445
acg aag gat ttc aat ggc atc aca acc cac atc gac atc gtc aaa cac 1392
Thr Lys Asp Phe Asn Gly Ile Thr Thr His Ile Asp Ile Val Lys His
450 455 460
cgg acc agt cgg tct cct acg gcg atc gtt gct tcc aat gac aag cac 1440
Arg Thr Ser Arg Ser Pro Thr Ala Ile Val Ala Ser Asn Asp Lys His
465 470 475 480
ctg cgg gtt ctg gac tgc gag tcc aac cag ttt gtg gcg gat tat gcc 1488
Leu Arg Val Leu Asp Cys Glu Ser Asn Gln Phe Val Ala Asp Tyr Ala
485 490 495
ctc ttc tgc gcg gtc aac tgc acc gct acg tca ccc gac ggt cgc ctt 1536
Leu Phe Cys Ala Val Asn Cys Thr Ala Thr Ser Pro Asp Gly Arg Leu
500 505 510
cgt gca gtc gtc ggc gac tcc ccg gac gca tgg gtc atc gaa gcc gac 1584
Arg Ala Val Val Gly Asp Ser Pro Asp Ala Trp Val Ile Glu Ala Asp
515 520 525
acc gga cga cct gtc cat ccg ctc cgt ggc cac cgt gat ttc gga ttc 1632
Thr Gly Arg Pro Val His Pro Leu Arg Gly His Arg Asp Phe Gly Phe
530 535 540
gct tgt gcc tgg tcc ccg gac atg cgt cac atc gcg acc ggt aat cag 1680
Ala Cys Ala Trp Ser Pro Asp Met Arg His Ile Ala Thr Gly Asn Gln
545 550 555 560
gac aaa acc gtg atc atc tgg gat gcg cgc acc tgg cgc atc ctc aag 1728
Asp Lys Thr Val Ile Ile Trp Asp Ala Arg Thr Trp Arg Ile Leu Lys
565 570 575
acg atc gaa tcc gac gtg gca ggc tac cga tcc ctc cga ttc tcc ccc 1776
Thr Ile Glu Ser Asp Val Ala Gly Tyr Arg Ser Leu Arg Phe Ser Pro
580 585 590
gtc ggt ggg ggt cct ccg acg tta ctg ttg tgc gaa ccc gcg gac cgg 1824
Val Gly Gly Gly Pro Pro Thr Leu Leu Leu Cys Glu Pro Ala Asp Arg
595 600 605
atc gcc att gtc gac gca cag acg tac caa agt cga cag gtc cat gat 1872
Ile Ala Ile Val Asp Ala Gln Thr Tyr Gln Ser Arg Gln Val His Asp
610 615 620
ttc ttc ggg gag atc ggg ggc gcc gat tac acg ccc gat ggc ggg gct 1920
Phe Phe Gly Glu Ile Gly Gly Ala Asp Tyr Thr Pro Asp Gly Gly Ala
625 630 635 640
atc tgg gtg gcc aac acc gat cag cat ttc ggg ggg ttc atg gag tat 1968
Ile Trp Val Ala Asn Thr Asp Gln His Phe Gly Gly Phe Met Glu Tyr
645 650 655
gaa cga tgg cag tgg ggg aag aga ctt ggt ctc agt gat ctg ccg aat 2016
Glu Arg Trp Gln Trp Gly Lys Arg Leu Gly Leu Ser Asp Leu Pro Asn
660 665 670
gag tgg tta ccg gag tcg gag ttg gag aat gat gag cga tgc gtt ctg 2064
Glu Trp Leu Pro Glu Ser Glu Leu Glu Asn Asp Glu Arg Cys Val Leu
675 680 685
agc gca cga gag aga cag atg aga ttc tgc agg aat cta acc gat gaa 2112
Ser Ala Arg Glu Arg Gln Met Arg Phe Cys Arg Asn Leu Thr Asp Glu
690 695 700
gaa cat gat gag ttg ttg cta gcc tga 2139
Glu His Asp Glu Leu Leu Leu Ala
705 710
<210>14
<211>712
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>14
Met Leu Ala His Pro Ile Ser His Val Thr Ala Ala Thr Thr Ser Ala
1 5 10 15
Ser Pro Ser Pro Ser His Pro Ser Pro Val Arg Glu Thr Ser Ser Glu
20 25 30
Pro Asp Ser Asp Ser Asp Ser Asp Ser Gly Pro Ser Arg Thr Ser Gly
35 40 45
Val Ala Val Ile Ala Ala Ala Ala Ser Ser Ser Thr Val Glu Met Asp
50 55 60
Ala Ser Asp Ser Glu Met Thr Asp Ala Pro Glu Asp Glu Gly Gly Val
65 70 75 80
Ala Ile Gly Pro Tyr Leu Asp Pro His Tyr His Ser Met Glu Ala Ile
85 90 95
Met Gln Gln Leu Gly Asp Ala Thr Gly His Thr Ser Ile Asp Ser Thr
100 105 110
Ala Ser Asp Asn Asn Asp Asp Asp Asp Asp Asp Asp Asp Met Ala Asp
115 120 125
Gln His Arg Arg His Val Tyr Ser Gly Glu Glu Leu Pro Val Ser Gly
130 135 140
Val Glu Ser Ser Ser Ala Pro Trp Gly Leu His Asp Ile Leu Asp Gly
145 150 155 160
Tyr Gly His Val His Ser Asn Leu Ser Asp Val Asp Val Asp Asp Phe
165 170 175
Tyr Arg Ala Phe Asn Tyr Asp Ala Leu Pro Gln Leu Pro Gly Thr Ser
180 185 190
Ala Thr Pro Gly His Asp Asp His Ala Tyr Ser Tyr Asp Asp Asp Ser
195 200 205
Thr Ala Gly Leu Gly Ala Asp Asp His Phe Pro Ala Ser Val Val His
210 215 220
Gly Thr Thr Ser Glu Arg Asn Leu Thr Val Asp Gln Phe Ile Asp Gln
225 230 235 240
Trp Leu Ile Gln Ser Ser Ser Thr His Leu Arg Tyr Phe Leu Pro Pro
245 250 255
Lys Pro Ser Ile Pro Gln Arg Tyr Gly Leu Ala Leu Leu Asn Trp Ala
260 265 270
Pro Pro Pro Arg Ile Leu Arg Pro Ser Arg Tyr Pro Asp Ala Ser Tyr
275 280 285
Asp Ile Gln Gln Ile Pro Trp Trp Glu Val Met Arg Val Lys Arg Pro
290 295 300
Gln Val Arg Ala Leu Arg Asp Ala Cys Tyr Thr Ser Tyr His Asn Leu
305 310 315 320
Glu Tyr Ser Pro Gly Val Arg Arg Val Ser Tyr Leu Gly Gln Met Ala
325 330 335
Pro Pro Asp Asp Glu Thr Phe Phe Arg Gly Lys Ser Met His Thr Lys
340 345 350
His Arg Ala Thr Ile Glu His Phe Gln Leu Arg Asn Leu Met Ser Val
355 360 365
Val Ser His Asn Thr Val Glu Phe Ala His Glu Ser Lys Leu Tyr Ser
370 375 380
Trp Val Pro Gly Tyr Asp Asp Leu Val Cys Leu Ile Asp Leu Ser Lys
385 390 395 400
Pro Ser Val Glu Ser Cys Phe Gln Cys Pro Val Lys Ile Ser Thr Met
405 410 415
Ser Ser Arg His Gly Val Ser Ile Ala Gly Gly Phe Cys Gly Glu Tyr
420 425 430
Ala Leu Arg Ala Ala Gly Thr Asp Glu Pro Ala Ala Glu Gly Tyr Val
435 440 445
Thr Lys Asp Phe Asn Gly Ile Thr Thr His Ile Asp Ile Val Lys His
450 455 460
Arg Thr Ser Arg Ser Pro Thr Ala Ile Val Ala Ser Asn Asp Lys His
465 470 475 480
Leu Arg Val Leu Asp Cys Glu Ser Asn Gln Phe Val Ala Asp Tyr Ala
485 490 495
Leu Phe Cys Ala Val Asn Cys Thr Ala Thr Ser Pro Asp Gly Arg Leu
500 505 510
Arg Ala Val Val Gly Asp Ser Pro Asp Ala Trp Val Ile Glu Ala Asp
515 520 525
Thr Gly Arg Pro Val His Pro Leu Arg Gly His Arg Asp Phe Gly Phe
530 535 540
Ala Cys Ala Trp Ser Pro Asp Met Arg His Ile Ala Thr Gly Asn Gln
545 550 555 560
Asp Lys Thr Val Ile Ile Trp Asp Ala Arg Thr Trp Arg Ile Leu Lys
565 570 575
Thr Ile Glu Ser Asp Val Ala Gly Tyr Arg Ser Leu Arg Phe Ser Pro
580 585 590
Val Gly Gly Gly Pro Pro Thr Leu Leu Leu Cys Glu Pro Ala Asp Arg
595 600 605
Ile Ala Ile Val Asp Ala Gln Thr Tyr Gln Ser Arg Gln Val His Asp
610 615 620
Phe Phe Gly Glu Ile Gly Gly Ala Asp Tyr Thr Pro Asp Gly Gly Ala
625 630 635 640
Ile Trp Val Ala Asn Thr Asp Gln His Phe Gly Gly Phe Met Glu Tyr
645 650 655
Glu Arg Trp Gln Trp Gly Lys Arg Leu Gly Leu Ser Asp Leu Pro Asn
660 665 670
Glu Trp Leu Pro Glu Ser Glu Leu Glu Asn Asp Glu Arg Cys Val Leu
675 680 685
Ser Ala Arg Glu Arg Gln Met Arg Phe Cys Arg Asn Leu Thr Asp Glu
690 695 700
Glu His Asp Glu Leu Leu Leu Ala
705 710
<210>15
<211>282
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(282)
<400>15
atg gcg ttt aca tca gcg aca gat atc ata ttg ggc gtg ggt gca tca 48
Met Ala Phe Thr Ser Ala Thr Asp Ile Ile Leu Gly Val Gly Ala Ser
1 5 10 15
tcc ggg gta aca agt gtg tcg gac tcg ccc ccg agt tcc ccg aag ccg 96
Ser Gly Val Thr Ser Val Ser Asp Ser Pro Pro Ser Ser Pro Lys Pro
20 25 30
cag cga gat gta agg cgg att tgg acg aga ggg agg aac cgc acg ggg 144
Gln Arg Asp Val Arg Arg Ile Trp Thr Arg Gly Arg Asn Arg Thr Gly
35 40 45
agt ggg agt gac ctt ggg att ggg act ctg ctg gag gag aaa tgt agg 192
Ser Gly Ser Asp Leu Gly Ile Gly Thr Leu Leu Glu Glu Lys Cys Arg
50 55 60
att gca gga gac tgc aat tat gtg gtg tcg cgc tat ctg ggg cat cca 240
Ile Ala Gly Asp Cys Asn Tyr Val Val Ser Arg Tyr Leu Gly His Pro
65 70 75 80
ggg agg gtt gcg gtc tac att tgt tgt gag agc gat gga tga 282
Gly Arg Val Ala Val Tyr Ile Cys Cys Glu Ser Asp Gly
85 90
<210>16
<211>93
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>16
Met Ala Phe Thr Ser Ala Thr Asp Ile Ile Leu Gly Val Gly Ala Ser
1 5 10 15
Ser Gly Val Thr Ser Val Ser Asp Ser Pro Pro Ser Ser Pro Lys Pro
20 25 30
Gln Arg Asp Val Arg Arg Ile Trp Thr Arg Gly Arg Asn Arg Thr Gly
35 40 45
Ser Gly Ser Asp Leu Gly Ile Gly Thr Leu Leu Glu Glu Lys Cys Arg
50 55 60
Ile Ala Gly Asp Cys Asn Tyr Val Val Ser Arg Tyr Leu Gly His Pro
65 70 75 80
Gly Arg Val Ala Val Tyr Ile Cys Cys Glu Ser Asp Gly
85 90
<210>17
<211>1758
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(1758)
<400>17
atg gtg tgc cca att aca ctt aca aag ttc gtc ggg aca gtg tct ctc 48
Met Val Cys Pro Ile Thr Leu Thr Lys Phe Val Gly Thr Val Ser Leu
1 5 10 15
ggc ctc ttg acg ggc ctt tcc tac tcc tcc gcc aac atc acc att ccc 96
Gly Leu Leu Thr Gly Leu Ser Tyr Ser Ser Ala Asn Ile Thr Ile Pro
20 25 30
tcc ctg caa cac ctc cca acc tcc tcc acc gcc gcc cgc tgc ctc gac 144
Ser Leu Gln His Leu Pro Thr Ser Ser Thr Ala Ala Arg Cys Leu Asp
35 40 45
gag atc aag cgg tta agc cgc aaa cac gcc ctc cgg ctc gcc aac atc 192
Glu Ile Lys Arg Leu Ser Arg Lys His Ala Leu Arg Leu Ala Asn Ile
50 55 60
gcc aac ggc tgc gtc ctg atc gct ttc ttc gtc tcc cca gcc cac cgc 240
Ala Asn Gly Cys Val Leu Ile Ala Phe Phe Val Ser Pro Ala His Arg
65 70 75 80
aaa cat ccc tac ctg ctc tgg gtg tcc atg gtc tcg acc ttg ggc tcg 288
Lys His Pro Tyr Leu Leu Trp Val Ser Met Val Ser Thr Leu Gly Ser
85 90 95
tac ggc gtc gac ttc tgg ttc cac cgg gac atc ggg ttg aaa gag tgg 336
Tyr Gly Val Asp Phe Trp Phe His Arg Asp Ile Gly Leu Lys Glu Trp
100 105 110
ttt ctc agc ttg gtc cgg gat acc cag ctg ctt cct agc ggc tgt ctt 384
Phe Leu Ser Leu Val Arg Asp Thr Gln Leu Leu Pro Ser Gly Cys Leu
115 120 125
ccg gcg aag acc acc tct ccg aag aaa gaa gat gac atc gtt gtc gtc 432
Pro Ala Lys Thr Thr Ser Pro Lys Lys Glu Asp Asp Ile Val Val Val
130 135 140
gag gcg gag gat aag gtt aac ggt gag agc gta cag aag gaa atg gag 480
Glu Ala Glu Asp Lys Val Asn Gly Glu Ser Val Gln Lys Glu Met Glu
145 150 155 160
aag gaa cgt cta ctt cag aga aca aga tcg tgg ctt acg ggg act gcc 528
Lys Glu Arg Leu Leu Gln Arg Thr Arg Ser Trp Leu Thr Gly Thr Ala
165 170 175
ctt gcc atg gcg att gtc ggg ctc tgg ggg gac aga gaa ttc cgc gat 576
Leu Ala Met Ala Ile Val Gly Leu Trp Gly Asp Arg Glu Phe Arg Asp
180 185 190
tgt cat cat cgc att tcc tcg gct atc gca acc cac tct cca caa cgg 624
Cys His His Arg Ile Ser Ser Ala Ile Ala Thr His Ser Pro Gln Arg
195 200 205
ctt cag gcg agc cac aat aat tgt cac cat cac cac aac ctt tca ggg 672
Leu Gln Ala Ser His Asn Asn Cys His His His His Asn Leu Ser Gly
210 215 220
aat ttc aca act tct tgc cgc ccc ctt tca tca acc gct gca tca tct 720
Asn Phe Thr Thr Ser Cys Arg Pro Leu Ser Ser Thr Ala Ala Ser Ser
225 230 235 240
gca ggc gcg acc atc atg tct cca ttc gct tct tcg aat tct tac cac 768
Ala Gly Ala Thr Ile Met Ser Pro Phe Ala Ser Ser Asn Ser Tyr His
245 250 255
cca gat cgg ttg gcg act gtt ctg act cag aac aag gag tgg gcg aca 816
Pro Asp Arg Leu Ala Thr Val Leu Thr Gln Asn Lys Glu Trp Ala Thr
260 265 270
aaa aca gcc caa gag cat cca gat ctc ttc cca acc ctt gcg acc ggc 864
Lys Thr Ala Gln Glu His Pro Asp Leu Phe Pro Thr Leu Ala Thr Gly
275 280 285
caa agt cct gaa att gtg tgg atc gga tgc tcc gac tca cga tgc cct 912
Gln Ser Pro Glu Ile Val Trp Ile Gly Cys Ser Asp Ser Arg Cys Pro
290 295 300
gag acg acg ctc ctt ggc ctg aag cct ggc gat gtc ttc gtc cac agg 960
Glu Thr Thr Leu Leu Gly Leu Lys Pro Gly Asp Val Phe Val His Arg
305 310 315 320
aat atc gcc aac atc ctt cac cca acc gat ctc agc tcg tct gcc gtc 1008
Asn Ile Ala Asn Ile Leu His Pro Thr Asp Leu Ser Ser Ser Ala Val
325 330 335
att gaa tat gcc gta aaa caa ctt aaa gtc aag cac atc gtt ctg tgc 1056
Ile Glu Tyr Ala Val Lys Gln Leu Lys Val Lys His Ile Val Leu Cys
340 345 350
gga cac act agc tgc gga ggc gtg gcg gct gca cta gga aat aag cag 1104
Gly His Thr Ser Cys Gly Gly Val Ala Ala Ala Leu Gly Asn Lys Gln
355 360 365
ctt ggg att ctg gac cca tgg ctg ctc cca ctg cga cag att cgt gaa 1152
Leu Gly Ile Leu Asp Pro Trp Leu Leu Pro Leu Arg Gln Ile Arg Glu
370 375 380
caa cat ctg gac act ctc cag tct ctt tca act gag gag cgg gca ctg 1200
Gln His Leu Asp Thr Leu Gln Ser Leu Ser Thr Glu Glu Arg Ala Leu
385 390 395 400
aag ctg acc gaa ctg aac gta ttg gct ggg gtc aag gta ttg aag cag 1248
Lys Leu Thr Glu Leu Asn Val Leu Ala Gly Val Lys Val Leu Lys Gln
405 410 415
aag agc gcc gtg ttg gat gct att caa gag cgc ggt ctg tgc gtg cat 1296
Lys Ser Ala Val Leu Asp Ala Ile Gln Glu Arg Gly Leu Cys Val His
420 425 430
ggt ctg ata tat gat gtg gct agt ggg atg ctg cgg gag ttg gag acg 1344
Gly Leu Ile Tyr Asp Val Ala Ser Gly Met Leu Arg Glu Leu Glu Thr
435 440 445
gga gag ccg gac gag gta aac aga cca cca ggt cca ggg cga tgt tta 1392
Gly Glu Pro Asp Glu Val Asn Arg Pro Pro Gly Pro Gly Arg Cys Leu
450 455 460
cgg gac ggc gta cgg ggt gac cat acc gaa agc ggc aaa act aac ccg 1440
Arg Asp Gly Val Arg Gly Asp His Thr Glu Ser Gly Lys Thr Asn Pro
465 470 475 480
aac agg gaa tcc aca ggt gct cgc att acg gaa gta ata cag atg ctt 1488
Asn Arg Glu Ser Thr Gly Ala Arg Ile Thr Glu Val Ile Gln Met Leu
485 490 495
ctt aat cat gga gct gac gtc aat ggc cag aat tgt gga agc gca ctg 1536
Leu Asn His Gly Ala Asp Val Asn Gly Gln Asn Cys Gly Ser Ala Leu
500 505 510
gcc gct gct gct aag gca gga aat ata gaa gca gca caa ctt ctt ctt 1584
Ala Ala Ala Ala Lys Ala Gly Asn Ile Glu Ala Ala Gln Leu Leu Leu
515 520 525
gac aat ggt gca gat gtc aat gct cgg att tca gaa agt ccg ttt ttt 1632
Asp Asn Gly Ala Asp Val Asn Ala Arg Ile Ser Glu Ser Pro Phe Phe
530 535 540
gac aaa tat ggg ggt gcc ttt gtt gct gct gcc tcc tct cca gga aac 1680
Asp Lys Tyr Gly Gly Ala Phe Val Ala Ala Ala Ser Ser Pro Gly Asn
545 550 555 560
aga aga atg gtg caa cta ctt ctt gac cat gga gca gac ata aat acc 1728
Arg Arg Met Val Gln Leu Leu Leu Asp His Gly Ala Asp Ile Asn Thr
565 570 575
cat agt ctg cat att atg gca atg ccc tga 1758
His Ser Leu His Ile Met Ala Met Pro
580 585
<210>18
<211>585
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>18
Met Val Cys Pro Ile Thr Leu Thr Lys Phe Val Gly Thr Val Ser Leu
1 5 10 15
Gly Leu Leu Thr Gly Leu Ser Tyr Ser Ser Ala Asn Ile Thr Ile Pro
20 25 30
Ser Leu Gln His Leu Pro Thr Ser Ser Thr Ala Ala Arg Cys Leu Asp
35 40 45
Glu Ile Lys Arg Leu Ser Arg Lys His Ala Leu Arg Leu Ala Asn Ile
50 55 60
Ala Asn Gly Cys Val Leu Ile Ala Phe Phe Val Ser Pro Ala His Arg
65 70 75 80
Lys His Pro Tyr Leu Leu Trp Val Ser Met Val Ser Thr Leu Gly Ser
85 90 95
Tyr Gly Val Asp Phe Trp Phe His Arg Asp Ile Gly Leu Lys Glu Trp
100 105 110
Phe Leu Ser Leu Val Arg Asp Thr Gln Leu Leu Pro Ser Gly Cys Leu
115 120 125
Pro Ala Lys Thr Thr Ser Pro Lys Lys Glu Asp Asp Ile Val Val Val
130 135 140
Glu Ala Glu Asp Lys Val Asn Gly Glu Ser Val Gln Lys Glu Met Glu
145 150 155 160
Lys Glu Arg Leu Leu Gln Arg Thr Arg Ser Trp Leu Thr Gly Thr Ala
165 170 175
Leu Ala Met Ala Ile Val Gly Leu Trp Gly Asp Arg Glu Phe Arg Asp
180 185 190
Cys His His Arg Ile Ser Ser Ala Ile Ala Thr His Ser Pro Gln Arg
195 200 205
Leu Gln Ala Ser His Asn Asn Cys His His His His Asn Leu Ser Gly
210 215 220
Asn Phe Thr Thr Ser Cys Arg Pro Leu Ser Ser Thr Ala Ala Ser Ser
225 230 235 240
Ala Gly Ala Thr Ile Met Ser Pro Phe Ala Ser Ser Asn Ser Tyr His
245 250 255
Pro Asp Arg Leu Ala Thr Val Leu Thr Gln Asn Lys Glu Trp Ala Thr
260 265 270
Lys Thr Ala Gln Glu His Pro Asp Leu Phe Pro Thr Leu Ala Thr Gly
275 280 285
Gln Ser Pro Glu Ile Val Trp Ile Gly Cys Ser Asp Ser Arg Cys Pro
290 295 300
Glu Thr Thr Leu Leu Gly Leu Lys Pro Gly Asp Val Phe Val His Arg
305 310 315 320
Asn Ile Ala Asn Ile Leu His Pro Thr Asp Leu Ser Ser Ser Ala Val
325 330 335
Ile Glu Tyr Ala Val Lys Gln Leu Lys Val Lys His Ile Val Leu Cys
340 345 350
Gly His Thr Ser Cys Gly Gly Val Ala Ala Ala Leu Gly Asn Lys Gln
355 360 365
Leu Gly Ile Leu Asp Pro Trp Leu Leu Pro Leu Arg Gln Ile Arg Glu
370 375 380
Gln His Leu Asp Thr Leu Gln Ser Leu Ser Thr Glu Glu Arg Ala Leu
385 390 395 400
Lys Leu Thr Glu Leu Asn Val Leu Ala Gly Val Lys Val Leu Lys Gln
405 410 415
Lys Ser Ala Val Leu Asp Ala Ile Gln Glu Arg Gly Leu Cys Val His
420 425 430
Gly Leu Ile Tyr Asp Val Ala Ser Gly Met Leu Arg Glu Leu Glu Thr
435 440 445
Gly Glu Pro Asp Glu Val Asn Arg Pro Pro Gly Pro Gly Arg Cys Leu
450 455 460
Arg Asp Gly Val Arg Gly Asp His Thr Glu Ser Gly Lys Thr Asn Pro
465 470 475 480
Asn Arg Glu Ser Thr Gly Ala Arg Ile Thr Glu Val Ile Gln Met Leu
485 490 495
Leu Asn His Gly Ala Asp Val Asn Gly Gln Asn Cys Gly Ser Ala Leu
500 505 510
Ala Ala Ala Ala Lys Ala Gly Asn Ile Glu Ala Ala Gln Leu Leu Leu
515 520 525
Asp Asn Gly Ala Asp Val Asn Ala Arg Ile Ser Glu Ser Pro Phe Phe
530 535 540
Asp Lys Tyr Gly Gly Ala Phe Val Ala Ala Ala Ser Ser Pro Gly Asn
545 550 555 560
Arg Arg Met Val Gln Leu Leu Leu Asp His Gly Ala Asp Ile Asn Thr
565 570 575
His Ser Leu His Ile Met Ala Met Pro
580 585
<210>19
<211>999
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(999)
<400>19
atg tcc acc ccc aag tca gat tac gtc agt gac gac tgg aag gac ggc 48
Met Ser Thr Pro Lys Ser Asp Tyr Val Ser Asp Asp Trp Lys Asp Gly
1 5 10 15
ttg ttc agt gag tct tcg atc ttt tat tct cct caa gca aag cga att 96
Leu Phe Ser Glu Ser Ser Ile Phe Tyr Ser Pro Gln Ala Lys Arg Ile
20 25 30
cta att gcc aat ctc gtc gag tta gaa aat aag gtc gtc ttc tgt acc 144
Leu Ile Ala Asn Leu Val Glu Leu Glu Asn Lys Val Val Phe Cys Thr
35 40 45
ggt ggt gcg ggc acc atc tgc agc gct caa gtc cgt gcc ctg gtc cat 192
Gly Gly Ala Gly Thr Ile Cys Ser Ala Gln Val Arg Ala Leu Val His
50 55 60
cta ggt gca gat gcc tgc atc gtg ggg aga aac gtc gag aag aca gaa 240
Leu Gly Ala Asp Ala Cys Ile Val Gly Arg Asn Val Glu Lys Thr Glu
65 70 75 80
cgt gct gcc aag gac att gcg agt gtt aga gcc ggc gcg aga gtc atc 288
Arg Ala Ala Lys Asp Ile Ala Ser Val Arg Ala Gly Ala Arg Val Ile
85 90 95
ggt att ggc gcg gtg gat gtg cgg aaa tat gac agt ctg aag gac gct 336
Gly Ile Gly Ala Val Asp Val Arg Lys Tyr Asp Ser Leu Lys Asp Ala
100 105 110
gct gag cgc tgc att aag gag ttg ggt ggc att gac ttt gtg att gca 384
Ala Glu Arg Cys Ile Lys Glu Leu Gly Gly Ile Asp Phe Val Ile Ala
115 120 125
ggc gcg gca gga aac ttc ctt gcg tcg atc aac caa ctt tcc gtg aac 432
Gly Ala Ala Gly Asn Phe Leu Ala Ser Ile Asn Gln Leu Ser Val Asn
130 135 140
gcc ttc aag tca gtg atg gac atc gat gtc ctg ggc tcc tac aac acc 480
Ala Phe Lys Ser Val Met Asp Ile Asp Val Leu Gly Ser Tyr Asn Thr
145 150 155 160
gtc aag gcc acc att ccg tac ctc gtt gaa tca gcg aag aaa cac aaa 528
Val Lys Ala Thr Ile Pro Tyr Leu Val Glu Ser Ala Lys Lys His Lys
165 170 175
gtt gat tcc aaa acc ctc cag cct tcc cct gcc ggc aca ggt gga aga 576
Val Asp Ser Lys Thr Leu Gln Pro Ser Pro Ala Gly Thr Gly Gly Arg
180 185 190
atc atc ttc gtc agt gcc acg ctc cac tac aga gga tca ccc ttc cag 624
Ile Ile Phe Val Ser Ala Thr Leu His Tyr Arg Gly Ser Pro Phe Gln
195 200 205
acg cat gtg gct gtc gcc aaa gcc gga gtg gat gcg ctg tcg aac aat 672
Thr His Val Ala Val Ala Lys Ala Gly Val Asp Ala Leu Ser Asn Asn
210 215 220
gtg gct atc gaa ttt ggc cct ctg gga gta act tcc aat gtc att gct 720
Val Ala Ile Glu Phe Gly Pro Leu Gly Val Thr Ser Asn Val Ile Ala
225 230 235 240
cct gga ccg att gca caa acg gag ggt ctc gaa cgt ctc ctc ccg ccg 768
Pro Gly Pro Ile Ala Gln Thr Glu Gly Leu Glu Arg Leu Leu Pro Pro
245 250 255
gat gtc aaa gaa atg tac acc aaa tcg caa cca ctt ggt cgg ctg gga 816
Asp Val Lys Glu Met Tyr Thr Lys Ser Gln Pro Leu Gly Arg Leu Gly
260 265 270
tct gtc aga gac atc gcc gat gcg acg gta tat ctc ttg tcg aac act 864
Ser Val Arg Asp Ile Ala Asp Ala Thr Val Tyr Leu Leu Ser Asn Thr
275 280 285
gga agc tat gta aat gga caa tta tta gtt gtt gac ggt ggc tcc tgg 912
Gly Ser Tyr Val Asn Gly Gln Leu Leu Val Val Asp Gly Gly Ser Trp
290 295 300
cgc acc agc ggc gat ttc tcg tac ccg gac ttc ttg ttg gca gga gga 960
Arg Thr Ser Gly Asp Phe Ser Tyr Pro Asp Phe Leu Leu Ala Gly Gly
305 310 315 320
gaa ttt gaa gga gtg aag ggg aag aaa tcg aaa ctt tga 999
Glu Phe Glu Gly Val Lys Gly Lys Lys Ser Lys Leu
325 330
<210>20
<211>332
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>20
Met Ser Thr Pro Lys Ser Asp Tyr Val Ser Asp Asp Trp Lys Asp Gly
1 5 10 15
Leu Phe Ser Glu Ser Ser Ile Phe Tyr Ser Pro Gln Ala Lys Arg Ile
20 25 30
Leu Ile Ala Asn Leu Val Glu Leu Glu Asn Lys Val Val Phe Cys Thr
35 40 45
Gly Gly Ala Gly Thr Ile Cys Ser Ala Gln Val Arg Ala Leu Val His
50 55 60
Leu Gly Ala Asp Ala Cys Ile Val Gly Arg Asn Val Glu Lys Thr Glu
65 70 75 80
Arg Ala Ala Lys Asp Ile Ala Ser Val Arg Ala Gly Ala Arg Val Ile
85 90 95
Gly Ile Gly Ala Val Asp Val Arg Lys Tyr Asp Ser Leu Lys Asp Ala
100 105 110
Ala Glu Arg Cys Ile Lys Glu Leu Gly Gly Ile Asp Phe Val Ile Ala
115 120 125
Gly Ala Ala Gly Asn Phe Leu Ala Ser Ile Asn Gln Leu Ser Val Asn
130 135 140
Ala Phe Lys Ser Val Met Asp Ile Asp Val Leu Gly Ser Tyr Asn Thr
145 150 155 160
Val Lys Ala Thr Ile Pro Tyr Leu Val Glu Ser Ala Lys Lys His Lys
165 170 175
Val Asp Ser Lys Thr Leu Gln Pro Ser Pro Ala Gly Thr Gly Gly Arg
180 185 190
Ile Ile Phe Val Ser Ala Thr Leu His Tyr Arg Gly Ser Pro Phe Gln
195 200 205
Thr His Val Ala Val Ala Lys Ala Gly Val Asp Ala Leu Ser Asn Asn
210 215 220
Val Ala Ile Glu Phe Gly Pro Leu Gly Val Thr Ser Asn Val Ile Ala
225 230 235 240
Pro Gly Pro Ile Ala Gln Thr Glu Gly Leu Glu Arg Leu Leu Pro Pro
245 250 255
Asp Val Lys Glu Met Tyr Thr Lys Ser Gln Pro Leu Gly Arg Leu Gly
260 265 270
Ser Val Arg Asp Ile Ala Asp Ala Thr Val Tyr Leu Leu Ser Asn Thr
275 280 285
Gly Ser Tyr Val Asn Gly Gln Leu Leu Val Val Asp Gly Gly Ser Trp
290 295 300
Arg Thr Ser Gly Asp Phe Ser Tyr Pro Asp Phe Leu Leu Ala Gly Gly
305 310 315 320
Glu Phe Glu Gly Val Lys Gly Lys Lys Ser Lys Leu
325 330
<210>21
<211>2400
<212>DNA
<213>Monascus aurantiacus AS3.4384
<220>
<221>CDS
<222>(1)..(2400)
<400>21
atg gga ggg gaa cac aga gag tcc aga tgt ctg tgc caa gtg cta agc 48
Met Gly Gly Glu His Arg Glu Ser Arg Cys Leu Cys Gln Val Leu Ser
1 5 10 15
tct gca act act ccg cat ccc cag aca atc ttc ggc gac aga cgt gct 96
Ser Ala Thr Thr Pro His Pro Gln Thr Ile Phe Gly Asp Arg Arg Ala
20 25 30
ctg aac cag act gaa cca gac gct ggg ttg gcc ctc act ttg gct gta 144
Leu Asn Gln Thr Glu Pro Asp Ala Gly Leu Ala Leu Thr Leu Ala Val
35 40 45
atg cat gtt gat gca ttt gtt gag att gcg gtc aat cta tat tat tat 192
Met His Val Asp Ala Phe Val Glu Ile Ala Val Asn Leu Tyr Tyr Tyr
50 55 60
gct acg atg ttg tcc tct tca tgt gct gcg agc atg ttg tcg gac tgc 240
Ala Thr Met Leu Ser Ser Ser Cys Ala Ala Ser Met Leu Ser Asp Cys
65 70 75 80
gtc tcc tcc agc ggc ctt ata ccg ggt ctc ttc tgg tgt cgt cct gtt 288
Val Ser Ser Ser Gly Leu Ile Pro Gly Leu Phe Trp Cys Arg Pro Val
85 90 95
tgg ctc gat ttc ccc ata atc gcc ctc ttc cgc cct tcc gaa gct ctt 336
Trp Leu Asp Phe Pro Ile Ile Ala Leu Phe Arg Pro Ser Glu Ala Leu
100 105 110
ttc ttg aca tca ttt ccc ggt tcc agg aac acc acc cat caa agc ttg 384
Phe Leu Thr Ser Phe Pro Gly Ser Arg Asn Thr Thr His Gln Ser Leu
115 120 125
cct cgc ggc gcg cgc gtc ctt cct ggc acg ttt cct ttg tcg tgc aca 432
Pro Arg Gly Ala Arg Val Leu Pro Gly Thr Phe Pro Leu Ser Cys Thr
130 135 140
ttt gcc tgg ctt gag aag acc gtt acg agg aac att gtc gta tat aat 480
Phe Ala Trp Leu Glu Lys Thr Val Thr Arg Asn Ile Val Val Tyr Asn
145 150 155 160
atc cgg cgt tac atg atg ccg cct ttg acg gct ttg gct ggt atc ccc 528
Ile Arg Arg Tyr Met Met Pro Pro Leu Thr Ala Leu Ala Gly Ile Pro
165 170 175
ctc aaa gtc gcc att ccc gct gct gca acg act cta gcc tac ctg aac 576
Leu Lys Val Ala Ile Pro Ala Ala Ala Thr Thr Leu Ala Tyr Leu Asn
180 185 190
gcc aga tgg tcc gtg tcg cac gat gtg gct ctc ggt cgg gcc ttc gct 624
Ala Arg Trp Ser Val Ser His Asp Val Ala Leu Gly Arg Ala Phe Ala
195 200 205
cat aca gtc ttg cag ccg gcg ttg gcc gaa cgt aat gat cgc ctg aac 672
His Thr Val Leu Gln Pro Ala Leu Ala Glu Arg Asn Asp Arg Leu Asn
210 215 220
ctc ttt tac ctc ctg gag tac tac gct ctc agc ccg aag cta gcc aac 720
Leu Phe Tyr Leu Leu Glu Tyr Tyr Ala Leu Ser Pro Lys Leu Ala Asn
225 230 235 240
aac acc tgg att gta tac aat ggc cgc agt tgg acg ttc cac gag ggg 768
Asn Thr Trp Ile Val Tyr Asn Gly Arg Ser Trp Thr Phe His Glu Gly
245 250 255
tac gag atg gta tta cgc tat ggc cac tgg ctc aag acg gtt cac ggg 816
Tyr Glu Met Val Leu Arg Tyr Gly His Trp Leu Lys Thr Val His Gly
260 265 270
gtt aag ccc aag gag atc gtg gcc atg gac ttt atg aac tcg tcg acc 864
Val Lys Pro Lys Glu Ile Val Ala Met Asp Phe Met Asn Ser Ser Thr
275 280 285
ttc atc ttc ttg atg ttc ggt ctc tgg agt atc ggt gcc gtg ccc gcc 912
Phe Ile Phe Leu Met Phe Gly Leu Trp Ser Ile Gly Ala Val Pro Ala
290 295 300
ttt atc aac tac aac ttg agt ggc aag cct ctg acg cac tcg gtg aag 960
Phe Ile Asn Tyr Asn Leu Ser Gly Lys Pro Leu Thr His Ser Val Lys
305 310 315 320
gcg tcc acc gca agg ttg ctg ttc gtc gat gag gat gtt cgg gag tgc 1008
Ala Ser Thr Ala Arg Leu Leu Phe Val Asp Glu Asp Val Arg Glu Cys
325 330 335
ttc ccg cag gaa cag ctg gat atc ttt acg tct ccg gat ttc cat gag 1056
Phe Pro Gln Glu Gln Leu Asp Ile Phe Thr Ser Pro Asp Phe His Glu
340 345 350
gac aag ggt ccg atg act gtc gtc ttc ttt acc cca gat ctg gag gcg 1104
Asp Lys Gly Pro Met Thr Val Val Phe Phe Thr Pro Asp Leu Glu Ala
355 360 365
cag att ctg cag acg cag ccg gtc cgt gag gat gac aga gcg cga cag 1152
Gln Ile Leu Gln Thr Gln Pro Val Arg Glu Asp Asp Arg Ala Arg Gln
370 375 380
ggc gtc atc cgc cgc gac atg gcc atc ttg atc tac acc agc ggc aca 1200
Gly Val Ile Arg Arg Asp Met Ala Ile Leu Ile Tyr Thr Ser Gly Thr
385 390 395 400
acg ggc ttg ccc aaa ccc gct att gtc agc tgg gcc aag tgt tac gcc 1248
Thr Gly Leu Pro Lys Pro Ala Ile Val Ser Trp Ala Lys Cys Tyr Ala
405 410 415
ggc ggt tat ttt tcc ggc tct tat atg ggt ttg aaa cag tcc gat cga 1296
Gly Gly Tyr Phe Ser Gly Ser Tyr Met Gly Leu Lys Gln Ser Asp Arg
420 425 430
ttc tac acg gtc cgc gag aat gat gcg act gcg gtt caa tac gtc ggt 1344
Phe Tyr Thr Val Arg Glu Asn Asp Ala Thr Ala Val Gln Tyr Val Gly
435 440 445
gaa acc atg cgc tat ctt ctt gcc gtc cct ccc gag gtc gat ccc gtg 1392
Glu Thr Met Arg Tyr Leu Leu Ala Val Pro Pro Glu Val Asp Pro Val
450 455 460
acg ggc gag gat ctt gac aag aag cac aat gta cga att atc ttt ggc 1440
Thr Gly Glu Asp Leu Asp Lys Lys His Asn Val Arg Ile Ile Phe Gly
465 470 475 480
aac gga cta cgc ccg gac gtc tgg aat aag gtc aag gag cgt ttc aat 1488
Asn Gly Leu Arg Pro Asp Val Trp Asn Lys Val Lys Glu Arg Phe Asn
485 490 495
atc cca act gta tgc gaa ttc tac gca tct aca gag ggt agc tcg gcc 1536
Ile Pro Thr Val Cys Glu Phe Tyr Ala Ser Thr Glu Gly Ser Ser Ala
500 505 510
acg tgg aac tta tca tcg aac agc cac agt gcg ggt gct att ggt agg 1584
Thr Trp Asn Leu Ser Ser Asn Ser His Ser Ala Gly Ala Ile Gly Arg
515 520 525
aat ggc gcc atc gcc aga ttt gtc ttc gaa cgt cgc cat gcc att gtt 1632
Asn Gly Ala Ile Ala Arg Phe Val Phe Glu Arg Arg His Ala Ile Val
530 535 540
gct gta gac cat gag agt cag cag ccc tgg cgg gat ccc aag acg ggg 1680
Ala Val Asp His Glu Ser Gln Gln Pro Trp Arg Asp Pro Lys Thr Gly
545 550 555 560
ctt tgc aag gcg gtg cct cgg gga gaa ccg ggc gag ctg ctg ctt gct 1728
Leu Cys Lys Ala Val Pro Arg Gly Glu Pro Gly Glu Leu Leu Leu Ala
565 570 575
ctc gat gcc aag gac act gaa gcc atg ttc cag ggt tac ttc aag aac 1776
Leu Asp Ala Lys Asp Thr Glu Ala Met Phe Gln Gly Tyr Phe Lys Asn
580 585 590
aac aag gcg aca gaa gac aag atc atc cgc gat gta tta acc aag ggt 1824
Asn Lys Ala Thr Glu Asp Lys Ile Ile Arg Asp Val Leu Thr Lys Gly
595 600 605
gac gcc tat ttc cgc act ggc gat atg att cgg tgg gac agc aat ggt 1872
Asp Ala Tyr Phe Arg Thr Gly Asp Met Ile Arg Trp Asp Ser Asn Gly
610 615 620
ctg tgg tac ttc tct gat cgc atg ggc gat acc ttc cgg tgg agg agt 1920
Leu Trp Tyr Phe Ser Asp Arg Met Gly Asp Thr Phe Arg Trp Arg Ser
625 630 635 640
gag aat gtt tcc acg agt gaa gtt gct gaa gta ctg gga gcg cac cct 1968
Glu Asn Val Ser Thr Ser Glu Val Ala Glu Val Leu Gly Ala His Pro
645 650 655
gaa gtg cac gag gcc aac gtc tac ggc gtt gcc tta cct cac cac gac 2016
Glu Val His Glu Ala Asn Val Tyr Gly Val Ala Leu Pro His His Asp
660 665 670
ggg cgt gct gga tgc gct gcc atc gtc ttt aga cat cag gcc cag aat 2064
Gly Arg Ala Gly Cys Ala Ala Ile Val Phe Arg His Gln Ala Gln Asn
675 680 685
aca gac cct tcg tca ggg gtc att gac ccg tca ccc cag gtg ctt ggt 2112
Thr Asp Pro Ser Ser Gly Val Ile Asp Pro Ser Pro Gln Val Leu Gly
690 695 700
gac gtt gca tcc tac gca ttg aag aac ctg ccc aaa tac gcg gtg ccc 2160
Asp Val Ala Ser Tyr Ala Leu Lys Asn Leu Pro Lys Tyr Ala Val Pro
705 710 715 720
atc ttc ctg cgc gtg acg cca gag atg cag gcg acg ggg aat aac aag 2208
Ile Phe Leu Arg Val Thr Pro Glu Met Gln Ala Thr Gly Asn Asn Lys
725 730 735
caa cag aag cat gtc ctg caa aag gaa ggc gtg gat cct tcc aag gtg 2256
Gln Gln Lys His Val Leu Gln Lys Glu Gly Val Asp Pro Ser Lys Val
740 745 750
aat gcc aaa gac aag cta tat tgg ctt cgg ggt gct acg tat gtg cca 2304
Asn Ala Lys Asp Lys Leu Tyr Trp Leu Arg Gly Ala Thr Tyr Val Pro
755 760 765
ttc cag cag aag gac tgg gag agg ttg aat gcc ggg cag tca gtc aat 2352
Phe Gln Gln Lys Asp Trp Glu Arg Leu Asn Ala Gly Gln Ser Val Asn
770 775 780
cac aat gag tcg ctg caa agc ttt gtc acg aga atc agc aca atc tga 2400
His Asn Glu Ser Leu Gln Ser Phe Val Thr Arg Ile Ser Thr Ile
785 790 795
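The paired codon and amino-acid lines of SEQ ID NO. 21 can be cross-checked with a short script. The sketch below is illustrative only: it assumes the standard genetic code (the listing does not name a translation table), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the patent.

```python
# Standard genetic code with codons ordered by the bases t, c, a, g.
# Assumption: the listing uses the standard table; it does not say so.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(codon_line: str) -> str:
    """Translate a space-separated codon line into one-letter amino acids."""
    return "".join(CODON_TABLE[codon] for codon in codon_line.lower().split())


# Final codon line of SEQ ID NO. 21, with the trailing base count left out.
last_line = "cac aat gag tcg ctg caa agc ttt gtc acg aga atc agc aca atc tga"
print(translate(last_line))  # prints HNESLQSFVTRISTI*
```

The one-letter result corresponds to the listed residues His Asn Glu Ser Leu Gln Ser Phe Val Thr Arg Ile Ser Thr Ile, followed by the stop codon tga.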
<210>22
<211>799
<212>PRT
<213>Monascus aurantiacus AS3.4384
<400>22
Met Gly Gly Glu His Arg Glu Ser Arg Cys Leu Cys Gln Val Leu Ser
1 5 10 15
Ser Ala Thr Thr Pro His Pro Gln Thr Ile Phe Gly Asp Arg Arg Ala
20 25 30
Leu Asn Gln Thr Glu Pro Asp Ala Gly Leu Ala Leu Thr Leu Ala Val
35 40 45
Met His Val Asp Ala Phe Val Glu Ile Ala Val Asn Leu Tyr Tyr Tyr
50 55 60
Ala Thr Met Leu Ser Ser Ser Cys Ala Ala Ser Met Leu Ser Asp Cys
65 70 75 80
Val Ser Ser Ser Gly Leu Ile Pro Gly Leu Phe Trp Cys Arg Pro Val
85 90 95
Trp Leu Asp Phe Pro Ile Ile Ala Leu Phe Arg Pro Ser Glu Ala Leu
100 105 110
Phe Leu Thr Ser Phe Pro Gly Ser Arg Asn Thr Thr His Gln Ser Leu
115 120 125
Pro Arg Gly Ala Arg Val Leu Pro Gly Thr Phe Pro Leu Ser Cys Thr
130 135 140
Phe Ala Trp Leu Glu Lys Thr Val Thr Arg Asn Ile Val Val Tyr Asn
145 150 155 160
Ile Arg Arg Tyr Met Met Pro Pro Leu Thr Ala Leu Ala Gly Ile Pro
165 170 175
Leu Lys Val Ala Ile Pro Ala Ala Ala Thr Thr Leu Ala Tyr Leu Asn
180 185 190
Ala Arg Trp Ser Val Ser His Asp Val Ala Leu Gly Arg Ala Phe Ala
195 200 205
His Thr Val Leu Gln Pro Ala Leu Ala Glu Arg Asn Asp Arg Leu Asn
210 215 220
Leu Phe Tyr Leu Leu Glu Tyr Tyr Ala Leu Ser Pro Lys Leu Ala Asn
225 230 235 240
Asn Thr Trp Ile Val Tyr Asn Gly Arg Ser Trp Thr Phe His Glu Gly
245 250 255
Tyr Glu Met Val Leu Arg Tyr Gly His Trp Leu Lys Thr Val His Gly
260 265 270
Val Lys Pro Lys Glu Ile Val Ala Met Asp Phe Met Asn Ser Ser Thr
275 280 285
Phe Ile Phe Leu Met Phe Gly Leu Trp Ser Ile Gly Ala Val Pro Ala
290 295 300
Phe Ile Asn Tyr Asn Leu Ser Gly Lys Pro Leu Thr His Ser Val Lys
305 310 315 320
Ala Ser Thr Ala Arg Leu Leu Phe Val Asp Glu Asp Val Arg Glu Cys
325 330 335
Phe Pro Gln Glu Gln Leu Asp Ile Phe Thr Ser Pro Asp Phe His Glu
340 345 350
Asp Lys Gly Pro Met Thr Val Val Phe Phe Thr Pro Asp Leu Glu Ala
355 360 365
Gln Ile Leu Gln Thr Gln Pro Val Arg Glu Asp Asp Arg Ala Arg Gln
370 375 380
Gly Val Ile Arg Arg Asp Met Ala Ile Leu Ile Tyr Thr Ser Gly Thr
385 390 395 400
Thr Gly Leu Pro Lys Pro Ala Ile Val Ser Trp Ala Lys Cys Tyr Ala
405 410 415
Gly Gly Tyr Phe Ser Gly Ser Tyr Met Gly Leu Lys Gln Ser Asp Arg
420 425 430
Phe Tyr Thr Val Arg Glu Asn Asp Ala Thr Ala Val Gln Tyr Val Gly
435 440 445
Glu Thr Met Arg Tyr Leu Leu Ala Val Pro Pro Glu Val Asp Pro Val
450 455 460
Thr Gly Glu Asp Leu Asp Lys Lys His Asn Val Arg Ile Ile Phe Gly
465 470 475 480
Asn Gly Leu Arg Pro Asp Val Trp Asn Lys Val Lys Glu Arg Phe Asn
485 490 495
Ile Pro Thr Val Cys Glu Phe Tyr Ala Ser Thr Glu Gly Ser Ser Ala
500 505 510
Thr Trp Asn Leu Ser Ser Asn Ser His Ser Ala Gly Ala Ile Gly Arg
515 520 525
Asn Gly Ala Ile Ala Arg Phe Val Phe Glu Arg Arg His Ala Ile Val
530 535 540
Ala Val Asp His Glu Ser Gln Gln Pro Trp Arg Asp Pro Lys Thr Gly
545 550 555 560
Leu Cys Lys Ala Val Pro Arg Gly Glu Pro Gly Glu Leu Leu Leu Ala
565 570 575
Leu Asp Ala Lys Asp Thr Glu Ala Met Phe Gln Gly Tyr Phe Lys Asn
580 585 590
Asn Lys Ala Thr Glu Asp Lys Ile Ile Arg Asp Val Leu Thr Lys Gly
595 600 605
Asp Ala Tyr Phe Arg Thr Gly Asp Met Ile Arg Trp Asp Ser Asn Gly
610 615 620
Leu Trp Tyr Phe Ser Asp Arg Met Gly Asp Thr Phe Arg Trp Arg Ser
625 630 635 640
Glu Asn Val Ser Thr Ser Glu Val Ala Glu Val Leu Gly Ala His Pro
645 650 655
Glu Val His Glu Ala Asn Val Tyr Gly Val Ala Leu Pro His His Asp
660 665 670
Gly Arg Ala Gly Cys Ala Ala Ile Val Phe Arg His Gln Ala Gln Asn
675 680 685
Thr Asp Pro Ser Ser Gly Val Ile Asp Pro Ser Pro Gln Val Leu Gly
690 695 700
Asp Val Ala Ser Tyr Ala Leu Lys Asn Leu Pro Lys Tyr Ala Val Pro
705 710 715 720
Ile Phe Leu Arg Val Thr Pro Glu Met Gln Ala Thr Gly Asn Asn Lys
725 730 735
Gln Gln Lys His Val Leu Gln Lys Glu Gly Val Asp Pro Ser Lys Val
740 745 750
Asn Ala Lys Asp Lys Leu Tyr Trp Leu Arg Gly Ala Thr Tyr Val Pro
755 760 765
Phe Gln Gln Lys Asp Trp Glu Arg Leu Asn Ala Gly Gln Ser Val Asn
770 775 780
His Asn Glu Ser Leu Gln Ser Phe Val Thr Arg Ile Ser Thr Ile
785 790 795
Claims
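1. A citrinin biosynthetic gene cluster having the nucleotide sequences shown in SEQ ID NO. 1 and SEQ ID NO. 2, the cluster comprising 10 genes in total, namely:
(1) citrinin modification genes: ctnD, ctnE, ctnF, ctnG, ctnH, ctnI, orf2, orf3 and orf4;
(2) a citrinin regulatory gene: ctnR1.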
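2. The gene cluster according to claim 1, characterized in that ctnD is located at 396 bp to 2312 bp of sequence 1, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 3, and the gene encodes a polypeptide having oxidoreductase activity.
3. The gene cluster according to claim 1, characterized in that ctnE is located at 2522 bp to 3464 bp of sequence 1, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 5, and the gene encodes a polypeptide having dehydrogenase activity.
4. The gene cluster according to claim 1, characterized in that orf2 is located at 3985 bp to 4493 bp of sequence 1, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 7, and the gene encodes a polypeptide.
5. The gene cluster according to claim 1, characterized in that ctnF is located at 1 bp to 1451 bp of sequence 2, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 9, and the gene encodes a polypeptide having mutase activity.
6. The gene cluster according to claim 1, characterized in that orf3 is located at 1987 bp to 4274 bp of sequence 2, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 11, and the gene encodes a polypeptide.
7. The gene cluster according to claim 1, characterized in that ctnR1 is located at 4710 bp to 6952 bp of sequence 2, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 13, and the gene encodes a polypeptide having WD-repeat protein activity.
8. The gene cluster according to claim 1, characterized in that orf4 is located at 7088 bp to 7951 bp of sequence 2, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 15, and the gene encodes a polypeptide.
9. The gene cluster according to claim 1, characterized in that ctnG is located at 8147 bp to 12633 bp of sequence 2, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 17, and the gene encodes a polypeptide having carbonic anhydrase activity.
10. The gene cluster according to claim 1, characterized in that ctnH is located at 13379 bp to 14698 bp of sequence 2, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 19, and the gene encodes a polypeptide having short-chain dehydrogenase activity.
11. The gene cluster according to claim 1, characterized in that ctnI is located at 14993 bp to 18255 bp of sequence 2, the coding sequence of the gene is the nucleotide sequence of SEQ ID NO. 21, and the gene encodes a polypeptide having acyl-CoA synthetase activity.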
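12. A method for blocking citrinin synthesis, characterized in that, with SEQ ID NO. 1 and SEQ ID NO. 2 as the target, a citrinin-producing strain is manipulated by gene knockout or replacement, antisense technology, or RNA interference technology so as to delete any part of the DNA sequence, thereby blocking citrinin production.
13. The method according to claim 12, characterized in that, with SEQ ID NO. 1 as the target, the ctnD gene is deleted by gene knockout, thereby blocking citrinin production.
14. The method according to claim 12, characterized in that, with SEQ ID NO. 2 as the target, the ctnI gene is deleted by gene knockout, thereby blocking citrinin production.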
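The gene positions recited in claims 2 to 11 can be collected into a small lookup table for checking span lengths and confirming that neighbouring genes do not overlap. This is a minimal sketch, not part of the claimed subject matter: the coordinates are copied from the claims and treated as 1-based inclusive (an assumption), and the names `GENE_SPANS`, `span_length` and `overlapping_pairs` are hypothetical.

```python
from collections import defaultdict

# Gene spans copied from claims 2-11: (sequence, start bp, end bp).
# Assumption: positions are 1-based and inclusive.
GENE_SPANS = {
    "ctnD": ("SEQ ID NO. 1", 396, 2312),
    "ctnE": ("SEQ ID NO. 1", 2522, 3464),
    "orf2": ("SEQ ID NO. 1", 3985, 4493),
    "ctnF": ("SEQ ID NO. 2", 1, 1451),
    "orf3": ("SEQ ID NO. 2", 1987, 4274),
    "ctnR1": ("SEQ ID NO. 2", 4710, 6952),
    "orf4": ("SEQ ID NO. 2", 7088, 7951),
    "ctnG": ("SEQ ID NO. 2", 8147, 12633),
    "ctnH": ("SEQ ID NO. 2", 13379, 14698),
    "ctnI": ("SEQ ID NO. 2", 14993, 18255),
}


def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based inclusive span."""
    return end - start + 1


def overlapping_pairs(spans):
    """Return pairs of neighbouring genes on the same sequence that overlap."""
    by_seq = defaultdict(list)
    for gene, (seq, start, end) in spans.items():
        by_seq[seq].append((start, end, gene))
    clashes = []
    for genes in by_seq.values():
        genes.sort()  # order genes by start position
        for (_, prev_end, prev), (start, _, cur) in zip(genes, genes[1:]):
            if start <= prev_end:
                clashes.append((prev, cur))
    return clashes


for gene, (seq, start, end) in GENE_SPANS.items():
    print(f"{gene}: {seq} {start}-{end} ({span_length(start, end)} bp)")
print("overlaps:", overlapping_pairs(GENE_SPANS))
```

These spans are positions within SEQ ID NO. 1 and SEQ ID NO. 2; they need not equal the lengths of the separately listed coding sequences.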
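Abstract
A citrinin biosynthetic gene cluster, belonging to the technical field of microbial genetics. The invention provides 10 genes related to citrinin biosynthesis: nine citrinin modification genes (ctnD, ctnE, ctnF, ctnG, ctnH, ctnI, orf2, orf3 and orf4) and one citrinin regulatory gene (ctnR1). The genes and proteins provided by the invention, and antibodies to them, can be used to find and develop compounds or proteins for use in medicine, industry and agriculture. The invention also provides a route for blocking citrinin biosynthesis.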
Document No.: C12N15/31GK101182525SQ200710156570
Publication date: May 21, 2008. Filing date: November 8, 2007. Priority date: November 8, 2007.
Inventors: 李燕萍, 楊 許, 阮瓊芳, 追 涂. Applicant: Nanchang University