BMC Biotechnology
BioMed Central

Software, Open Access

Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

Don Lorimer1,2, Amy Raymond1,2,3, John Walchli3,4, Mark Mixon3,4, Adrienne Barrow1,3, Ellen Wallace1,3, Rena Grice1,3, Alex Burgin1 and Lance Stewart*1,2,3

Address: 1deCODE biostructures, Inc., 7869 NE Day Road West, Bainbridge Island, WA, 98110, USA, 2Seattle Structural Genomics Center for Infectious Disease, Bainbridge Island, WA, 98110, USA, 3Accelerated Technologies Center for Gene to 3D Structure, Bainbridge Island, WA, 98110, USA and 4Emerald BioSystems, Inc., 7869 NE Day Road West, Bainbridge Island, WA, 98110, USA

E-mail: Don Lorimer - dlorimerdecode; Amy Raymond - araymonddecode; John Walchli - jwalchlidecode; Mark Mixon - mmixondecode; Adrienne Barrow - abarrowdecode; Ellen Wallace - ewallacedecode; Rena Grice - rgricedecode; Alex Burgin - aburgindecode; Lance Stewart* - lstewartdecode
* Corresponding author

Published: 21 April 2009
Received: 16 October 2008
Accepted: 21 April 2009

BMC Biotechnology 2009, 9:36 doi: 10.1186/1472-6750-9-36
This article is available from: :/ biomedcentral /1472-6750/9/36

© 2009 Lorimer et al; licensee BioMed Central Ltd.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (:/creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background: To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering.

Results: An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The
modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies.

Conclusion: We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.

Background

constructs, guided by 2D and 3D information, while the corresponding nucleic acid sequences are engineered for both codon usage and other desired sequence features. Gene Composer also enables the virtual cloning of the designed gene constructs which, depending on user preferences, can be parsed into data files for online ordering of complete genes or overlapping oligonucleotides that can be used for PCR-based gene assembly in any standard molecular biology lab. Gene Composer operates within the Windows operating system and utilizes a network based SQL server or Access database that is populated by users as
they design genes. This arrangement makes it possible for multiple users to go back after time to design new construct variants that improve on existing designs by inclusion of new sequence or structural information from international genome sequencing and structural genomics efforts. In this report we describe how the synthetic gene design modules of Gene Composer facilitate protein construct engineering for structural studies, codon engineering for heterologous protein production, and oligonucleotide planning for PCR-based gene assembly with mismatch endonuclease error correction.

Large-scale projects in genomic sequencing and protein structure determination are producing enormous quantities of data on the relationships between 2D gene sequence and 3D protein structure. Moreover, such efforts are providing experimental data on success factors at every step in the gene to structure research endeavor. Ideally, this wealth of information should be used in a feedback cycle to facilitate the design and production of genes and protein constructs that are optimized for the successful production of functional protein samples for structural studies. Fundamentally, this goal represents a bioinformatics software challenge. With the goal of improving yield and success rates of heterologous protein production for structural studies, we have developed Gene Composer, a database software package which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences.

The redundancy of the genetic code allows any given protein to be encoded by a very large number of possible synonymous gene sequences. On average, each amino acid can be encoded by approximately three different codons (61 amino acid codons/20 amino acids). For a typical 100 amino acid protein there would be 3^100 (5 × 10^47) different possible coding sequences. The degeneracy of the genetic code therefore allows the pressures of natural selection to simultaneously influence both DNA and RNA sequence features in addition to protein coding function. DNA sequence elements and folded RNA structures are known to play significant roles in
gene expression. As such, the overlapping information contained in a gene sequence can be significantly more complex than coding for a linear amino acid sequence. For example, in the tryptophan operon of E. coli, the mRNA can fold into one of two mutually exclusive conformations that are a direct consequence of tryptophan availability [1]. These alternate conformations affect mRNA stability and therefore alter the expression of the encoded proteins. It is also well established that codon preferences between species, and often between gene families within a given species, can vary [2,3]. Therefore, some ge
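
The codon-count estimate in the Background (61 sense codons shared among 20 amino acids, hence roughly 3^100 coding sequences for a 100-residue protein) can be checked with a short calculation. The sketch below is illustrative only and is not part of Gene Composer: the table `DEGENERACY` and the function `count_synonymous_sequences` are hypothetical names introduced here, and the codon counts assume the standard genetic code.

```python
from math import prod

# Number of sense codons per amino acid, assuming the standard genetic code.
# (Hypothetical helper table for illustration; not taken from Gene Composer.)
DEGENERACY = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2,
    "K": 2, "N": 2, "Q": 2, "Y": 2,
    "M": 1, "W": 1,
}


def count_synonymous_sequences(protein: str) -> int:
    """Return how many distinct coding sequences encode the given protein.

    The count is the product of the per-residue codon degeneracies;
    stop codons are not included.
    """
    return prod(DEGENERACY[residue] for residue in protein.upper())


# 61 sense codons over 20 amino acids gives about three codons per residue.
total_sense_codons = sum(DEGENERACY.values())
average_codons = total_sense_codons / len(DEGENERACY)

# The paper's rough estimate for a 100-residue protein.
rough_estimate = 3 ** 100

if __name__ == "__main__":
    print(f"sense codons: {total_sense_codons}, average per amino acid: {average_codons:.2f}")
    print(f"3^100 is about {rough_estimate:.2e}")
    print(f"exact count for 'MLSR': {count_synonymous_sequences('MLSR')}")
```

The exact count for a real sequence depends on its composition: a protein rich in leucine, serine, or arginine has far more synonymous encodings than one rich in methionine or tryptophan, so 3^100 is best read as an order-of-magnitude figure.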