Introduction Recombinant Protein
Recombinant protein is a modified form of protein, which is generated in many ways to produce large quantities of proteins, alter gene sequences and produce useful commercial products. The formation of recombinant protein is carried out in specialized vehicles known as vectors. Recombinant protein is a protein encoded by a gene — recombinant DNA — that has been cloned in a system that supports the expression of the gene and the translation of messenger RNA. Proteins co-expressed in bacteria will not have post-translational modifications, e.g. phosphorylation or glycosylation. Eukaryotic expression systems are required for this. Efficient strategies for the production of recombinant proteins are gaining increasing importance, as more applications that require high amounts of high-quality proteins reach the market.
Recombinant proteins have been utilized as tools for cellular and molecular biology. Today, more than 75 recombinant proteins are used as pharmaceuticals, and more than 360 new medicines based on recombinant proteins are under development. The impact of the production of recombinant proteins has also extended to the development of bioinsecticides, diagnostic kits, enzymes with multiple applications, and bioremediation processes, among many others. In particular, areas such as detergent production and food processing have been among the most notable success. Even when hundreds of proteins are produced at a commercial scale, the production of recombinant proteins still imposes a challenge in many cases. In this assignment, typical problems encounter during recombinant protein production are reviewed and strategies to solve them and increase productivity.
1. Loss of Expression
A necessary condition for sufficient recombinant protein production is the efficient expression of the gene of interest. However, expression can be lost due to structural changes in the recombinant gene or loss of the gene from host cells. Plasmid-Based Systems Plasmids are extrachromosomal self-replicating cytoplasmic DNA elements that are found in prokaryotes and eukaryotes. They have been used as molecular tools for recombinant genes since the dawn of genetic engineering.
Plasmid-based expression
It is the most popular choice when we use prokaryotes as hosts, as genetic manipulation of plasmids is quite easy. Furthermore, gene dose that normally depends on plasmid copy number is higher than when the recombinant gene is inserted into the host’s chromosome. Plasmid copy number is an inherent character of each expression system and normally depends on the plasmid, the host, and the culture conditions. Plasmid copy numbers can range from a few up to 200. The metabolic load increases when there is an increase in the size of the insert, temperature, level of expression, recombinant protein yield, and toxicity of the protein expressed toward the host. Such kind of metabolic load sometimes results in a decrease in the growth rate of plasmid-containing cells. Consequently, the growth rate decreases and faster-growing plasmid-free cells ultimately overtake the culture. Loss of plasmid is the main cause of reduced recombinant protein productivity in plasmid-based systems. This is called plasmid segregation instability. Another factor that adds to plasmid instability is plasmid multimerization. As plasmid copies have the same sequence, they can recombine forming a single dimeric circle that has two origins of replication. This results in lesser independent units being separated between daughter cells, and subsequently, plasmid loss can increase. In addition, cells that bear multimers grow slower than those bearing monomers, even at the same copy numbers. Other factors that influence plasmid stability are plasmid size (larger plasmids are less stable) the presence of foreign DNA, rate of cell growth, nutrient availability, temperature, and mode of culture. Several natural mechanisms exist to guarantee plasmid survival in cell populations. For example, low-copy-number plasmids assurance their persistence by multimer resolution through site-specific recombination systems or via active partition mechanisms, such as the par sequences. Genes responsible for both mechanisms have been incorporated in such man-made plasmids that increase their stability
Chromosomal Integration
Chromosomal integration of the gene of interest is a better alternative to overcome problems of expression stability in plasmid-based systems. Chromosome integration is particularly suitable for the metabolic engineering of the host. However, many disadvantages over plasmid-based systems exist for recombinant protein production. Appropriate integration of a foreign gene in the chromosome is labour-intensive and very time-consuming. But the recombinant cells obtained are able to grow in the absence of antibiotics with no reduction of recombinant protein yields. This approach also had the advantage of not infringing patents. Still, a major problem faced with chromosomal integration is the chance that the gene of interest will become inserted into an inactive region of chromatin. Among the many strategies used to overcome such a problem is the use of locus control regions (LCRs), which ensures transcriptional regulation of the transgene.
Viral Vectors
Another easy and very effective way of delivering the gene of interest is via viral vectors. Viruses have evolved to transfer their genetic material to the host in an efficient and non-destructive way. Some viral vectors, such as retroviruses, facilitates the integration of the viral genome into the cell’s chromosome. Many others are used for temporary expression. In these cases, recombinant protein production occurs only during particular stages of the life cycle of the virus. The simplicity of this expression method makes it useful for production in higher eukaryotes, as obtaining stable recombinant animal cells may be a tiring and long process. For example, the insect cell baculovirus expression vector system (BEVS) is used to commercially produce many recombinant proteins.
Keep Reading:
- Bioprocessing of VLPs used in vaccine production
- What is health system?
- What is pooled risk in terms of healthcare insurance system?
- 4 helpful tips to avoid toxic people in your life in 2021
- Tuberculosis in Pakistan and its impact on an individual’s life
- Biofloc fish farming in Pakistan
Watch Our Latest Videos:
- Naila Shamal | Bride from Mardan demanded Rs 100k worth of books in Haq Mehr
- Top 10 places to visit in Islamabad this year
Moreover, BEVS is especially feasible for the production of vaccines. Recombinant gene expression from viral vectors comprises special issues that are different from those of plasmid- or of chromosome-based systems. The use of viral vectors is involving a process with two different phases: first, cells are grown to the desired cell density, and then they are infected with the virus of interest. In addition, a virus-free product must be ensured for most applications; thus, particular considerations are needed during purification procedures. One of the most important confines of expression systems based on viral vectors is the quality of the viral stock. Serial in vitro passaging of stocks can result in the emergence of mutant viruses that are known as defective interfering particles (DIP). The genome of DIP has several deletions that make their replication faster than that of intact viruses. Therefore, DIP fights for the cellular machinery and can severely decrease recombinant protein yields. As DIP replication needs a helper virus, in this case, the complete virus, their amassing can be avoided by using multiplicities of infection (MOI) less than 0.1 plaque-forming unit (pfu) per cell.
Posttranslational Processing
Folding, Aggregation, and Solubility Protein folding is an intricate process in which two kinds of molecules play an important role: foldases, which speed up protein folding; and chaperones, which protect from the formation of non-native insoluble folding entities. On occasions, folding does not proceed properly. This results in misfolded proteins that gather in intracellular aggregates known as inclusion bodies. One of the main reasons for incorrect protein folding is cell stress that may be caused by heat shock, nutrient deficiency, or some other stimuli. Cells answer to stress by enhancing the expression of various chaperones, some of them of the hsp70 and hsp100 families. Production of inactive proteins shows an energetic exhaust and metabolic load while amassing of inclusion bodies can also cause structural strains to the cell. For instance, many human pathologies, such as Alzheimer’s disease, Parkinson’s disease, and Huntington’s disease, are characterized by intracellular protein aggregation and accumulation. During the production of heterologous protein, high rates of expression are needed. Accordingly, the formation of an inclusion body is more likely to occur during the production of recombinant protein. Aggregation saves proteins from proteolysis and can smooth the progress of protein recovery by simply breaking the cells and centrifuging the inclusion bodies. In addition, when the expressed protein is harmful to the host, its deleterious effect can be disallowed by producing the heterologous product as inclusion bodies. Accordingly, inclusion body formation not only is wanted, but also can be promoted through molecular biology and/or operation strategies, such as the use of protease-lacking strains, culturing at high temperatures, or designing suitable fusion peptides and amino acid sequences through protein engineering approaches. Protein engineering can also reduce aggregation changing the level of hydrophobic regions or using fusion proteins are two successful strategies
Proteolytic Processing
Signal peptides, required to direct proteins to the various cellular compartments, must be cleaved to get a functional protein. Upon membrane translocation, the signal peptide is detached by a signal peptidase complex that is membrane-bound to the endoplasmic reticulum in eukaryotes or to the cellular membrane in prokaryotes. Inappropriate removal of the signal, the peptide may result in protein accumulation and retention within incorrect compartments, such as the endoplasmic reticulum. Ultimately, the yields of secreted proteins can be significantly reduced. To solve this problem, the E. coli signal peptidase I and the Bacillus subtilis signal peptidase are being overexpressed in E. coli and insect cells.
Glycosylation
Glycosylation is a very complex posttranslational modification that needs several consecutive steps and involves many enzymes and substrates (Fig. 2)
It usually occurs in the endoplasmic reticulum and Golgi apparatus of eukaryotic cells, although N-glycosylation has been noticed in proteins produced by bacteria. Three types of glycosylation is there: N-(glycans linked to an Asn of an AsnXaaSer/Thr consensus sequence, where Xaa is any amino acid), O-(glycans linked to a Ser or Thr), and C (attached to a tryptophan) like sequence. N-linked glycosylation is the most studied and is thought as the most relative for recombinant protein production. In many cases, glycosylation tells about protein, folding, solubility, antigenicity, localization, biological activity, stability and circulation half-life. Improper Glycosylation may activate immune responses when present in proteins for human or animal use. Therefore, proper glycosylation is especially required for recombinant proteins to be utilized as drugs. First, the synthesis of the dolichol phosphate oligosaccharide can affect the level of glycosylation. On the other hand, a decreased pool of sugar nucleotides, the activated sugar donors are required for oligosaccharide synthesis, limits the formation of the G3M9N2Dol PP precursor (where G is glucose, M is mannose, and N is N-acetylglucosamine) and decreases the glycosylation site possession. To improve such a problem, sugar nucleotide precursors have been added to the culture medium.
2. Other Posttranslational Modifications
Other posttranslational modifications, such as phosphorylation, myristoylation, sulfation, palmitoylation, isoprenylation, C-terminal amidation, β-hydroxylation, and methylation, are less common than glycosylation but are important for certain recombinant proteins.
Transport and Localization
Recombinant proteins may be transferred to different cellular compartments by signal peptides or through fusion proteins. Intracellular amassing often results in high protein amounts and allows easy recovery of concentrated protein along with cells. Nevertheless, purification of the product from the protein-rich cell extract may be tedious. Accordingly, concentration procedures, such as ultrafiltration, are always used before other purification stages when working with secreted proteins. Aggregation in the periplasm often results in soluble and correctly folded proteins, whereas cytoplasmic localization results in an inactive and insoluble product. Small proteins are susceptible to proteolysis and should be produced in E. coli as inclusion bodies. Apart from intra- or extracellular accumulation, some applications may need recombinant proteins to be targeted to the cell membrane, usually via fusion proteins.