Triple helix explained

In the fields of geometry and biochemistry, a triple helix (: triple helices) is a set of three congruent geometrical helices with the same axis, differing by a translation along the axis. This means that each of the helices keeps the same distance from the central axis. As with a single helix, a triple helix may be characterized by its pitch, diameter, and handedness. Examples of triple helices include triplex DNA,[1] triplex RNA,[2] the collagen helix,[3] and collagen-like proteins.

Structure

A triple helix is named such because it is made up of three separate helices. Each of these helices shares the same axis, but they do not take up the same space because each helix is translated angularly around the axis. Generally, the identity of a triple helix depends on the type of helices that make it up. For example: a triple helix made of three strands of collagen protein is a collagen triple helix, and a triple helix made of three strands of DNA is a DNA triple helix.

As with other types of helices, triple helices have handedness: right-handed or left-handed. A right-handed helix moves around its axis in a clockwise direction from beginning to end. A left-handed helix is the right-handed helix's mirror image, and it moves around the axis in a counterclockwise direction from beginning to end.[4] The beginning and end of a helical molecule are defined based on certain markers in the molecule that do not change easily. For example: the beginning of a helical protein is its N terminus, and the beginning of a single strand of DNA is its 5' end.

The collagen triple helix is made of three collagen peptides, each of which forms its own left-handed polyproline helix.[5] When the three chains combine, the triple helix adopts a right-handed orientation. The collagen peptide is composed of repeats of Gly-X-Y, with the second residue (X) usually being Pro and the third (Y) being hydroxyproline.[6]

A DNA triple helix is made up of three separate DNA strands, each oriented with the sugar/phosphate backbone on the outside of the helix and the bases on the inside of the helix. The bases are the part of the molecule closest to the triple helix's axis, and the backbone is the part of the molecule farthest away from the axis. The third strand occupies the major groove of relatively normal duplex DNA.[7] The bases in triplex DNA are arranged to match up according to a Hoogsteen base pairing scheme.[8] Similarly, RNA triple helices are formed as a result of a single stranded RNA forming hydrogen bonds with an RNA duplex; the duplex consists of Watson-Crick base pairing while the third strand binds via Hoogsteen base pairing.[9]

Stabilizing factors

The collagen triple helix has several characteristics that increase its stability. When proline is incorporated into the Y position of the Gly-X-Y sequence, it is post-translationally modified to hydroxyproline.[10] The hydroxyproline can enter into favorable interactions with water, which stabilizes the triple helix because the Y residues are solvent-accessible in the triple helix structure. The individual helices are also held together by an extensive network of amide-amide hydrogen bonds formed between the strands, each of which contributes approximately -2 kcal/mol to the overall free energy of the triple helix. The formation of the superhelix not only protects the critical glycine residues on the interior of the helix, but also protects the overall protein from proteolysis.

Triple helix DNA and RNA are stabilized by many of the same forces that stabilize double-stranded DNA helices. With nucleotide bases oriented to the inside of the helix, closer to its axis, bases engage in hydrogen bonding with other bases. The bonded bases in the center exclude water, so the hydrophobic effect is particularly important in the stabilization of DNA triple helices.

Biological role

Proteins

Members of the collagen superfamily are major contributors to the extracellular matrix. The triple helical structure provides strength and stability to collagen fibers by providing great resistance to tensile stress. The rigidity of the collagen fibers is an important factor that can withstand most mechanical stress, making it an ideal protein for macromolecular transport and overall structural support throughout the body.

DNA

There are some oligonucleotide sequences, called triplet-forming oligonucleotides (TFOs) that can bind to form a triplex with a longer molecule of double-stranded DNA; TFOs can inactivate a gene or help to induce mutations. TFOs can only bind to certain sites in a larger molecule, so researchers must first determine whether a TFO can bind to the gene of interest. Twisted intercalating nucleic acid is sometimes used to improve this process. Mapping of genome-wide TFO-TTS pairs by sequencing is a useful way to study the triplex forming DNA in the whole genome using oligo-library.

RNA

In recent years, the biological function of triplex RNA has become more studied. Some roles include increasing stability, translation, influencing ligand binding, and catalysis. One example of ligand binding being influenced by a triple helix is in the SAM-II riboswitch where the triple helix creates a binding site that will uniquely accept S-adenosylmethionine (SAM). The ribonucleoprotein complex telomerase, responsible for replicating the tail-ends of DNA (telomeres) also contains triplex RNA believed to be necessary for proper telomerase functioning.[11] The triple helix at the 3' end of the PAN and MALAT1 long-noncoding RNAs serves to stabilize the RNA by protecting the Poly(A) tail from deadenylation, which subsequently affect their functions in viral pathogenesis and multiple human cancers.[12] Additionally, RNA triple helices can stabilize mRNAs by formation of a poly(A) tail 3'-end binding pocket.[13]

Computational Tools

TDF (Triplex Domain Finder)

TDF is a python-based package [14] to predict RNA-DNA triplex formation potential. The software starts by enumerating the substrings between TFO and TTS and uses statistical tests to find out significant result compared to the background.

Triplexfpp

Triplexfpp[15] is based on deep learning methods. This python-based pipelines can help predict the most likely triplex-forming lncRNA. However since the lncRNA for training is limited, there is a long way to go before machine learning and deep learning methods can be applied.

Notes and References

  1. Book: Bernués J, Azorín F . Triple-Stranded DNA . Nucleic Acids and Molecular Biology . 9 . 1995 . 1–21 . Springer . Berlin, Heidelberg . 10.1007/978-3-642-79488-9_1 . 978-3-642-79490-2 .
  2. Buske FA, Mattick JS, Bailey TL . Potential in vivo roles of nucleic acid triple-helices . RNA Biology . 8 . 3 . 427–439 . May 2011 . 21525785 . 3218511 . 10.4161/rna.8.3.14999 .
  3. Book: Collagen: Primer in Structure, Processing and Assembly. Bächinger HP . 2005-05-03. Springer Science & Business Media. 9783540232728.
  4. Book: The molecules of life : physical and chemical principles. Kuriyan J, Konforti B, Wemmer D . 9780815341888. New York . Garland Science, Taylor & Francis Group . 779577263. 2012-07-25.
  5. Shoulders MD, Raines RT . Collagen structure and stability . Annual Review of Biochemistry . 78 . 929–958 . 2009 . 19344236 . 2846778 . 10.1146/annurev.biochem.77.032207.120833 .
  6. Fidler AL, Boudko SP, Rokas A, Hudson BG . The triple helix of collagens - an ancient protein structure that enabled animal multicellularity and tissue evolution . Journal of Cell Science . 131 . 7 . jcs203950 . April 2018 . 29632050 . 5963836 . 10.1242/jcs.203950 .
  7. Jain A, Wang G, Vasquez KM . DNA triple helices: biological consequences and therapeutic potential . Biochimie . 90 . 8 . 1117–1130 . August 2008 . 18331847 . 2586808 . 10.1016/j.biochi.2008.02.011 .
  8. Duca M, Vekhoff P, Oussedik K, Halby L, Arimondo PB . The triple helix: 50 years later, the outcome . Nucleic Acids Research . 36 . 16 . 5123–5138 . September 2008 . 18676453 . 2532714 . 10.1093/nar/gkn493 .
  9. Conrad NK . The emerging role of triple helices in RNA biology . Wiley Interdisciplinary Reviews. RNA . 5 . 1 . 15–29 . 2014 . 24115594 . 4721660 . 10.1002/wrna.1194 .
  10. Brodsky B, Persikov AV . Molecular structure of the collagen triple helix . Advances in Protein Chemistry . 70 . 301–339 . 2005-01-01 . 15837519 . 10.1016/S0065-3233(05)70009-7 . 9780120342709 . 20879450 .
  11. Theimer CA, Blois CA, Feigon J . Structure of the human telomerase RNA pseudoknot reveals conserved tertiary interactions essential for function . Molecular Cell . 17 . 5 . 671–682 . March 2005 . 15749017 . 10.1016/j.molcel.2005.01.017 . free .
  12. Brown JA, Bulkley D, Wang J, Valenstein ML, Yario TA, Steitz TA, Steitz JA . Structural insights into the stabilization of MALAT1 noncoding RNA by a bipartite triple helix . Nature Structural & Molecular Biology . 21 . 7 . 633–640 . July 2014 . 24952594 . 4096706 . 10.1038/nsmb.2844 .
  13. Torabi SF, Vaidya AT, Tycowski KT, DeGregorio SJ, Wang J, Shu MD, Steitz TA, Steitz JA . 6 . RNA stabilization by a poly(A) tail 3'-end binding pocket and other modes of poly(A)-RNA interaction . Science . 371 . 6529 . eabe6523 . February 2021 . 33414189 . 9491362 . 10.1126/science.abe6523 . 231195473 .
  14. Kuo CC, Hänzelmann S, Sentürk Cetin N, Frank S, Zajzon B, Derks JP, Akhade VS, Ahuja G, Kanduri C, Grummt I, Kurian L, Costa IG . 6 . Detection of RNA-DNA binding sites in long noncoding RNAs . Nucleic Acids Research . 47 . 6 . e32 . April 2019 . 30698727 . 10.1093/nar/gkz037 . 6451187 .
  15. Zhang Y, Long Y, Kwoh CK . Deep learning based DNA:RNA triplex forming potential prediction . BMC Bioinformatics . 21 . 1 . 522 . November 2020 . 33183242 . 7663897 . 10.1186/s12859-020-03864-0 . free .