Recently I learned something about DNA that blew my mind, and in this thread, I'll attempt to blow your mind as well. Behold: Chargaff's 2nd Parity Rule for DNA N-Grams.
If you are into cryptography or reverse engineering, you should love this.
Thread:

DNA consists of four different 'bases', A, C, G and T. These bases have specific meaning within our biology. Specifically, within the 'coding part' of a gene, a triplet of bases encodes for an amino acid
Most DNA is stored redundantly, in two connected strands. Wherever there is an A on one strand, you'll find a T on the other one. And similarly for C and G:

T G T C A G T
A C A G T C A

(note how the other strand is upside down - this matters!)
If you take all the DNA of an organism (both strands), you will find equal numbers of A's and T's, as well as equal numbers of C's and G's. This is true by definition.
This is called Chargaff's 1st parity rule.
https://t.co/jD4cMt0PJ0
Strangely enough, this rule also holds per strand! So even if you take away the redundancy, there are 99% equal numbers of A/T and C/G * on each strand *. And we don't really know why.
This is called Chargaff's 2nd parity rule.
Lots of people have advanced theories for why the number of C's and G's should match up, but as yet no slam dunk explanation has been reported. But, hold on, things are about to get even weirder! https://t.co/hO3ybOdPrR
It turns out the rule also holds for N-grams of bases! That is, as long as you both 'complement' and 'reverse' them. So for N=1, %C and %G are equal.

For N=2, this says that percentage of CC (%CC) and %GG are also equal, as are %AG and %CT (complemented AND reversed) etc.
You can compare this to turning a book upside down and reading it back to front, and finding that all three-letter words occur with equal frequency before and after turning over the book.
For DNA triplets like 'AAA', this looks like this. Left in blue is frequency of 'AAA', the right orange bar shows the reverse complement 'TTT'. And so on for all other 31 triplets. The correspondence is stunning:
And here are the tiny tiny differences for each triplet, all smaller than 0.2%. Note that this plot shows data for _all_ known bacterial chromosomes:
So why is this the case? There are lots and lots of theories, but there is no consensus yet. And that is what makes it so super interesting!
At the very core of life hides a mystery, a mystery that is easy to research from a computer. And I hope that one day soon we'll know for sure what is going on!
/ends

More from Science

Localized Surface Plasmon Resonance - an overview | ScienceDirect Topics

https://t.co/mzS7vVSREJ

https://t.co/353PdAX2fa

https://t.co/3yBImjOdd4

In some cases, almost 100% of the light energy can be converted to the second harmonic frequency. These cases typically involve intense pulsed laser beams passing through large crystals, and careful alignment to obtain phase matching.
https://t.co/hXlo8qgkD0
Look like that they got a classical case of PCR Cross-Contamination.
They had 2 fabricated samples (SRX9714436 and SRX9714921) on the same PCR run. Alongside with Lung07. They did not perform metagenomic sequencing on the “feces” and they did not get


A positive oral or anal swab from anywhere in their sampling. Feces came from anus and if these were positive the anal swabs must also be positive. Clearly it got there after the NA have been extracted and were from the very low-level degraded RNA which were mutagenized from

The Taq.
https://t.co/yKXCgiT29w to see SRX9714921 and SRX9714436.
Human+Mouse in the positive SRA, human in both of them. Seeing human+mouse in identical proportions across 3 different sequencers (PRJNA573298, A22, SEX9714436) are pretty straight indication that the originals

Were already contaminated with Human and mouse from the very beginning, and that this contamination is due to dishonesty in the sample handling process which prescribe a spiking of samples in ACE2-HEK293T/A549, VERO E6 and Human lung xenograft mouse.

The “lineages” they claimed to have found aren’t mutational lineages at all—all the mutations they see on these sequences were unique to that specific sequence, and are the result of RNA degradation and from the Taq polymerase errors accumulated from the nested PCR process

You May Also Like

Ivor Cummins has been wrong (or lying) almost entirely throughout this pandemic and got paid handsomly for it.

He has been wrong (or lying) so often that it will be nearly impossible for me to track every grift, lie, deceit, manipulation he has pulled. I will use...


... other sources who have been trying to shine on light on this grifter (as I have tried to do, time and again:


Example #1: "Still not seeing Sweden signal versus Denmark really"... There it was (Images attached).
19 to 80 is an over 300% difference.

Tweet: https://t.co/36FnYnsRT9


Example #2 - "Yes, I'm comparing the Noridcs / No, you cannot compare the Nordics."

I wonder why...

Tweets: https://t.co/XLfoX4rpck / https://t.co/vjE1ctLU5x


Example #3 - "I'm only looking at what makes the data fit in my favour" a.k.a moving the goalposts.

Tweets: https://t.co/vcDpTu3qyj / https://t.co/CA3N6hC2Lq