Publication of two schizophrenia papers in @Nature this week by the @PGCgenetics and SCHEMA consortiums marks a new major milestone in the field of psychiatric genetics. A ๐งต on the key findings from these two landmark papers.
These papers were available as preprints since 2020, but Nature publishing them side by side is an important recognition to the whole psychiatric genetics field and deserves celebration.
I take this opportunity to reflect on how far we've come in understanding this devastating disease. Just to be clear, I'm involved in neither of these studies. All credits should go to the amazing people from the PGC and SCHEMA consortiums.
High heterogeneity in the phenotype presentations combined with natural selection actively removing the psychiatric risk variants from the gene pool have made gene discovery in the field of psychiatry near impossible with small to moderate sample sizes.
This sequence is true for both common variant (GWAS) and rare variant studies. In terms of common variants, the field has made a huge progress is building a sample size large enough to make discoveries all the way up to variant level resolution.
The current GWAS has gathered a sample size of ~76k cases and ~240k controls, which is truly an impressive achievement by the PGC.
nature.com/articles/s41586-022-04434-5
In line with that, it looks like we are already reaching saturation at the genome and gene-set level findings.
At the genome level, the common variant heritability is 0.24, of which ~10% (0.024) can be now explained using only genome-wide significant variants.
The large N has resulted in a polygenic score, which might not be predictive enough for clinical application, but will be of immense value for research e.g. to study treatment response, cognitive deficits, age of onset, rare variant penetrance etc.
At the gene-set level, it's abundantly clear that most of the schizophrenia risk genes are expressed throughout the brain, within the excitatory and inhibitory neurons and are involved predominantly in the pre and post synaptic pathways (I love the SynGO sunburst plots)
Note, the discoveries at the gene set level is now limited not by the GWAS N, but by the size and availability of the brain gene expression datasets at better resolution across time and space. PsychENCODE is making great progress in creating this catalog.
science.org/collections/psychencode
At the gene and variant level, the increasing N is continuing to yield more genes and variants. These results themselves might not make immediate sense, but in combination with rare variants they will bring extraordinary insights, which we will see in the second paper.
In the second paper, the authors report an exome wide rare variant association analysis in ~24k cases and ~97k controls (~1/3rd of current GWAS sample size) that led to the discovery of 10 schizophrenia risk genes.
nature.com/articles/s41586-022-04556-w
Similar to GWAS, the ExWAS discoveries also progress through the same sequence. Most of the past exome studies have been powered enough to bring insights only up to the gene set levels. With increasing N, we are now making discoveries at gene level.
One major finding that was abundantly clear in the earlier exome studies is that loss of function and deleterious missense variants associated with schizophrenia are enriched within the loss of function intolerant genes.
With the now increased same size, this finding has become stronger in terms of P value and gained better resolution in terms of effect size and standard error.
And by aggregating the protein truncating variants and deleterious missense variants within each gene, the authors identify at least 10 risk genes with high statistical confidence including the previously known risk gene SETD1A.
One finding that excites me the most in the current papers is at last, the rare and common variant findings are starting to converge for schizophrenia, not just at the gene set level, but also at the individual gene level.
At some of the loci, e.g at GRIN2A locus, this convergence is remarkable with formation of an allelic series where variants representing varying levels of gene perturbations increase the schizophrenia risk at proportional effect sizes. It's amazing to see we've come this far.
Finally, we are also gaining more insights into the genetic overlap between schizophrenia and ASD/neurodevelopmental disorders (NDDs). Previous schizophrenia exome studies have seen clear enrichment of schizophrenia risk variants within NDD genes.
The current sample size has enabled the authors to zoom in on individual genes to study the schizophrenia and NDD risk variants. It seems at least for some genes there is a separation at the variant types with denovo missense causing NDDs and inherited PTVs causing schizophrenia
and relatively less severe PTVs lead to adolescent/adulthood onset schizophrenia. But it is much more complicated than that. When we talk about these variants we subconsciously assume a 100% penetrance model.
Nevertheless, at a higher level, there seem to be some level of separation in terms of gene properties (e.g. pre/postnatal exp, strength of the constraints) between NDD and schizophrenia genes. As sample size grow, we'll get more clarity on this in the future.
To conclude, the field is making great progress in understanding the genetics of this devastating disease and these efforts will one day lead to development of newer drugs. Big congrats to all the authors from the PGC and SCHEMA consortiums ๐ ๐