Feeds:
Posts
Comments

Posts Tagged ‘medicine’

BBC_news_sperm_count

BBC news article published on the 18th March 2018. According to the article, men with low sperm counts are at a higher risk of disease/health problems. However, this is unlikely to be a causal relationship and more likely to be a spurious correlation. May even turn out to be the other way round due to “reverse causality”, a bias we encounter a lot in epidemiological studies. The following sounds more plausible (to me at least!): “Men with disease/health problems are likely to have low sperm counts” (likely cause: men with health problems tended to smoke more in general and this caused low sperm counts in those individuals).

As an enthusiastic genetic epidemiologist (keyword here: epidemiologist), I try to keep in touch with the latest developments in medicine and epidemiology. However, it is impossible to read all articles that come out as there is a lot of epidemiology and/or medicine papers published daily (in fact, too much!). For this reason, instead of reading the original academic papers (excluding papers in my specific field), I try to skim read from reputable news outlets such as the BBC, The Guardian and Medscape (mostly via Twitter). However, health news even in these respectable media outlets are full of wrong and/or oversensationalised titles: they either oversensationalise what the scientist has said or take the word of the scientist they contact – who are not infallible and can sometimes believe in their own hypotheses too much.

It wouldn’t harm us too much if the message of an astrophysics related publication is misinterpreted but we couldn’t say the same with health related news. Many people take these news articles as gospel truth and make lifestyle changes accordingly. Probably the best example for this is the Andrew Wakefield scandal in 1998 – where he claimed that the MMR vaccine caused autism and gastro-intestinal disease but later investigations showed that he had undeclared conflicts of interest and had faked most of the results (click here for a detailed article in the scandal). Many “anti-vaccination” (aka anti-vax) groups used his paper to strengthen their arguments and – although now retracted – the paper’s influence can still be felt today as many people, including my friends, do not allow their children to be vaccinated as they falsely think they might succumb to diseases like autism because of it.

The first thing we’re taught in our epidemiology course is “correlation does not mean causation.” However, a great deal of epidemiology papers published today report correlations (aka associations) without bringing in other lines of evidence to provide evidence for a causal relationship. Some of the “interesting ones” amongst these findings are then picked up by the media and we see a great deal of news articles with titles such as “coffee causes cancer” or “chocolate eaters are more successful in life”. There have been instances when I read the opposite in the same paper a couple of months later (example: wine drinking is protective/harmful for pregnant women). The problem isn’t caused only due to a lack of scientific method training on the media side, but also due to health scientists who are eager to make a name for themselves in the lay media without making sure that they have done everything they could to ensure that the message they’re giving is correct (e.g. triangulating using different methods). As a scientist who analyses a lot of genetic and phenotypic data, it is relatively easier for me to observe that the size of the data that we’re analysing has grown massively in the last 5-10 years. However, in general, we scientists haven’t been able to receive the computational and statistical training required to handle these ‘big data’. Today’s datasets are so massive that if we take the approach of “let’s analyse everything we got!”, we will find a tonne of correlations in our data whether they make sense or not.

To provide a simple example for illustrative purposes: let’s say that amongst the data we have in our hands, we also have each person’s coffee consumption and lung cancer diagnosis data. If we were to do a simple linear regression analysis between the two, we’d most probably find a positive correlation (i.e. increased coffee consumption means increased risk of lung cancer). 10 more scientists will identify the same correlation if they also get their hands on the same dataset; 3 of them will believe that the correlation is worthy of publication and submit a manuscript to a scientific journal; and one (other two are rejected) will make it past the “peer review” stage of the journal – and this will probably be picked up by a newspaper. Result: “coffee drinking causes lung cancer!”

However, there’s no causal relationship between coffee consumption and lung cancer (not that I know of anyway :D). The reason we find a positive correlation is because there is a third (confounding) factor that is associated with both of them: smoking. Since coffee drinkers smoke more in general and smoking causes lung cancer, if we do not control for smoking in our statistical model, we will find a correlation between coffee drinking and lung cancer. Unfortunately, it is not very easy to eliminate such spurious correlations, therefore health scientists must make sure they use several different methods to support their claims – and not try to publish everything they find (see “publish or perish” for an unfortunate pressure to publish more in scientific circles).

cikolata_ve_nobel_odulu

A figure showing the incredible correlation between countries’ annual per capita chocolate consumption and the number of Nobel laureates per 10 million population. Should we then give out chocolate in schools to ensure that the UK wins more Nobel prizes? However, this is likely not a causal relationship as it makes more sense that there is a (confounding) factor that is related to both of them: (most likely) GDP per capita at purchasing power parity. To view even quirkier correlations, I’d recommend this website (by Tyler Vigen). Image source: http://www.nejm.org/doi/full/10.1056/NEJMon1211064.

As a general rule, I keep repeating to friends: the more ‘interesting’ a ‘discovery’ sounds, the more likely it is to be false.

Hard to explain why I think like this but I’ll try: for a result to sound ‘interesting’ to me, it should be an unexpected finding as a result of a radical idea. There are just so many brilliant scientists today that finding unexpected things is becoming less and less likely – as almost every conceivable idea arises and is being tested in several groups around the world, especially in well researched areas such as cancer research. For this reason, the idea of a ‘discovery’ has changed from the days of Newtons and Einsteins. Today, ‘big discoveries’ (e.g. Mendel’s pea experimets, Einstein’s general relativity, Newton’s law of motion) have given way to incremental discoveries, which can be as valuable. So with each (well-designed) study, we’re getting closer and closer to cures/therapies or to a full understanding of underlying biology of diseases. There are still big discoveries made (e.g. CRISPR-Cas9 gene editing technique), but if they weren’t discovered by that respective group, they probably would have been discovered within a short space of time by another group as the discoverers built their research on a lot of other previously published papers. Before, elite scientists such as Newton and Einstein were generations ahead of their time and did most things on their own, but today, even the top scientists are probably not too ahead of a good postdoc as most science literature is out there for all to read in a timely manner (and more democratic compared to the not-so-distant past) and is advancing so fast that everyone is left behind – and we’re all dependent on each other to make discoveries. The days of lone wolves is virtually over as they will get left behind those who work in groups.

To conclude, without carefully reading the scientific paper that the newspaper article is referring to – hopefully they’ve included a link/citation at the bottom of the page! – or seeking what an impartial epidemiologist is saying about it, it’d be wise to take any health-related finding we read in newspapers with a pinch of salt as there are many things that can go wrong when looking for causal relationships – even scientists struggle to make the distinction between correlations and causal relationships.

power_posing

Amy Cuddy’s very famous ‘Power posing’ talk, which was the most watched video on the TED website for some time. In short, she states that if you give powerful/dominant looking poses, this will induce hormonal changes which will make you confident and relieve stress. However, subsequent studies showed that her ‘finding’ could not be replicated and she that did not analyse her data in the manner expected of a scientist. If a respectable scientist had found such a result, they would have tried to replicate their results; at least would have followed it up with studies which bring other lines of concrete evidence. What does she do? Write a book about it by bringing in anecdotal evidence at best and give a TED talk as if it’s all proven – as becoming famous (by any means necessary) is the ultimate aim for many people; and many academics are no different. Details can be found here. TED talk URL: https://www.ted.com/talks/amy_cuddy_your_body_language_shapes_who_you_are

PS: For readers interested in reading a bit more, I’d like to add a few more sentences. We should apply the below four criteria – as much as we can – to any health news that we read:

(i) Is it evidence based? (e.g. supported by a clinical trial, different experiments) – homeopathy is a bad example in this regard as they’re not supported by clinical trials, hence the name “alternative medicine” (not saying they’re all ineffective and further research is always required but most are very likely to be);

(ii) Does it make sense epidemiologically? (e.g. the example mentioned above i.e. the correlation observed between coffee consumption and lung cancer due to smoking);

(iii) Does it make sense biologically? (e.g. if gene “X” causes eye cancer but the gene is only expressed in the pancreatic cells, then we’ve most probably found the wrong gene)

(iv) Does it make sense statistically? (e.g. was the correct data quality control protocol and statistical method used? See figure below for a data quality problem and how it can cause a spurious correlation in a simple linear regression analysis)

graph-3

Wrong use of a statistical (linear regression) model. If we were to ignore the outlier data point at the top right of the plot, it becomes easy to see that there is no correlation between the two variables on the X and Y axes. However, since this outlier data point has been left in and a linear regression model has been used, the model identifies a positive correlation between the two variables – we would not have seen that this was a spurious correlation had we not visualised the data.

PPS: I’d recommend reading “Bad Science” by Ben Goldacre and/or “How to Read a Paper – The basics of evidence based medicine” by Trisha Greenhalgh – or if you’d like to read a much better article on this subject with a bit more technical jargon, have a look this highly influential paper by Prof. John Ioannidis: Why Most Published Research Findings Are False.

References:

Wakefield et al, 1998. Ileal-lymphoid-nodular hyperplasia, non-specific colitis, and pervasive developmental disorder in children. The Lancet. URL: http://www.thelancet.com/journals/lancet/article/PIIS0140-6736%2897%2911096-0/abstract

Editorial, 2011. Wakefield’s article linking MMR vaccine and autism was fraudulent. BMJ. URL: http://www.bmj.com/content/342/bmj.c7452

Read Full Post »

Laws regarding first-cousin marriage around the world. Navy blue: First-cousin marriage legal. Light Blue: Allowed with restrictions or exceptions. Yellow: Legality dependent on religion or culture. Red: Statute bans first-cousin marriage. Pink: Banned with exceptions. Dark Red: Criminal offense. Grey: No available data. The image has been released into the public domain by the author (URL: http://en.wikipedia.org/wiki/Cousin_marriage).

Laws regarding first-cousin marriage around the world. Navy blue: First-cousin marriage legal. Light Blue: Allowed with restrictions or exceptions. Yellow: Legality dependent on religion or culture. Red: Statute bans first-cousin marriage. Pink: Banned with exceptions. Dark Red: Criminal offense. Grey: No available data. The image has been released into the public domain by the author (URL: http://en.wikipedia.org/wiki/Cousin_marriage).

The answer is (studying) consanguinity (i.e. unions between relatives such as first-cousin marriages); and one cannot understand the complexity of the issue (and make ‘informed’ decisions) without reading the literature of these five apparently unconnected fields. It is fair to say that there is a degree of hostility towards consanguineous marriages in Western societies. However this perception is usually attained without in-depth knowledge on the genetic effects of consanguinity. In short, consanguinity per se (i.e. on its own) does not cause a disorder, but rather it increases the probability of an autosomal recessive disorder (which require two copies of the same) causal mutation to be in a homozygous state (i.e. possess two copies of the same mutation). When this happens both copies of the genes we inherited from our parents do not function properly.

Unions between individuals who are second-cousins or closer are considered ‘consanguineous’ in clinical genetics. Consanguineous families with diseases have been studied thoroughly by clinical geneticists for the last two-three decades – and this has allowed for identification of many disease causal genes. However, studying consanguineous populations as a whole rather than ‘cherry picking’ families with disease can offer much more for better understanding our genome and therefore finding new targets for preventive and curative medicine. Many genes in our genome still have unknown functions and we have merely scratched the surface in terms of their interactions. I hypothesise that assigning a function to the thousands of remaining genes will only be feasible if consanguineous populations are studied as a whole (i.e. also including families without disease to the studies) and I therefore carry out theoretical studies to estimate the sample size needed and how many genes will be completely ‘knocked-out’ if these studies were to be carried out. This approach proposes a ‘paradigm shift’ in clinical genetics.

Global prevalence of consanguineous unions. Consanguinity has deep roots in many cultures and it is impossible to interfere/intervene from the outside without first understanding why people engage in cousin marriages. Image source URL: www.consang.net/

Global prevalence of consanguineous unions. Consanguinity has deep roots in many cultures and it is impossible to interfere/intervene from the outside without first understanding why people engage in cousin marriages. Image source URL: http://www.consang.net/

Consanguineous unions occur very rarely in Western countries for a variety of sociological (e.g. cultural, negative media coverage) and statistical reasons (e.g. smaller families means fewer cousins at similar age), but the complete opposite is true in certain regions of the world where union of kin is seen as the default choice. Conservative estimates predict that approximately one-sixth of the world’s population (a figure of 1.1 billion is proposed by the Geneva International Consanguinity Workshop Report) live in highly consanguineous regions; and also another one-sixth falls into the ‘unknown’ category – reflecting the need for further research. Historically, consanguineous unions were also common amongst the elite in the UK (up to mid-19th century, including Charles Darwin), the Pharaohs and the Royal families of Europe (e.g. Habsburgs).

Views of main religions towards consanguineous marriages. NB: Where first-cousin marriages are allowed, lower levels of consanguinity are also allowed. Image Source: Copy-pasted from my own PhD thesis

The increase in the probability of a mutation being homozygous will depend on the level of relatedness between the parents. For example, approximately 6.25% of mutations are expected to be homozygous in the offspring of first cousins. This figure would be (near) 0% in the offspring of outbred individuals. Genetically, this is the main difference between union of kin and union of unrelated individuals. We all have many disease-causal mutations in our genomes (but in heterozygous state, i.e. one normal copy and one mutated copy) and different kinds of mutations are out there in all populations. However because these mutations will be very rare or are unique to you or your family, they do not get to meet their counterpart when you have offspring with an unrelated individual. Therefore the mutation’s homozygous effects are never observed. This is why rare autosomal recessive disorders are almost always seen in consanguineous offspring.

This difference in homozygosity levels is also one of the main reasons behind the necessity of studying consanguineous individuals and populations. These studies can turn unfortunate events (e.g. disorder in families) to a great use for medical sciences. Not only will identifying a disease-causal mutation help with diagnostics, they can enable scientists to understand what certain genes do and help us understand why the gene causes that disease. Rare instances can be highly informative about preventable outcomes relevant to the whole population. For example, had more notice been taken in the 1980s of the proof which familial hypercholesterolemia provided for the causal role of cholesterol in coronary heart disease (CHD), high cholesterol intake would have been better addressed for the nation a decade sooner. To provide numbers, CHD is still the UK’s biggest killer causing over 80 thousand deaths a year, thus paying more attention to information that was coming from studies of consanguineous unions could have saved thousands of lives just in this single case.

Given the advancements in genetic diagnostics (e.g. huge decreases in costs of DNA sequencing), screening for all known mutations will become feasible in the near future for everybody – and identifying disease-causal mutations will become even more useful for all of us. Our genomes are constantly being mutated and my approach will enable a much broader understanding of our genome by observing these mutations’ homozygous effects. Finally, rather than discourage (See link for an example) consanguineous marriages totally (not feasible in the foreseeable future due to many socio-economic and cultural reasons), for those willing to marry a cousin, screening for previously identified mutations will help these couples take more informed decisions.

consanguinity factors culture socio-economic

Factors influenced by consanguinity and culture. Image Source: Copy-pasted from my own PhD thesis (hence the Figure 1.10)

Key reference:

A. Mesut Erzurumluoglu, 2016. Population and family based studies of consanguinity: Genetic and Computational approaches. PhD thesis. University of Bristol.

Erzurumluoglu AM et al, 2016. Importance of Genetic Studies in Consanguineous Populations for the Characterization of Novel Human Gene Functions. Annals of Human Genetics, 80: 187–196.

 

PS: Whilst the media is mostly responsible for portraying consanguinity the way they understand (and with more contrast added on of course), they could be forgiven as the genetic effects of consanguinity is not fully understood amongst geneticists either, especially in the field of complex trait genetics – thus the extra incentive for studying them.

Read Full Post »

Mesut Erzurumluoglu
My Poster in front of David Wilson Library, University of Leicester, UK

The above lab photo was printed on a large billboard just in front of the David Wilson Library (during the refurbishment/renovations) at the University of Leicester (UoL) in 2011, and then in 2013. The photo was also used in the Biological Sciences sections of the 2011/12, 2012/13 and 2013/14 UoL undergraduate prospectuses – although I was a PhD student at the University of Bristol since January 2012. I was in the third (out of four) year of my PhD course but was still in the UoL prospectus as an ‘undergraduate’ 🙂

Me in University of Leicester Prospectus 2012/13

The university also included my views in the online version of the University of Leicester Biological Sciences prospectus and in a ‘Time management’ lecture:

Univ. of Leicester - Biological Sciences webpage
Univ. of Leicester – Biological Sciences webpage
Time management lecture
Apocryphal quote attributed to me: “It wasn’t that challenging, if you’re organised” was used in a ‘Time Management’ lecture (2016) given at the University of Leicester (by Dr. Alex Patel -see her comment below). Photo by Yasemin Alpdogan.

Read Full Post »