• Depression is the most common mental health condition in the world, so understanding it has major consequences for a large number of people.
  • The intricacies of the condition are becoming clearer because of the development of wider surveys, granular scales and more rigorous analysis.
  • Scientists now have a better idea of what people experience and how their condition progresses over time.
  • Our World in Data's Saloni Dattani looks at how our understanding of depression has changed.

People often think of depression as a single, uniform condition – deep sadness and a loss of interest in the activities that someone usually enjoys. But depression is complicated and it’s difficult to define what it means in an objective way.

This is because depression is a condition of the mind: it is diagnosed based on people’s psychological symptoms and behavior, not from a brain scan or markers in their blood or DNA.

When we diagnose depression, we have to rely on people to recall their own symptoms. We have to trust that doctors will observe and probe their symptoms reliably. We have to analyze this information to understand what people with depression experience that other people don’t.

Our ability to do all of these things accurately has changed over time, and so has our understanding of depression.

This comes down to three factors.

First, many countries now screen for depression in the general population, not just in a subset of people who are seen by a small number of doctors. In many studies, researchers track patients over long periods of time to understand how the condition progresses.

Second, we use questionnaires and interviews that treat depression as a condition that can occur at different levels and change with time.

Third, we have better statistical tools to help us refine questionnaires and understand how symptoms are related to one another.

Let’s look at each of these factors in more detail before we explore how they have informed our understanding of depression.

  • Surveying depression in the general population
  • Measuring depression on levels
  • Analyzing depression with more rigour
  • Sleeping problems and low energy tend to be the most common symptoms of depression
  • People experience different combinations of symptoms
  • Depression is an umbrella condition which contains subtypes
  • The symptoms of depression tend to change over time
  • Episodes of depression can occur frequently
  • Episodes of depression can last a long time
  • Depression is complicated

Surveying depression in the general population

Depression traces back to a condition called ‘melancholia.’ The concept of melancholia itself shifted in meaning several times in history. In ancient Greek medicine, it referred to a general condition of sadness and fear. From the 16th century, it was generally considered a type of insanity, and symptoms such as delusions and suspicion became more of a focus in these descriptions. Some people with these symptoms would likely be diagnosed with schizophrenia today.

From the late 18th century onwards, these symptoms became less emphasized, while fatigue and distress became more central to the diagnosis. Over the same time, the word depression was increasingly used in descriptions of melancholia. Since the 20th century, melancholia has been the name given to a severe subtype of depression.1

In the 19th century in Britain, for example, the diagnosis of melancholia was mainly used to decide who to admit to asylums. It was diagnosed based on the judgment of individual physicians who used different methods, and many asylums used broad definitions of suicidality. Talking about death, drinking too much alcohol, refusing food, having thoughts of guilt or damnation, having a fear of persecution, and any kind of self-harm could all be considered suicidal tendencies.2

Different asylums in Britain used different classification systems. This meant that statistics were difficult to compare between regions and were controversial, even at the time. Asylums would only record the overall number of patients with each condition – they wouldn’t record the symptoms of individual patients.3 In contrast, in Imperial Germany, detailed information about asylum patients was collected using census cards.4

Some psychiatrists such as Emil Kraepelin and Philippe Pinel monitored people in asylums in a systematic way. They noted which symptoms they had and how their illness changed over long periods of time. With that information, they designed systems to classify people with disorders.5

For example, Kraepelin noticed that some people with psychosis also had periods of depression, while others did not. He called the former condition ‘manic depression’ (which we would now understand as bipolar disorder or depression) and the latter ‘dementia praecox’ (which we now understand as schizophrenia).

These kinds of classifications began to be applied at large in asylums across Europe.

At the same time, psychologists began to devise questionnaires to measure people’s symptoms empirically. They developed various scales and tested them with college students before applying them to adults in the general population. Large organizations such as the American Psychological Association developed criteria that could be used to diagnose patients in a standardized way.

Depression is a condition that is increasingly recognized and surveyed in the population as a whole.

Now, we collect data on depression from two sources. First, we have data on diagnoses made by doctors. In many countries, doctors inquire about people’s symptoms and how much they correspond to the criteria in the Diagnostic and Statistical Manual of Mental Disorders (DSM). They also use tests to rule out medical conditions, such as thyroid disorders, that result in similar symptoms. Second, we have data on the severity of depression. This data is collected from patients and the general population, using many different questionnaires and rating scales.6 But there are still gaps in our knowledge. Data is lacking especially in many poorer places around the world. Even within high-income countries, people with poorer health or severe depression are much less likely to respond to these community surveys or report their symptoms accurately. If we didn’t take this into account, we would underestimate the prevalence of depression in the population.7

Measuring depression on levels

The second reason that we understand depression more accurately now is that we ask more nuanced questions about symptoms.

Depression as measured by Kraepelin focused only on whether symptoms were present or absent. But the measures that are used for screening or diagnosis today go much further.8 Now, people are asked about how frequent or severe their symptoms are.

For example, if someone has trouble sleeping, how many days does that occur each week? If they feel guilt, how often do they feel guilty? How guilty do they feel?

Do they often think about things they did wrong a long time ago? Do they blame themselves for having depression, seeing it as punishment? Do they hear voices accusing them or see hallucinations that threaten them?

We can use more specific questions like this to rate each symptom on levels that relate to how frequent or severe they are. We can place possible answers on a range from, say, 0 to 3. A person’s scores across all the symptoms can then be added up to give a total, and we can use cut-offs to classify an episode as ‘mild’, ‘moderate’ or ‘severe’ depression.

This was a huge step forward. It meant we could find out if patients with mild depression tended to be different from those with severe depression. We could also find out if they responded differently to treatment.

Most importantly though, these ratings have allowed us to record subtle changes over time. Many of these scales were developed as antidepressants were discovered and psychotherapy became more widely used, and the scales were used to track how patients improved while on these treatments. With a crude diagnosis, it would be difficult to detect subtle improvements in patients.9

What this means is that the way we conceptualize depression today is not as a fixed condition, but as something that can occur at varying degrees and that can potentially resolve.

People might experience symptoms on some days but not others, and they might have symptoms that become milder with time. The questionnaires we use probe symptoms in a way that measures how they might change, and how they might be treatable.

Analyzing depression with more rigour

We’ve seen how depression has been screened in the wider population and how it is measured in a more nuanced way.

But how would we find out whether these questionnaires were actually measuring depression? How would we know that people’s symptoms were being interpreted in a consistent way?

In the 19th and 20th centuries, statisticians began to develop techniques to measure many aspects of questionnaires, so they could refine them. They were interested in understanding how consistently the questions measured the same underlying concept.

They were also concerned with testing how people’s responses on a questionnaire reflected their behavior in the real world. They could do this, for example, by estimating how their scores on a depression questionnaire correlated with their ability to work or study.

They began to put more focus into understanding how much we could rely on the judgment of individual doctors, instead of treating them as infallible experts.

This was important because patients were included in clinical trials based on their scores on these questionnaires. If doctors had wildly different methods of scoring patients on the questionnaires, it would be very hard to compare patients with each other. It would also be hard to compare patients’ scores at the start of a trial to their scores at the end.

Finally, they were interested in understanding how the symptoms of depression related to each other, and whether it was appropriate to simply add up people’s scores into a total. In the 20th century, statisticians developed methods such as factor analysis and item response theory that could be used to dig into these questions.10

These issues are explained in more detail in the box below. Until recently, it was difficult to apply some of these techniques in datasets that had a large number of people or questions because they required a lot of computational power.11

With all of that in mind, what do we know about depression now?

What are the challenges of measuring depression?

Content validity: We might want to know whether the questionnaires were measuring depression at all. How would we be sure of this?

We could first think about which emotions and behaviors we associate with depression – sadness, tiredness, guilt, a loss of energy, and so on – and we could make sure that the questionnaires ask about all of these emotions.

Criterion validity: We could also find out if peoples’ responses to the questions are correlated with other things we associate with depression.

For example, if someone is severely depressed, we might expect that they are also less sociable and that they perform poorly at work or school. We could test whether this is the case for people who scored high on a questionnaire for depression.

Discriminant validity: We could also test whether their responses are uncorrelated with things we don’t associate with depression, such as their height or intelligence.

Internal consistency: Another important thing we’d want to know is whether the questions were actually tapping into the same phenomenon.

Do all the questions measure depression or are some measuring other concepts instead? To test for this, we could look at how closely people’s answers to different questions align with each other.

One consequence of this focus is that scores shouldn’t also measure other concepts.

For example, some people have phases of depression, but also have phases of mania at other times (where they might feel highly irritable or excited). Some of these symptoms seem to be opposites of depression. So, these symptoms should be scored as part of separate concepts, such as a manic phase in bipolar disorder, if we wanted a scoring system with high internal consistency.

Inter-rater reliability: If multiple doctors could not come to agreement about how depressed patients were, despite using the same questionnaire, it would hardly be useful. It would make it very difficult to compare patients with each other, especially because many of them would only be seen by a single doctor before they received treatment.

Fortunately, there are straightforward ways to test this: we could measure how consistent different doctors were at diagnosing the same patients.

Latent trait measurement: Apart from the issues above, we’d also want to find out what the scores on a questionnaire tell us about the severity of depression.

For example, we might wonder if it is appropriate to simply add up the ratings across all the questions to get a total score. And we might wonder how the questions are related to each other.

Latent class analysis: On the other side of the questions are the people who answer them. We might wonder if we can classify people who respond to the questionnaires into clusters based on how they respond. How different are these clusters, and do they matter? Answering these questions is the aim of latent class analysis, which was developed in the middle of the 20th century.

Sleeping problems and low energy tend to be the most common symptoms of depression

First of all, we know that some of the symptoms of depression are common in the population, even among those who do not suffer from depression.

You can see this in the chart below. It lists symptoms of depression and shows responses from the general US population. The questions come from the PHQ-9 (the patient health questionnaire), which is a popular rating scale for depression that is used around the world and covers all the symptoms in the DSM diagnostic criteria for major depression. 12

This questionnaire asks people about how often they have experienced symptoms of depression in the last two weeks. Half of the respondents (51%) had low energy, and more than one-third (37%) had trouble sleeping.13 Sleeping problems and low energy are also some of the most common symptoms in people who are diagnosed with moderate or severe depression.

a chart showing results of a questionnaire the different depressive symptoms across the total US population
Over half of respondents in this questionnaire experienced the depressive symptom of low energy.
Image: Our World in Data

People experience different combinations of symptoms

Even though some symptoms are common, depression is complicated. What makes it complicated is not the prevalence of individual symptoms, but rather the differences in the combinations of symptoms that patients experience. We see these combinations in the chart below, using symptoms in the Quick Inventory of Depressive Symptoms (QIDS-16), another questionnaire that covers all of the symptoms of depression in the DSM.14

this chart shows how people with depression experience different combinations of symptoms
Depression is complicated and can present as a combination of symptoms.
Image: Our World in Data

On the horizontal bars, you can see which symptoms patients have. In this chart, symptoms are marked as present if people experienced them to a moderate or severe degree.

The first thing to notice in this chart is that patients can experience many combinations of symptoms. What’s surprising is how uncommon each combination is. The most common combination is experienced by less than 2% of patients. The combinations of people’s symptoms vary widely.

The second thing to notice is that the most widely reported combination is, in fact, no symptoms at all. This is shown as ‘none’ in the chart. Many patients were diagnosed with moderate or severe depression but didn’t report having any of the individual symptoms to a moderate or severe degree. Instead, patients with the profile of “none” had all of the symptoms to a mild degree. This gave them a total score that was high, and meant they would still be considered to have moderate or even severe depression.

Think about the way that depression questionnaires are scored: each item is rated on a scale and the ratings across all the scales are added together.

This means that people can end up with a high total by rating many symptoms at a ‘mild’ level, or by rating some symptoms ‘severe’ and others ‘absent’, or something in between.

You might wonder if we should treat these different possibilities as equivalent to each other.

Is this a bug or a feature of questionnaires?

If it was a feature, it would imply that when people are mildly depressed, they would probably only experience a few symptoms, such as those that are common, like insomnia. Meanwhile, when people are more severely depressed, they would be more likely to experience more symptoms, including ones that are uncommon, like self-blame and suicidal thoughts.

In the 20th century, statisticians such as Georg Rasch, Jane Loevinger and Robert J Mokken developed models that could be used to test this assumption.

These models test whether people who have uncommon symptoms also have more symptoms overall. They also test whether the scores assigned to each level are appropriate. These assumptions seem to be valid for some questionnaires, including the PHQ-9 and the Hamilton Rating Scale for Depression.15 This means that adding up people’s scores on these questionnaires is an appropriate way to understand how severely they are depressed.

Depression is an umbrella condition which contains subtypes

We can find out how people’s responses to questionnaires are related to each other through a method called factor analysis.

Using this method, researchers have usually found that some symptoms are more likely to occur together. In other words, the symptoms are part of groups – they reflect underlying traits called ‘latent factors.’ The latent factors that researchers have usually found are sadness, guilt, changes in sleep and appetite, and despair about the future.16

These latent factors also correlate with each other to a moderate degree, and researchers often find an underlying “general dimension” of depression.

If you knew where someone placed on this general dimension, you would be able to roughly predict which symptoms they had and how many symptoms they had. But their specific symptoms would still be informative and important to understand.

To investigate this more, some researchers use a method called “latent class analysis.” What this does is it tries to identify whether people can be classified into subtypes based on how they respond to questionnaires.

Plenty of studies have used latent class analysis to understand whether there are subtypes of depression. But different studies have identified very different numbers of subtypes, which might be because they analyzed different sets of questions.

Some subtypes relate to how severely people experience depression, while others relate to which symptoms they have. But it is unclear what they mean, how valid some of them are, and how well they have been measured.17 For example, one subtype is called atypical depression, which usually refers to patients who sleep excessively and gain weight.

According to the DSM 5 criteria, people can only be diagnosed with atypical depression if they have ‘mood reactivity’, which refers to when someone can respond positively to good things that happen in their life.

This contrasts with another subtype of depression, called melancholic depression, where people have more severe sadness and find it difficult to feel happy even during these circumstances.

But there is lots of evidence that goes against the idea that mood reactivity is important for identifying people with atypical depression. In other words, people with symptoms of typical or atypical depression are equally likely to have mood reactivity.18

The different subtypes of depression are not stable over time. Studies that follow patients for long periods of time find that those who have multiple episodes of depression tend to fluctuate between having some episodes that are considered typical and others that are considered atypical or melancholic.19

The symptoms of depression tend to change over time

This brings us to another important fact about depression: the symptoms change over time. This is true for which symptoms people report as well as how severe they are.

charts showing that depression scores tend to decline over time
The symptoms of depression change over time.
Image: Our World in Data

You can see this in the chart, among people who are diagnosed with depression. Patients’ total scores declined over time after they were diagnosed. The curves of depression scores shift towards the left when patients are surveyed months after the initial questionnaire. This effect is found across many different questionnaires, as shown.

One explanation for this is that their symptoms become milder over time. For example, patients are likely to seek help when they have very severe symptoms, which would become milder later on. This is known as ‘regression to the mean.’ Another explanation is that there is a change in the combination of symptoms they report.20

Episodes of depression can occur frequently

Some people have episodes of depression frequently. We know this because researchers have regularly interviewed the same people over many years. This gives us more accurate results than relying on them to remember which symptoms they had in the past. One study tracked people’s symptoms over a period of more than thirteen years.21 It took the average person two to three years after their first depressive episode to have a year without depression. This measure – of how long it takes until the average person has a full year without depression – is known as the ‘median time to recovery’.

a chart showing how long is takes people to recover from depression
In this study, it took the average person two to three years after their first depressive episode to have a year without depression.
Image: Our World in Data

People’s experiences of depression varied widely. Half of the patients recovered within the same year. In contrast, a small proportion of patients (15%) did not recover even more than 20 years later; they had depression in every year of the study. You can see this in the chart.

a chart showing the share of people who have a recurrent episode of depression
People's experiences of depression recovery varied widely.
Image: Our World in Data

Even among those that recovered, almost half (45%) had another episode of depression during the period of the study.

What all of this tells us is that the condition can progress in different ways.

Some people experience episodes very infrequently: possibly only a single episode followed by years or decades without any recurrences. Others have episodes frequently.

Episodes of depression can last a long time

Apart from having episodes of depression frequently, some people also have episodes that last for years.

The DSM criteria requires patients to have symptoms of depression for at least two weeks before they can be diagnosed, but an episode of depression lasts around three months on average. 22 There is a lot of variation between people. When people have episodes that last over two years, their condition is labeled chronic depression.

a chart showing the age of onset of mental health disorders
There is a great deal of variation shown here.
Image: Our World in Data

Is there something different about these patients?

People who have chronic depression tend to be different from the average patient in several ways. They are more likely to also have other conditions, such as anxiety disorder, and more likely to have a family history of mental illness. They are also more likely to have had depression for the first time when they were young, typically under the age of 25.23

This is another way that depression is complex: it occurs at widely different times in people’s lives. The most likely age when people develop depression for the first time (the modal age of onset) is around 19. 24 But most patients develop it much later than this. In this meta-analysis, the median age of onset was around 31, while a quarter of people developed depression for the first time when they were older than 46 years old. There are large differences in the age of onset, as you can see in the chart.

Depression is complicated

How do we pull this knowledge together to understand depression as a whole?

One simple way to think about depression is to imagine episodes as a series of waves (shown below), where each episode lasts for weeks or months, before the episode ends.

a chart showing the cycles and stages of depression
The cycles and stages of depression.
Image: Our World in Data

For some people, this will be their only experience of depression for years, possibly ever. After a single wave of symptoms, they will have recovered.

For others, even after an episode of severe depression, they might still have a high baseline level of symptoms. Their level of “normalcy” might still be mild depression.

In this way, we can think of people’s symptoms getting progressively worse during an episode, improving, and potentially worsening again, resulting in a relapse.

Episodes of depression can last different lengths of time. Some people will have episodes that are mild, while others will have episodes that are severe. Many will have a combination of mild and severe episodes. Some will experience many episodes of depression, while others will only experience one episode or none at all.

What is the World Economic Forum doing about mental health?

One in four people will experience mental illness in their lives, costing the global economy an estimated $6 trillion by 2030.

Mental ill-health is the leading cause of disability and poor life outcomes in young people aged 10–24 years, contributing up to 45% of the overall burden of disease in this age-group. Yet globally, young people have the worst access to youth mental health care within the lifespan and across all the stages of illness (particularly during the early stages).

In response, the Forum has launched a global dialogue series to discuss the ideas, tools and architecture in which public and private stakeholders can build an ecosystem for health promotion and disease management on mental health.

One of the current key priorities is to support global efforts toward mental health outcomes - promoting key recommendations toward achieving the global targets on mental health, such as the WHO Knowledge-Action-Portal and the Countdown Global Mental Health

Read more about the work of our Platform for Shaping the Future of Health and Healthcare, and contact us to get involved.

Conclusion

Depression is the most common mental health condition in the world, so the intricacies of the condition have important consequences for a large number of people. Our understanding of depression influences how we diagnose the condition. It influences how we manage and treat it, and how we track changes in its levels over time.

There is a lot we already know about depression, but there is still a lot that is less clear.

Depression is a condition of the mind, which makes it difficult to study. We still have to rely on people to report their symptoms accurately. Studies have suggested there are many subtypes of depression, but we have limited information on how valid they all are and how to define them.

Depression is a condition of the mind, but that doesn’t mean it’s impossible to understand. With wider surveys, granular scales and more rigorous analysis, we’re uncovering just how complex this condition is. We’re able to find out how the symptoms are related to each other and how common they are.

We now know that people with depression experience different symptoms, with different levels of severity, at different times in their lives, and have episodes lasting different lengths of time. People’s symptoms vary and change over time and we can now measure how that happens.

In the following posts, we’ll explore more of these topics in detail. We’ll see how common depression is and how it varies between people.

We’ll look into antidepressant medication: how it is prescribed, how many people use it, and how effective it is. Finally, we’ll find out what we don’t know about depression and antidepressants, and why it has been difficult to answer many important questions about this condition that affects so many people worldwide.

Endnotes

1. Wallace, E. R., & Gach, J. (Eds.). (2008). History of psychiatry and medical psychology: With an epilogue on psychiatry and the mind-body relation. Springer.Telles-Correia, D., & Marques, J. G. (2015). Melancholia before the twentieth century: fear and sorrow or partial insanity?. Frontiers in Psychology, 6, 81. https://doi.org/10.3389/fpsyg.2015.00081

2. Jansson, Å. (2021). Statistics, Classification, and the Standardisation of Melancholia. In Å. Jansson, From Melancholia to Depression (pp. 123–171). Springer International Publishing. https://doi.org/10.1007/978-3-030-54802-5_5

See also: Shepherd, A. (2015). Institutionalizing the insane in nineteenth-century England. Routledge.

York, S. H. (2010). Suicide, lunacy and the asylum in nineteenth-century England (Doctoral dissertation, University of Birmingham).

3. McPherson, S., & Armstrong, D. (2021). Psychometric origins of depression. History of the Human Sciences, 095269512110090. https://doi.org/10.1177/09526951211009085

Jansson, Å. (2021). Statistics, Classification, and the Standardisation of Melancholia. In Å. Jansson, From Melancholia to Depression (pp. 123–171). Springer International Publishing. https://doi.org/10.1007/978-3-030-54802-5_5

See also: Shepherd, A. (2015). Institutionalizing the insane in nineteenth-century England. Routledge.York, S. H. (2010). Suicide, lunacy and the asylum in nineteenth-century England (Doctoral dissertation, University of Birmingham).

4. Engstrom, E. J. (2018). Clinical psychiatry in imperial Germany. Cornell University Press.

5. Andrews, J. (1998). Case notes, case histories, and the patient’s experience of insanity at Gartnavel Royal Asylum, Glasgow, in the nineteenth century. Social History of Medicine, 11(2), 255-281.

6. Data from questionnaires and rating scales are used for many purposes. One use is to guide treatment: some guidelines give different recommendations about how patients should be treated depending on how severely they are affected by depression. Another use is to measure levels of depression, which can be used in trials to understand how patients’ symptoms have changed over time.

7. Mars, B., Cornish, R., Heron, J., Boyd, A., Crane, C., Hawton, K., Lewis, G., Tilling, K., Macleod, J., & Gunnell, D. (2016). Using Data Linkage to Investigate Inconsistent Reporting of Self-Harm and Questionnaire Non-Response. Archives of Suicide Research, 20(2), 113–141. https://doi.org/10.1080/13811118.2015.1033121

Jousilahti, P. (2005). Total and cause specific mortality among participants and non-participants of population based health surveys: A comprehensive follow up of 54 372 Finnish men and women. Journal of Epidemiology & Community Health, 59(4), 310–315. https://doi.org/10.1136/jech.2004.024349Williams, D., & Brick, J. M. (2018). Trends in U.S. Face-To-Face Household Survey Nonresponse and Level of Effort. Journal of Survey Statistics and Methodology, 6(2), 186–211. https://doi.org/10.1093/jssam/smx019

8. People can rate themselves on screening tools, or have a non-professional ask them about their symptoms. These are usually questionnaires. In contrast, diagnostic tools are only used by medical professionals, who tend to use structured interviews to rate people on the symptoms of depression. Screening is usually used as a guide to determine who would benefit from a diagnosis.

9. McPherson, S., & Armstrong, D. (2021). Psychometric origins of depression. History of the Human Sciences, 095269512110090. https://doi.org/10.1177/09526951211009085

10. Price, P., Chiang, I. C. A., & Jhangiani, R. (2014). Research Methods in Psychology. Chapter 5: Psychological measurement. BCcampus, BC Open Textbook Project. https://opentextbc.ca/researchmethods/chapter/reliability-and-validity-of-measurement/

McPherson, S., & Armstrong, D. (2021). Psychometric origins of depression. History of the Human Sciences, 095269512110090. https://doi.org/10.1177/09526951211009085

Bech, P. (2012). Clinical psychometrics (1. ed). Wiley-Blackwell.

11. Bartholomew, D. J. (1995). Spearman and the origin and development of factor analysis. British Journal of Mathematical and Statistical Psychology, 48(2), 211–220. https://doi.org/10.1111/j.2044-8317.1995.tb01060.x

12. Manea, L., Gilbody, S., & McMillan, D. (2015). A diagnostic meta-analysis of the Patient Health Questionnaire-9 (PHQ-9) algorithm scoring method as a screen for depression. General Hospital Psychiatry, 37(1), 67–75. https://doi.org/10.1016/j.genhosppsych.2014.09.009

13. Tomitaka, S., Kawasaki, Y., Ide, K., Akutagawa, M., Yamada, H., Ono, Y., & Furukawa, T. A. (2018). Distributional patterns of item responses and total scores on the PHQ-9 in the general population: Data from the National Health and Nutrition Examination Survey. BMC Psychiatry, 18(1), 108. https://doi.org/10.1186/s12888-018-1696-9

14. Unlike this PHQ-9, the QIDS-16 questionnaire asks questions related to more specific symptoms of depression (for sleeping problems and movement). For example, while the PHQ-9 lumps together sleeping problems, the QIDS-16 has separate questions related to insomnia and hypersomnia.

15. Stochl, J., Fried, E. I., Fritz, J., Croudace, T. J., Russo, D. A., Knight, C., Jones, P. B., & Perez, J. (2020). On Dimensionality, Measurement Invariance, and Suitability of Sum Scores for the PHQ-9 and the GAD-7. Assessment, 107319112097686. https://doi.org/10.1177/1073191120976863

Gelaye, B., Williams, M. A., Lemma, S., Deyessa, N., Bahretibeb, Y., Shibre, T., Wondimagegn, D., Lemenhe, A., Fann, J. R., Vander Stoep, A., & Andrew Zhou, X.-H. (2013). Validity of the patient health questionnaire-9 for depression screening and diagnosis in East Africa. Psychiatry Research, 210(2), 653–661. https://doi.org/10.1016/j.psychres.2013.07.015

Carrozzino, D., Patierno, C., Fava, G. A., & Guidi, J. (2020). The Hamilton Rating Scales for Depression: A Critical Review of Clinimetric Properties of Different Versions. Psychotherapy and Psychosomatics, 89(3), 133–150. https://doi.org/10.1159/000506879.

16. Shafer, A. B. (2006). Meta‐analysis of the factor structures of four depression questionnaires: Beck, CES‐D, Hamilton, and Zung. Journal of clinical psychology, 62(1), 123-146.

17. Ulbricht, C. M., Chrysanthopoulou, S. A., Levin, L., & Lapane, K. L. (2018). The use of latent class analysis for identifying subtypes of depression: A systematic review. Psychiatry Research, 266, 228–246. https://doi.org/10.1016/j.psychres.2018.03.003

Harald, B., & Gordon, P. (2012). Meta-review of depressive subtyping models. Journal of Affective Disorders, 139(2), 126–140. https://doi.org/10.1016/j.jad.2011.07.015

18. Thase, M. E. (2009). Atypical Depression: Useful Concept, but it’s Time to Revise the DSM-IV Criteria. Neuropsychopharmacology, 34(13), 2633–2641. https://doi.org/10.1038/npp.2009.100

19. Łojko, D., & Rybakowski, J. (2017). Atypical depression: Current perspectives. Neuropsychiatric Disease and Treatment, Volume 13, 2447–2456. https://doi.org/10.2147/NDT.S147317

20. Fried, E. I., van Borkulo, C. D., Epskamp, S., Schoevers, R. A., Tuerlinckx, F., & Borsboom, D. (2016). Measuring depression over time . . . Or not? Lack of unidimensionality and longitudinal measurement invariance in four common rating scales of depression. Psychological Assessment, 28(11), 1354–1367. https://doi.org/10.1037/pas0000275

21. Eaton, W. W., Shao, H., Nestadt, G., Lee, B. H., Bienvenu, O. J., & Zandi, P. (2008). Population-Based Study of First Onset and Chronicity in Major Depressive Disorder. Archives of General Psychiatry, 65(5), 513. https://doi.org/10.1001/archpsyc.65.5.513

22. Eaton, W. W., Shao, H., Nestadt, G., Lee, B. H., Bienvenu, O. J., & Zandi, P. (2008). Population-Based Study of First Onset and Chronicity in Major Depressive Disorder. Archives of General Psychiatry, 65(5), 513. https://doi.org/10.1001/archpsyc.65.5.513

23. Hölzel, L., Härter, M., Reese, C., & Kriston, L. (2011). Risk factors for chronic depression—A systematic review. Journal of Affective Disorders, 129(1–3), 1–13. https://doi.org/10.1016/j.jad.2010.03.025

24. Solmi, M., Radua, J., Olivola, M., Croce, E., Soardo, L., Salazar de Pablo, G., Il Shin, J., Kirkbride, J. B., Jones, P., Kim, J. H., Kim, J. Y., Carvalho, A. F., Seeman, M. V., Correll, C. U., & Fusar-Poli, P. (2021). Age at onset of mental disorders worldwide: Large-scale meta-analysis of 192 epidemiological studies. Molecular Psychiatry. https://doi.org/10.1038/s41380-021-01161-7

25. Bockting, C. L., Hollon, S. D., Jarrett, R. B., Kuyken, W., & Dobson, K. (2015). A lifetime approach to major depressive disorder: the contributions of psychological interventions in preventing relapse and recurrence. Clinical psychology review, 41, 16-26.