Skip to main content
Open AccessOriginal Article

Depression, Race-Based Bias, and Judgments of Veracity

Published Online:https://doi.org/10.1024/2673-8627/a000040

Abstract

Abstract:Introduction: The depressive realism hypothesis suggests that depressed individuals have a more realistic perception than nondepressed. Most studies depict the effects of depressive realism on self-perceptions. However, some lie detection studies suggest the effects of depressive realism on the perception of others. Our study investigated the hypothesis that individuals with subclinical depression levels (dysphoric) show a heightened accuracy in lie detection and a lower truth bias. Furthermore, we expected these effects to be stronger in positive statements. Moreover, we expected that nondysphoric but not dysphoric individuals would show a heightened truth bias for Black targets. We also expected to find the effects of depressive realism in judgmental confidence and in self-evaluations of performance. Methods: 472 participants classified 16 video statements as truth or lie. We tested all hypotheses using three depression measures: IPIP-300 Depression Subscale, the PHQ-9, and the CES-D. Results: In contrast to our hypothesis, we found no heightened accuracy in the veracity judgments of dysphoric individuals. Truth bias was higher in dysphoric participants than in nondysphoric. There was an interaction of valence with PHQ-9 but not with IPIP or CES-D. Nondysphoric but not dysphoric participants had a higher truth bias for Black targets. Furthermore, dysphoric individuals had lower judgmental confidence and lower but not more accurate self-evaluation values. Conclusion: We discuss the results and future directions.

Introduction

The classical view of depression suggests that depressed individuals have distorted views of themselves, the world, and the future (Beck, 1963, 1967). Especially negative self-view has been studied extensively, providing evidence for distorted self-views (Alloy & Ahrens, 1987; Beck, 1967; Gara et al., 1993; Oliver & McGee, 1982). However, some researchers propose that negative self-view might not be caused by distortion but rather by a more realistic perception of own abilities, suggesting that positively biased views of the self are related to mental health (Alloy & Abramson, 1979).

The Depressive Realism Hypothesis

Alloy and Abramson (1979) investigated differences in the judgment of contingency tasks in which the participants judge contingencies between the pressing of a button and the appearance of a light. They found evidence for their suggestion that depressed individuals do not underestimate the contingencies, as Seligman (1972) had previously proposed; rather, they have a more realistic perception than nondepressed individuals, who show an illusion of control (Blanco et al., 2011; Langer, 1975; Matute 1995; Yarritu et al., 2014), which the authors proposed as functioning as a self-serving bias. However, not all studies investigating the effect of depressive realism could replicate the results (Venkatesh et al., 2018; see also Moore & Fresco, 2012). In their meta-analytic review, Moore and Fresco found a small aggregated effect size, suggesting that the effect may occur more often in lower levels of depression (see also Ruehlman et al., 1985; Soderstrom et al., 2011). They also found the effect to be influenced by method factors, such as the utilization of self-report measures to assess depression, which increased the probability of finding depressive realism effects.

Besides contingency-judgment tasks, we investigated the effects of depressive realism on several other tasks, one of which was the self-evaluation of task performance, comparing task performance to self-evaluations. Gotlib and Meltzer (1987) found the effects of depressive realism in the self-evaluation of social competence by having participants rate their social competence in a previous interaction with another individual. Dennard and Hokanson (1986) also found less positively biased self-evaluations of performance in cognitive tasks for dysphoric individuals. A more recent study by Fu et al. (2012) found the effects of depressive realism in the self-evaluation of task performance of dysphoric individuals.

Deception Detection

In their meta-analysis, Bond and DePaulo (2006) summarized the results of previous studies about the ability of people to detect lies, which proved to be close to chance. The reasons for the poor ability to detect lies include people’s beliefs in the importance of clues that are mostly irrelevant to the detection of deception (DePaulo et al., 2003; Vrij, 2008) and the missing actual reliable clues of deception (DePaulo et al., 2003; Hartwig & Bond, 2011).

A common phenomenon in lie detection is truth bias, which describes the phenomenon that people tend to believe a message is truthful more often than deceitful (DePaulo et al., 1997). This effect aligns with Vrij’s (2008) suggestion that people do not want to believe that they are being lied to, and that it is not always in their best interest to detect lies. Researchers explain the phenomenon of truth bias with the rarity of lies and reasons to tell lies in everyday life (O’Sullivan et al., 1988; Vrij & Baxter, 1999).

Dysphoric Individuals’ Ability to Detect Lies

Lane and DePaulo (1999) reported evidence for the assumption that individuals suffering from low or moderate levels of depression (also called dysphoric) show greater accuracy than nondysphoric individuals at detecting false remarks meant to encourage a social counterpart, though there is no overall heightened accuracy among depressed individuals. This suggests that the accuracy depressed individuals show in detecting lies may be influenced by the emotional valence of the statement, specifically a heightened accuracy for positive statements.

Weightman et al. (2014) summarized previous investigations into the relationship between depression and the interpretation of emotional stimuli and concluded that the social cognition of depressed individuals might be influenced by a mood-congruent bias, suggesting a heightened accuracy at identifying negative emotions and a reduced accuracy for positive ones. In the lie-detection task, this could mean that depressed individuals might be more ready to believe in negative than in positive statements.

However, other studies indirectly suggest that depressed individuals might generally be more accurate in detecting lies. Harkness et al. (2005) found that dysphoric individuals were more accurate in attributing emotional states to photographs of eyes, suggesting a heightened sensitivity to social clues. More recent studies investigated the effect of mood on lie detection abilities (Forgas & East, 2008; Reinhard & Schwarz, 2012) and induced positive or negative moods in the participants. The results of those studies suggest that individuals in a negative mood show a more accurate lie-detection ability than individuals in a positive mood.

Prejudice-Related Concerns

The research field of lie detection delivers further opportunities to detect the effects of depressive realism in the form of self-serving biases in social interactions. Lloyd et al. (2017) investigated race-based biases in detecting lies. In six studies, they recruited White and Black participants to rate the veracity of statements by White and Black targets. The participants rated statements by Black targets more often as truth than they did for White targets. This effect was moderated by an interaction between internal and external motivation to respond without prejudice, measured with Plant and Devine’s (1998) Internal (IMS) and External Motivation (EMS) to Respond Without Prejudice. From the results of their six studies, Lloyd et al. concluded that individuals who were primarily internally motivated to respond without prejudice show the highest truth bias toward Black individuals. Plant and Devine suggested that people internally motivated to regulate racial prejudice “are primarily concerned with living up to their personally important, self-defining egalitarian standards” (Plant et al., 2010, p. 1136). The relationship between the internal motivation to respond without prejudice and the heightened truth bias for Black targets suggests that the heightened truth bias for Black targets might have a self-view protecting function parallel to the self-serving positivity bias found in the depressive realism literature.

Goal of This Study

Research supporting the depressive realism hypothesis shows that, unlike nondepressed individuals, depressed individuals do not exhibit a self-serving positivity bias. In light of the literature presented, we deduce that dysphoric individuals might be more accurate in classifying lies and truths than nondepressed counterparts. Therefore, our first hypothesis suggests a heightened accuracy in classifying lies and truths among dysphoric individuals (accuracy hypothesis).

Furthermore, if we can explain the truth bias as a self-serving bias like Vrij’s (2008) explanation for the generally low lie-detection ability suggests, we can also assume that dysphoric individuals do not show as much truth bias as nondysphorics. We therefore also assume that dysphoric individuals show a lower truth bias than nondysphorics (truth bias hypothesis).

Lane and DePaulo (1999) found evidence for a heightened accuracy in dysphoric individuals only for statements meant to encourage a conversational counterpart; thus, heightened accuracy might be stronger for positive statements. To test this assumption, we propose a third hypothesis, which suggests that the difference between the accuracy of dysphoric and nondysphoric individuals at detecting lies and truths is higher for positive statements than for negative statements (statement valence hypothesis).

Lloyd et al. (2017) found the racial-bias effect was moderated by an interaction between the internal and external motivation to respond without prejudice. They found that individuals primarily internally motivated to respond without prejudice show the highest racial-bias effect, which in turn suggests that such bias might serve as protection for a positive self-view. Because dysphoric individuals have an overall negative self-view, we assume that nondysphoric individuals (but not dysphoric individuals) have a higher truth bias for Black targets than for White targets (race-realism hypothesis).

Dysphoric individuals have also been found to evaluate their self-performance more accurately than nondysphoric individuals (Fu et al., 2012). Thus, we predict that dysphoric individuals show a more accurate evaluation of their ability to detect lies (expectancy accuracy hypothesis). Also, because the depressive realism hypothesis suggests a lesser illusion of control in depressed individuals (Alloy & Abramson, 1979), we expect that dysphoric individuals are less confident about their classifications of the statements as truthful or dishonest than nondysphoric individuals (confidence hypothesis).

Methods

Participants

A preregistered power analysis (see https://aspredicted.org/3b4z5.pdf), testing for a correlation between the IPIP depression score and overall detection accuracy given the true effect ρ = .15 and an α level = 0.05 one-tailed, resulted in a minimum of N = 472 participants regarded as necessary to find the assumed effect. Accordingly, the final sample size contained N = 472 participants, who were recruited via the Amazon platform mTurk. The participants had to be currently located in the United States, speak English on a mother-tongue level, approved in more than 100 studies, and have a 95% approval rate on mTurk. In return for their participation, they received $0.60. Finally, of the 701 participants who finished the study, only those were retained who correctly answered a bot/attention check (see Benz, 2023, Supplement), approved the usage of their data, and did not have any technical problems. In addition to that, one person commented on having participated twice, so their second dataset was excluded. Of the remaining 472 participants, 191 were female, 277 male, one participant identified as “other,” and four participants did not specify their gender. The participants were from 18 to 74 years (M = 36.99, SD = 11.425), most of whom (n = 348) identified as White/Caucasian Americans; 83 participants identified as African American/Black, 13 as Asian Americans, 7 as bi- or multiracial, 15 as American/Alaska Native, 3 as Native Hawaiian/Pacific Islander. Three participants did not specify their ethnicity.

Procedure

The online survey took place on the platform Unipark (Globalpark AG, Hürth, Germany). After agreeing to the terms of participation, the participants provided their mTurk worker ID, their gender, age, highest academic degree, and employment status as well as confirmation that they were located in the US and have an English proficiency at a mother-tongue level. Then we performed sound and video functionality checks. We filtered out participants who could not provide the correct color of a flower shown in the video. Thereafter, we instructed the participants to rate one of 20 sets of videos, each consisting of 16 of the overall 320 videos of average 30-second length, taken from the Miami University Deception Detection Base by Lloyd et al. (2018). The videos used consisted of statements made by 80 targets (20 Black females, Black males, 20 White females, and 20 White males). The participants rated each statement either as a truth or a lie, and they had to rate their confidence in their judgment on a scale from 0% to 100%. The targets in the videos consisted of equal numbers of honest and dishonest, positive or negative statements about an acquaintance by equal numbers of Black and White, male and female college students. Next, we assessed the participants’ depression scores using three questionnaires. Because the depression scales used most often have proved to lack symptom overlap (Fried, 2017), we chose to investigate our hypotheses with three different scales of depression to test the robustness of found effects.

The International Personality ItemPool (IPIP)N3: Depression Scale (IPIP) is a subscale of the Neuroticism scale of the IPIP 300 personality questionnaire based on the five-factor model by Costa and McCrae (1985) and consists of 10 items from the English international personality item-pool (Goldberg, 1999); a reliability score of α = .88 is reported. In this study, the subscale reached a reliability score of α = .86. All items are rated on a 5-point Likert scale (1 = strongly disagree – 5 = strongly agree), with three being inverted. In their investigation of disorder-specific predictions of the IPIP, Uliaszek et al. (2009) found the depression subscale of the IPIP to be predictive of a clinical diagnosis of major depression, having their confirmatory factor analysis demonstrating its incremental, convergent and discriminant validity. Because we were assessing a nonclinical population, we chose the IPIP to discriminate between lower levels of nonclinical depression because of its lack of assessments of more severe symptoms (i.e., suicidal thoughts).

The Patient-Health-Questionnaire – 9 (PHQ-9) by Spitzer et al. (1999) is the short version of the PRIME-MD; they report a reliability score of α = .93. All nine items assess symptoms associated with depression for the last 2 weeks and range from 0 (not at all) to 3 (nearly every day). A sum score is calculated, ranging from 0 to 27. The instrument differentiates between the five groups of healthy (0–4), mild depression (5–9), moderate depression (10–14), moderately severe depression (15–19), and severe depression (20 or higher). However, because of the low number of participants who scored 20 or higher, we merged the two highest categories in this study, which we refer to as severe depression.

The Center for Epidemiologic Studies – Depression Scale (CES-D Scale), developed by Radloff (1977), is one of the most widely used depression screening instruments. Twenty items assess statements about the past week, ranging from 0 (rarely or none of the time) to 3 (most or all of the time), with four inverted items. Total scores between 0 and 60 are possible. Most researchers use cutoffs of 16 or 21 points to differentiate between groups of depressed and nondepressed. In this study, lower values of depression were of interest, so we chose a cutoff of 16. In this study, the CES-D reached a reliability score of α = .93. Using these three instruments, we employed a depression scale more suitable to discriminating between lower values of dysphoria, an instrument assessing the nine criteria of the DSM-IV for major depression, and one instrument widely used in depression research. Moreover, we assessed dysphoria once as a continuous variable, once as a dichotomous variable, and once in different discrete severity levels, thus investigating found effects with different operationalizations of the construct.

Next, the participants provided their ethnicity and their political affiliations from Republican to Democratic from left to right on 11-point Likert scales after the question was posed: “Where do you put yourself on these two political spectrums.” Participants then provided their estimate of correctly identified videos ranging from 0 to 16 (Performance Expectancy) and their estimate on whether their lie-detection ability was generally below average, average, or above average (Self-evaluation of Lie-Detection Ability). We then implemented a bot/attention check (see Benz, 2023, Supplement). The participants also had the opportunity to report any technical errors and whether we should use their data. After this, the participants received feedback on their veracity judgment performance. On the next page, the participants could give the researchers feedback. The survey ended with a randomly generated code used for payment reception on mTurk.

For our analyses, we calculated the main criteria of overall detection accuracy as the percentage of correctly classified statements and the truth bias as the percentage of statements rated as truth.

Results

Sample Characteristics

The participants’ Depression scores ranged from 10 to 46 (M = 26.735, SD = 8.241) on the IPIP Depression Subscale (IPIP), from 0 to 27 (M = 9.710, SD = 7.051) on the PHQ-9 and from 0 to 52 (M = 21.787, SD = 12.788) on the CES-D. The distribution of IPIP scores was significantly different from normal and slightly left-skewed (γ = −.197, Kolmogorov-Smirnov: p < .001).

Based on the CES-D, n = 164 participants were categorized as nondepressed and n = 308 as depressed (see Table 1). Scores were significantly different between female (M = 25.89, SD = 8.63) and male participants (M = 27.36, SD = 7.87) neither on the IPIP t(465) = −1.907, p = .057 nor on the PHQ-9 (χ2(3, N = 467) = 5.439, p = .142, V = .108) or the CES-D (χ2(1, N = 467) = 2.305, p = .140, φ = .070).

Table 1 IPIP Depression scores depending on the groups of CES-D and PHQ-9

Judgmental Bias

On average, the participants rated 66% of all videos as truths (SD = 17.06). This value differed significantly from chance (50%) and indicated a truth bias, t(471) = 20.373, p < .001.

Contrary to our Bias Hypothesis, the IPIP correlated positively with truth bias, such that individuals with higher depression scores rated more statements as truth, r = .231, p < .001 (see also Table 2). Moreover, truth bias differed significantly between PHQ-9 groups, F(3, 468) = 18.93, p < .001, η2 = .108, such that groups of more severe depression levels had higher levels of truth bias; see Table 3. Contrast analyses showed a significant difference between the PHQ-9 group of healthy and all other groups (+3, −1, −1, −1) t(468) = −5.738, p < .001, with the healthy group rating less statements as truth, and a significant difference between the group of severely depressed and all other groups (−1, −1, −1, +3), t(468) = 5.686, p < .001, with the severely depressed group rating more statements as truth. Lastly, the CES-D showed a significant difference between groups of nondepressed (M = 59.91, SD = 14.30) and depressed (M = 69.24, SD = 17.53), F(1, 470) = 34.267, p < .001, η2 = .068, such that individuals in the depressed group showed higher levels of truth bias; see Table 4.

Table 2 Correlations of IPIP depression scores with truth bias, accuracy scores, judgment confidence, performance expectancy and expectancy accuracy in detecting lies and truths
Table 3 Means (standard deviations) of truth bias, accuracy scores, judgment confidence (%), performance expectancy (0–16), and expectancy accuracy (overall detection accuracy minus performance expectancy) in detecting lies and truths depending on PHQ-9 groups
Table 4 Means (standard deviations) of truth bias, accuracy scores, judgment confidence (%), performance expectancy (0–16), and expectancy accuracy (overall detection accuracy minus performance expectancy) in detecting lies and truths depending on CES-D groups

Further analyses revealed that the valence of the statements (positive/negative) influenced the effect of depression on judgmental bias. For analyses with the IPIP, we created a difference measure by subtracting the truth bias for negative statements from the truth bias for positive statements. The IPIP showed a significant correlation with the difference measure, r = −.153, p = .001, indicating that the positive effect of IPIP depression on truth bias was bigger in negative statements. In addition, a mixed-model 4 (PHQ) × 2 (Valence) variance analysis revealed significant main effects for PHQ-9, F(3, 468) = 18.933, p < .001, η2 = .108, and Valence, F(1, 468) = 9.848, p = .001, η2 = .021, such that positive statements were rated more often as truth and a significant interaction, F(3, 468) = 5.269, p = .001, η2 = .033, indicating that groups with less depressive symptoms rated positive statements more often as truth (see Figure 1). Another 2 (CES-D) × 2 (Valence) mixed-model variance analysis revealed a significant main effect for the CES-D, F(1, 470) = 34.267, p < .001, η2 = .068, a significant main effect for Valence, F(1, 470) = 14.066, p < .001, η2 = .029, and a significant interaction, F(1, 470) = 7.056, p = .008, η2 = .015, indicating that the difference in truth bias between positive and negative statements was lower in the depressed group: see Figure 1.

Figure 1 Interaction of valence and depression measures with truth bias as a criterion. Results of mixed-model variance analyses, with depression and valence as predictors and truth bias as a criterion. Truth bias is presented as a percentage of statements rated as truth.

Racial Bias

Overall, the number of videos judged as true did not significantly differ between Black (M = 66.74, SD = 20.67) and White targets (M = 65.25, SD = 65.25), t(471) = 1.274, p = .203. To test our Racial Bias Hypothesis with the IPIP, we calculated a difference measure by subtracting the truth bias for White targets from the truth bias for Black targets. Aligned with our hypothesis, the IPIP showed a significant negative correlation with the difference measure, r = −.096, p = .037, meaning a lower difference in truth bias for Black and White targets in participants with higher IPIP scores. Furthermore, a mixed-model variance analysis with the within-subject factor target race and the between-subject factor PHQ-9 revealed no significant main effect for Target Race, F(1, 468) = 2.096, p = .148, ηp2 = .004, a significant main effect for the PHQ-9, F(3, 468) = 18.93, p < .001: More severe depression groups had higher levels of truth bias, ηp2 = .108, and a significant interaction aligning with our Racial Bias Hypothesis, F(3, 468) = 2.822, p = .038, meaning that a heightened truth bias for Black targets was found in the healthy group but decreased in higher groups of depression; see Figure 2. A 2 (CES-D) × 2 (Target race) mixed-model variance analysis resulted in a nonsignificant main effect for Target Race, F(1, 470) = 3.722, p = .054, ηp2 = .008, a significant main effect for the CES-D, F(1, 470) = 34.267, p < .001, ηp2 = .068, such that the group of depression had a higher level of truth bias and a significant interaction also aligning with our hypothesis, F(1, 470) = 5.426, p = .020, ηp2 = .011, showing a heightened truth bias for Black targets in the nondepressed group but not in the depressed group (see Figure 2).

Figure 2 Interaction of target race and depression measures with truth bias as a criterion. Results of mixed-model variance analyses, with depression and target race as predictors and truth bias as a criterion. Truth bias is presented as a percentage of statements rated as truth.

Detection Accuracy

Overall detection accuracy was, on average, 51.64% (SD = 11.21). This value differed significantly from what would be expected by chance (50%), t(471) = 3.182, p = .002. There was no significant difference between the accuracy at identifying positive statements (M = 51.17, SD = 15.75) and negative statements (M = 52.12, SD = 15.90), t(471) = −0.927, p = .354.

Analyses with the IPIP Depression Measure

In contrast to our Accuracy Hypothesis, the results showed no significant correlation between overall accuracy and IPIP, r = −.062, p = .178. To test our Statement Valence Hypothesis with the IPIP, we calculated a difference measure by subtracting the overall accuracy for negative statements from the overall accuracy for positive statements. In contrast to our Statement Valence Hypothesis, the IPIP did not significantly correlate with the difference measure, r = .028, p = .547, nor did it correlate with the number of correctly identified positive, r = −.024, or the negative statements, r = −.063, p = .170.

Analyses with the PHQ-9 Depression Measure

In contrast to our Accuracy Hypothesis, a variance analysis with the PHQ-9 revealed no significant difference in overall detection accuracy, F(3, 468) = 1.567, p = .197, η2 = .010. Contrast analyses showed no difference between the healthy group and all other groups, t(468) = 1.036, p = .301, nor between the severely depressed group and all other groups, t(468) = −1.411, p = .159.

To test our Statement Valence Hypothesis with the PHQ-9, we used a mixed-model ANOVA with the within-subject factor Valence, which also revealed a significant main effect for PHQ-9 neither for overall detection accuracy, F(3, 468) = 1.567, p = .197, ηp2 = .010, nor for Valence, F(1, 468) < .001, p = .990, ηp2 < .001. However, the interaction between PHQ-9 and Valence was significant, F(3, 468) = 4.140, p = .007, ηp2 = .026; see Figure 3. Further analyses revealed that the difference between PHQ-9 groups in overall detection accuracy is significant in negative statements, F(3, 468) = 4.050, p = .001, ηp2 = .025, with higher accuracy in depressed participants, but not significant in positive statements, F(3, 468) = 1.622, p = .163, ηp2 = .010, therefore going against the hypothesized direction.

Figure 3 Interaction of valence and depression measures with overall accuracy as a criterion. Results of mixed-model variance analyses, with depression and valence as predictors and overall rating accuracy as a criterion. Overall rating accuracy is presented as a percentage of statements rated correctly.

Analyses with the CES-D Depression Measure

In contrast to our Accuracy Hypothesis, a variance analysis with the CES-D depression measure revealed no significant difference between nondepressed (M = 51.83, SD = 11.29) and depressed (M = 51.54, SD = 11.19), F(1, 470) = .070, p = .791, η2 < .001.

A mixed-model analysis adding the factor Valence revealed no significant difference in overall accuracy between CES-D groups, F(1, 470) = .070, p = .791, ηp2 < .001, no main effect of Valence, F(1, 470) = .946, p = .331, ηp2 = .002, and no significant interaction between CES-D and Valence, F(1, 470) = .088, p = .767, ηp2 < .001; see Figure 3.

Judgmental Confidence

Confidence averaged over all videos (ranging from 9.44 to 100.00, M = 66.72, SD = 11.87) did not significantly correlate with overall detection accuracy, r = −.066, p = .154. Also, there was no significant difference between the confidence for White targets (M = 66.69, SD = 12.26) and Black targets (M = 66.73, SD = 12.45). However, a difference measure that subtracted overall confidence for White targets from Black targets correlated with the difference measure for truth bias between Black and White targets, r = .196, p < .001, implicating heightened confidence in classifications of statements by Black targets for participants with a heightened truth bias for Black targets.

In alignment with our Confidence Hypothesis, the IPIP scores correlated significantly with Confidence, r = −.153, p = .001, such that individuals with higher IPIP scores reported lower confidence in their ratings. Moreover, there was a significant difference between PHQ-9 groups, F(4, 768) = 7.010, p < .001, η2 = .043, such that the healthy group reported higher confidence in their ratings; see Table 3. Contrast analyses showed a significant difference between the healthy group and all other groups, such that the healthy group reported significantly higher confidence in their ratings than all other groups, t(468) = 2.927, p = .004, and a nonsignificant difference between the severely depressed group and all other groups, t(468) = 1.880, p = .061. Also, the CES-D group of depressed participants was significantly less confident (M = 65.67, SD = 11.58) than the group of nondepressed participants (M = 65.67, SD = 11.58), F(1, 470) = 6.884, p = .009, η2 = .014; see also Table 4.

Detection Ability Estimations

We tested the Expectancy Accuracy Hypothesis with the two variables of Self-evaluation and Performance expectancy. In Self-evaluation, 38 participants rated their lie-detection ability as below average, 326 as average, and 108 as above average. IPIP scores were significantly different between Self-evaluation groups of below average (M = 27.24, SD = 8.12), average (M = 27.59, SD = 8.07), and above average (M = 24.11, SD = 8.24), F(2, 469) = 7.504, p = .001, η2 = .031, such that individuals that made a self-evaluation of above average had lower IPIP scores. The difference in Self-evaluation between PHQ-9 groups was not significant, χ2(6, N = 472) = 11.992, p = .062, V = .113; however, the difference in Self-evaluation between CES-D groups was significantly different, χ2(2, N = 472) = 8.346, p = .015, φ = .133.

To test our Expectancy Accuracy Hypothesis with the Self-evaluation variable and the IPIP, we indicator-coded Self-evaluation, resulting in the focal predictors “average” and “above average” and performed a multiple regression, with IPIP as moderator and overall detection accuracy as criteria. Overall, the amount of explained variance was not significant, R2 = .100, p = .453, nor were the focal predictors of “average” and “above average,” baverage = 1.463, t(466) = 0.215, p = .830, 95% CI [−11.931; 14.858], babove average = −2.4029, t(466) = −0.331, p = .741, 95% CI [−16.931;11.874]; nor was IPIP as predictor, bipip = −0.074, p = .745, t(466) = −0.326, 95% CI [−0.520; 0.372]. The interaction of “average” and IPIP was not significant, bipip×average = −0.050, p = 835, t(466) = −0.209, 95% CI [−0.521;0.421]. Neither was the interaction of “above average” and IPIP, bipip×above average = 0.023, t(466) = 0.086, p = .931, 95% CI [−0.493; 0.538]. In a 3 (Self-evaluation) × 4 (PHQ-9) between-subject variance analysis with overall detection accuracy as dependent variable, there were no main effects for either PHQ-9, F(3, 460) = 0.699, p = .553, η2 = .007, Self-evaluation, F(2, 460) = 1.229, p = .293, η2 = .005, and no significant interaction between PHQ-9 and Self-evaluation, F(6, 460) = 0.558, p = .764, η2 = .007. A 3 (Self-evaluation) × 2 (CES-D) between-subject variance analysis revealed no significant main effects for CES-D, F(1, 466) = 0.175, p = .676, η2 < .001, or Self-evaluation, F(2, 466) = 0.998, p = .369, η2 = .004, nor a significant interaction between Self-evaluation and CES-D, F(2, 466) = 0.021, p = .979, η2 < .001.

Performance Expectancy ranged from 0 to 16 videos (M = 10.83, SD = 3.14). Performance Expectancy correlated significantly with judgmental confidence, r = 316, p < .001, such that individuals who reported higher confidence in their judgments had a higher performance expectancy. It also correlated with truth bias, r = 220, p < .001, such that individuals with higher performance expectancy had a higher truth bias but not with overall detection accuracy, r = −.018, p = .698. The IPIP did not significantly correlate with Performance Expectancy, r = .006, p = .901. Moreover, Performance Expectancy did not differ between PHQ-9 groups, F(3, 466) = 0.990, p = .397, η2 = .006. Contrast analyses revealed no significant difference between the healthy group and all other groups, t(466) = −1.648, p = .100, nor between the severely depressed group and all other groups, t(466) = 0.597, p = .551. Neither did Performance Expectancy differ among CES-D groups, F(1, 468) = 0.959, p = .328, η2 = .002.

To test our Expectancy Accuracy Hypothesis with the Performance Expectancy measure, we created a difference measure (Expectancy Accuracy) by subtracting the expected number of correctly judged videos from the actual number of correctly judged videos. Against our prediction (Expectancy Accuracy Hypothesis), Expectancy Accuracy (ranging from −11 to 9, M = −2.56, SD = 3.64) did not significantly correlate with the IPIP, r = .006, p = .901. Nor was there a difference between PHQ-9 groups, F(3, 466) = 1.492, p = .216, η2 = .010. Contrast analyses revealed no significant difference between the healthy group and all other groups, t(466) = 1.924, p = .055, nor a significant difference between the severely depressed group and all other groups, t(466) = −1.191, p = .234. There was also no significant difference between CES-D groups, F(1, 468) = 0.921, p = .338, η2 = .002.

Discussion

Judgmental Bias and Depressive Realism

The depressive realism hypothesis suggests that depressed (or rather dysphoric) individuals (see Moore & Fresco, 2012) see the world more realistic than the nondysphoric do (Alloy & Abramson, 1979). In the lie-detection setting, we assumed this would indicate a lesser truth bias for dysphoric individuals (Truth Bias Hypothesis). However, our results did not align with our truth-bias hypothesis, and using three different assessment measures of depression suggests that dysphoric individuals show more of a truth bias than their nondysphoric counterparts. A study by Martin et al. (1984) found that depressed individuals do not succumb to the illusion of control when assessing their own influence but do so with others, which aligns with our results. The effect was moderated by the valence of the statements: Depressed individuals rated statements more often as true when the targets depicted an acquaintance negatively. This finding implies that depressed individuals might have a mood-congruent bias, as Weightman et al. (2014) described in their interpretation of cognitive stimuli.

Racial Bias and Depressive Realism

Lloyd et al. (2017) found that statements made by Black targets were more often rated as true than statements made by White Targets. Furthermore, they found that individuals with a primarily internal motivation to regulate racial prejudice had the highest racial truth bias, which might be interpreted as a rating behavior that protects a positive self-view. We assumed that dysphoric individuals would be less motivated to regulate racial prejudice because of this internal motivation having a self-view-regulating effect and depressed individuals having an overall negative self-view. Our study revealed no overall heightened truth bias for Black targets. However, as our Racial-Realism Hypothesis predicted, the racial truth bias effect interacted with all three depression measures. Dysphoric individuals had a less heightened truth bias for Black targets than did nondepressed individuals. This effect can be called a depressive realism effect, as a self-view protecting bias did not distort the proportions of truth and lie ratings. We should mention, however, that, because of the length of our study, we did not assess concerns about being prejudiced. Future studies should test whether the importance of the internal motivation to not be prejudiced is responsible for the difference between dysphoric and nondysphoric individuals in the racial truth bias.

Detection Accuracy and Depressive Realism

In contrast to our accuracy hypothesis, which suggests a heightened overall detection accuracy in dysphoric individuals, there was no significant difference in overall detection accuracy between dysphoric and nondysphoric individuals on any of the depression assessment scales. Therefore, our results do not support the depressive realism hypothesis in the perception of others. Our Statement Valence Hypothesis suggested an interaction of statement valence and depression, such that the overall rating accuracy would have been higher in positive statements than in negative statements. Overall detection accuracy was higher in positive statements than in negative statements for all participants. However, this effect did not interact with the IPIP depression score or the CES-D. Only on the PHQ-9 was the interaction of depression and valence significant, with only the low-depression group having a higher accuracy on positive than negative statements (see also Benz & Reinhard, 2021). Because the results did not differ among all depression scales, the results do overall not support a heightened accuracy for depressed participants for any lies and do not align with the results of Lane and DePaulo (1999). While our results might challenge Lane and DePaulo’s explanation of their results as a heightened sensitivity for phony reassurances with the alternative explanation of a mood-congruent bias, Lane and DePaulo investigated the perception of honest and dishonest statements with the intent to enhance social interactions. The statements the targets made in this study reveal honest or dishonest statements about associates they either liked or disliked, without such intent. Future studies should investigate whether one would find heightened detection accuracy in lies with the intent to enhance social relations.

Confidence, Self-evaluation of Lie-Detection Ability and Depressive Realism

Our Confidence Hypothesis suggested that, in line with the depressive realism hypothesis, the participants would not be as sure about their ability to detect lies and would therefore believe less in the reliability of their veracity judgments. In alignment with this, the average confidence measure was lower among dysphoric participants on all depression measurement scales.

Our Expectancy Accuracy Hypothesis suggested a more realistic perception in dysphoric participants of their lie-detection ability and thus resembled previous investigations on depressive realism more closely. As with the confidence measure analyses, dysphoric participants had lower performance expectations on all depression scales. However, analyses investigating whether these assessments were more accurate failed to confirm the Expectancy Accuracy Hypothesis and therefore question the depressive realism view in self-perception.

Limitations

The first limitation lies in our recruitment process. Because of the relatively low payment for participation, there might have been a risk of careless responders reducing the quality of our sample. Also, one should interpret our results concerning the racial bias effect with care, as we did not assess concerns about being prejudiced. Further studies should investigate whether reduced motivation is unprejudiced in individuals with depressive symptoms.

We thank E. Paige Lloyd, Jason C. Deska, Kurt Hugenberg, Allen R. McConnell, Brandon T. Humphrey, and Jonathan W. Kunstman for providing the stimulus material for these studies.

References

  • Alloy, L. B., & Abramson, L. Y. (1979). Judgment of contingency in depressed and nondepressed students: Sadder but wiser? Journal of Experimental Psychology: General, 108(4), 441–485. https://doi.org/10.1037/0096-3445.108.4.441 First citation in articleGoogle Scholar

  • Alloy, L. B., & Ahrens, A. H. (1987). Depression and pessimism for the future: Biased use of statistically relevant information in predictions for self versus others. Journal of Personality and Social Psychology, 52(2), 366–378. https://doi.org/10.1037/0022-3514.52.2.366 First citation in articleGoogle Scholar

  • Beck, A. T. (1963). Thinking and depression: I. Idiosyncratic content and cognitive distortions. Archives of General Psychiatry, 9(4), 324–333. https://doi.org/10.1001/archpsyc.1963.01720160014002 First citation in articleGoogle Scholar

  • Beck, A. T. (1967). Depression: Clinical, experimental, and theoretical aspects. Harper & Row. First citation in articleGoogle Scholar

  • Benz, D. (2023, July 2). Depression, race-based bias and judgments of veracity. OSF. https://osf.io/y5b9c/ First citation in articleGoogle Scholar

  • Benz, D., & Reinhard, M.-A. (2021). Sadder and less gullible? – Investigating depressive realism effects in judgments of veracity. Journal of Social and Clinical Psychology, 40, 404–421. https://doi.org/10.1521/jscp.2021.40.5.421 First citation in articleGoogle Scholar

  • Blanco, F., Matute, H., & Vadillo, M. A. (2011). Making the uncontrollable seem controllable: The role of action in the illusion of control. The Quarterly Journal of Experimental Psychology, 64(7), 1290–1304. https://doi.org/10.1080/17470218.2011.552727 First citation in articleGoogle Scholar

  • Bond, C. F. Jr., & DePaulo, B. M. (2006). Accuracy of deception judgments. Personality and Social Psychology Review, 10(3), 214–234. https://doi.org/10.1207/s15327957pspr1003_2 First citation in articleGoogle Scholar

  • Costa, P. T., & McCrae, R. R. (1985). The NEO Personality Inventory. Psychological Assessment Resources. First citation in articleGoogle Scholar

  • Dennard, D. O., & Hokanson, J. E. (1986). Performance on two cognitive tasks by dysphoric and nondysphoric students. Cognitive Therapy and Research, 10(3), 377–386. https://doi.org/10.1007/BF01173473 First citation in articleGoogle Scholar

  • DePaulo, B. M., Charlton, K., Cooper, H., Lindsay, J. J., & Muhlenbruck, L. (1997). The accuracy-confidence correlation in the detection of deception. Personality and Social Psychology Review, 1(4), 346–357. https://doi.org/10.1207/s15327957pspr0104_5 First citation in articleGoogle Scholar

  • DePaulo, B. M., Lindsay, J. J., Malone, B. E., Muhlenbruck, L., Charlton, K., & Cooper, H. (2003). Cues to deception. Psychological Bulletin, 129(1), 74–118. https://doi.org/10.1037/0033-2909.129.1.74 First citation in articleGoogle Scholar

  • Forgas, J. P., & East, R. (2008). On being happy and gullible: Mood effects on skepticism and the detection of deception. Journal of Experimental Social Psychology, 44(5), 1362–1367. https://doi.org/10.1016/j.jesp.2008.04.010 First citation in articleGoogle Scholar

  • Fried, E. I. (2017). The 52 symptoms of major depression: Lack of content overlap among seven common depression scales. Journal of Affective Disorders, 208, 191–197. https://doi.org/10.1016/j.jad.2016.10.019 First citation in articleGoogle Scholar

  • Fu, T. S.-T., Koutstaal, W., Poon, L., & Cleare, A. J. (2012). Confidence judgment in depression and dysphoria: The depressive realism vs. Negativity hypotheses. Journal of Behavior Therapy and Experimental Psychiatry, 43(2), 699–704. https://doi.org/10.1016/j.jbtep.2011.09.014 First citation in articleGoogle Scholar

  • Gara, M. A., Woolfolk, R. L., Cohen, B. D., Goldston, R. B., Allen, L. A., & Novalany, J. (1993). Perception of self and other in major depression. Journal of Abnormal Psychology, 102(1), 93–100. https://doi.org/10.1037/0021-843X.102.1.93 First citation in articleGoogle Scholar

  • Goldberg, L. R. (1999). A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models. Personality Psychology in Europe, 7(1), 7–28. First citation in articleGoogle Scholar

  • Gotlib, I. H., & Meltzer, S. J. (1987). Depression and the perception of social skill in dyadic interaction. Cognitive Therapy and Research, 11(1), 41–53. https://doi.org/10.1007/BF01183131 First citation in articleGoogle Scholar

  • Harkness, K., Sabbagh, M., Jacobson, J., Chowdrey, N., & Chen, T. (2005). Enhanced accuracy of mental state decoding in dysphoric college students. Cognition & Emotion, 19(7), 999–1025. https://doi.org/10.1080/02699930541000110 First citation in articleGoogle Scholar

  • Hartwig, M., & Bond, C. F. Jr. (2011). Why do lie-catchers fail? A lens model meta-analysis of human lie judgments. Psychological Bulletin, 137(4), 643–659. https://doi.org/10.1037/a0023589 First citation in articleGoogle Scholar

  • Lane, J. D., & DePaulo, B. M. (1999). Completing Coyne’s cycle: Dysphorics’ ability to detect deception. Journal of Research in Personality, 33(3), 311–329. https://doi.org/10.1006/jrpe.1999.2253 First citation in articleGoogle Scholar

  • Langer, E. J. (1975). The illusion of control. Journal of Personality and Social Psychology, 32(2), 311–328. https://doi.org/10.1037/0022-3514.32.2.311 First citation in articleGoogle Scholar

  • Lloyd, E. P., Deska, J. C., Hugenberg, K., McConnell, A. R., Humphrey, B. T., & Kunstman, J. W. (2019). Miami University deception detection database. Behavior Research Methods, 51(1), 429–439. https://doi.org/10.3758/s13428-018-1061-4 First citation in articleGoogle Scholar

  • Lloyd, E. P., Hugenberg, K., McConnell, A. R., Kunstman, J. W., & Deska, J. C. (2017). Black and White lies: Race-based biases in deception judgments. Psychological Science, 28(8), 1125–1136. https://doi.org/10.1177/0956797617705399 First citation in articleGoogle Scholar

  • Martin, D. J., Abramson, L. Y., & Alloy, L. B. (1984). Illusion of control for self and others in depressed and nondepressed college students. Journal of Personality and Social Psychology, 46(1), 125–136. https://doi.org/10.1037/0022-3514.46.1.125 First citation in articleGoogle Scholar

  • Matute, H. (1995). Human reactions to uncontrollable outcomes: Further evidence for superstitions rather than helplessness. The Quarterly Journal of Experimental Psychology, 48(2), 142–157. https://doi.org/10.1080/14640749508401444 First citation in articleGoogle Scholar

  • Moore, M. T., & Fresco, D. M. (2012). Depressive realism: A meta-analytic review. Clinical Psychology Review, 32(6), 496–509. https://doi.org/10.1016/j.cpr.2012.05.004 First citation in articleGoogle Scholar

  • Oliver, J. M., & McGee, J. (1982). Cognition as a function of depression in a student population: Content and complexity of cognitions. Cognitive Therapy and Research, 6(3), 275–285. https://doi.org/10.1007/BF01173576 First citation in articleGoogle Scholar

  • O’Sullivan, M., Ekman, P., & Friesen, W. V. (1988). The effect of comparisons on detecting deceit. Journal of Nonverbal Behavior, 12(3), 203–215. https://doi.org/10.1007/BF00987488 First citation in articleGoogle Scholar

  • Plant, E. A., & Devine, P. G. (1998). Internal and external motivation to respond without prejudice. Journal of Personality and Social Psychology, 75(3), 811–832. https://doi.org/10.1037/0022-3514.75.3.811 First citation in articleGoogle Scholar

  • Plant, E. A., Devine, P. G., & Peruche, M. B. (2010). Routes to positive interracial interactions: Approaching egalitarianism or avoiding prejudice. Personality and Social Psychology Bulletin, 36(9), 1135–1147. https://doi.org/10.1177/0146167210378018 First citation in articleGoogle Scholar

  • Radloff, L. S. (1977). The CES-D scale: A self-report depression scale for research in the general population. Applied Psychological Measurement, 1(3), 385–401. https://doi.org/10.1177/014662167700100306 First citation in articleGoogle Scholar

  • Reinhard, M. A., & Schwarz, N. (2012). The influence of affective states on the process of lie detection. Journal of Experimental Psychology: Applied, 18(4), 377–389. https://doi.org/10.1037/a0030466 First citation in articleGoogle Scholar

  • Ruehlman, L. S., West, S. G., & Pasahow, R. J. (1985). Depression and evaluative schemata. Journal of Personality, 53(1), 46–92. https://doi.org/10.1111/j.1467-6494.1985.tb00888.x First citation in articleGoogle Scholar

  • Seligman, M. E. (1972). Learned helplessness. Annual Review of Medicine, 23(1), 407–412. https://doi.org/10.1146/annurev.me.23.020172.002203 First citation in articleGoogle Scholar

  • Soderstrom, N. C., Davalos, D. B., & Vázquez, S. M. (2011). Metacognition and depressive realism: Evidence for the level-of-depression account. Cognitive Neuropsychiatry, 16(5), 461–472. https://doi.org/10.1080/13546805.2011.557921 First citation in articleGoogle Scholar

  • Spitzer, R. L., Kroenke, K., Williams, J. B., & Patient Health Questionnaire Primary Care Study Group (1999). Validation and utility of a self-report version of PRIME-MD: The PHQ primary care study. JAMA, 282(18), 1737–1744. https://doi.org/10.1001/jama.282.18.1737 First citation in articleGoogle Scholar

  • Uliaszek, A. A., Hauner, K. K., Zinbarg, R. E., Craske, M. G., Mineka, S., Griffith, J. W., & Rose, R. D. (2009). An examination of content overlap and disorder-specific predictions in the associations of neuroticism with anxiety and depression. Journal of Research in Personality, 43(5), 785–794. https://doi.org/10.1016/j.jrp.2009.05.009 First citation in articleGoogle Scholar

  • Venkatesh, S., Moulds, M. L., & Mitchell, C. J. (2018). Testing for depressive realism in a clinically depressed sample. Behaviour Change, 35(2), 108–122. https://doi.org/10.1017/bec.2018.12 First citation in articleGoogle Scholar

  • Vrij, A. (2008). Detecting lies and deceit: Pitfalls and opportunities. Wiley. First citation in articleGoogle Scholar

  • Vrij, A., & Baxter, M. (1999). Accuracy and confidence in detecting truths and lies in elaborations and denials: Truth bias, lie bias and individual differences. Expert Evidence, 7(1), 25–36. https://doi.org/10.1023/A:1008932402565 First citation in articleGoogle Scholar

  • Weightman, M. J., Air, T. M., & Baune, B. T. (2014). A review of the role of social cognition in major depressive disorder. Frontiers in Psychiatry, 5, Article 179 . https://doi.org/10.3389/fpsyt.2014.00179 First citation in articleGoogle Scholar

  • Yarritu, I., Matute, H., & Vadillo, M. A. (2014). Illusion of control: The role of personal involvement. Experimental Psychology, 61(1), 38–47. https://doi.org/10.1027/1618-3169/a000225 First citation in articleLinkGoogle Scholar