← All articles

May 22, 2026 · Steffen Foerster · 14 min read

Why Great Images Don't Always Win: The Psychology Behind Nature Photography Contest Results

After years of entering and studying major nature photography contests, I've come to understand that judging is deeply human — and human evaluation follows patterns more complicated than we like to admit.

Why Great Images Don't Always Win: The Psychology Behind Nature Photography Contest Results

Giant Petrel · Gold Award, Bird Portraits · Bird Photographer of the Year 2025

© Steffen Foerster

Over the last several years I've been fortunate to have my own work recognized in a number of major nature photography contests. Following contests closely — both as a participant and observer — has only made me more fascinated by how difficult some results can be to fully explain.

Most winning images are excellent. That part isn't really in question. But spend enough time studying contest results and you'll eventually encounter outcomes that leave you genuinely uncertain about what separated one image from another. Sometimes a winning photograph feels very close to work already recognized years earlier. Sometimes a commended image strikes you as the stronger photograph. And sometimes, particularly at the overall winner level, the final choice seems shaped not just by photographic execution but also by storytelling, conservation relevance, emotional resonance, or the kind of statement a contest may want its winning image to make that year.

None of this means judging is broken or dishonest. More often, it reflects something simpler and far more human: once photography reaches a very high level, outcomes are shaped not just by image quality itself, but also by psychology, context, visual trends, familiarity, rarity, and the reality that different people respond to different things.

Understanding those patterns won't change any results. But it may change how you interpret them, how you decide what to enter, and how much emotional weight you place on any single outcome.

Why Judges Can't Fully Explain Their Own Decisions

One of the most unsettling findings in the psychology of aesthetic judgment is that people — including experts — often cannot accurately account for why they preferred one thing over another.

Psychologist Petter Johansson and colleagues designed an experiment in which participants were shown pairs of photographs and asked to choose which face they found more attractive. Immediately after choosing, they were handed a card and asked to explain their decision. On certain trials, the experimenter used sleight of hand to give participants the card they hadn't chosen. Most participants didn't notice. More strikingly, they went on to confidently articulate reasons for preferring the image they had actually rejected — describing features of the non-chosen face as the very qualities that made it appealing. The effect has since been replicated across aesthetic beauty judgments of abstract images, taste tests, and even moral attitudes.

The researchers' conclusion was that our introspective reports about aesthetic choices may be largely post-hoc constructions — narratives we build after the fact to explain decisions that were driven by processes we don't have conscious access to.

What this means for contest judging is significant. When a judge explains why a winning image deserved to win, they are probably telling a coherent and sincere story. But that story may not fully describe what actually drove the judgment. The articulated reason — the light was extraordinary, the composition was perfectly balanced — may be a rationalization of a response that was formed much faster, and under the influence of factors neither the judge nor the photographer can easily see.

How Quickly an Image Reads

There is a well-documented psychological phenomenon called processing fluency: the ease with which the brain processes a visual stimulus generates a positive affective signal that we experience as aesthetic pleasure. The more readily an image resolves — the faster the subject separates from the background, the more clearly the visual hierarchy organizes itself, the more immediately the emotional content registers — the more positively it tends to be received.

Research by Reber, Schwarz, and Winkielman found that this effect operates specifically at short exposure durations. When the same image was shown for a fraction of a second versus ten seconds, figure-ground contrast influenced aesthetic judgments strongly at brief exposures but not at longer ones. At ten seconds, viewers could perceive the objective features of the image fully regardless of contrast — but at brief exposures, how quickly the image resolved determined how much they liked it. In other words, processing fluency is an artifact of limited viewing time. It disappears when viewers have the time to actually look.

In early judging rounds at large competitions — Wildlife Photographer of the Year draws over 60,000 entries — judges are moving through hundreds of images per session. Under those conditions, images with strong subject separation, clear visual hierarchy, and immediate emotional legibility are structurally advantaged over work whose strengths reveal themselves more slowly. A quieter image, a more ambiguous composition, a photograph that becomes more powerful after thirty seconds of attention — these may not get the viewing conditions they need in early rounds.

This helps explain something photographers often notice when reviewing winning images at home: work that felt instantly striking during a live session can feel less exceptional in slow, isolated viewing, while subtler images that didn't advance can feel increasingly strong the longer you spend with them. The photographs haven't changed. The viewing conditions have.

The Reality of Group Judgment

Most major contests rely on judging panels rather than individuals, which intuitively feels fairer — and in many ways it is. Group judging helps reduce the influence of any one person's preferences or blind spots.

But group evaluation also introduces its own dynamics that work against the assumption that more reviewers simply means more accuracy.

Research on group decision-making identifies three distinct failure modes relevant to panel judging. The first is group amplification of error: groups sometimes reinforce individual errors rather than correct them, particularly when one judge's strong early reaction anchors the discussion. The second is conformity effects: panel members may defer to more senior colleagues or to the apparent emerging consensus, even when they privately disagree — with vocal members often driving an outcome that quieter dissenters would not have chosen independently. The third is group polarization: when groups do converge, they tend to land at a more extreme position than any individual member would have chosen on their own — a well-documented phenomenon across jury research, organizational psychology, and political decision-making.

Panel size compounds this. Research consistently shows that smaller groups encourage more creative divergence, while larger panels increase conformity pressure. A seven-person jury evaluating tens of thousands of entries faces very different psychological dynamics than a three-person panel for a regional competition.

The practical consequence is that the images performing best in panel settings are often not the photographs that provoke the strongest reaction from one individual judge — they are the images capable of maintaining broad support across a diverse panel. That is a subtly different quality. An image that one judge finds transcendent but another finds confusing may not advance. An image that everyone finds excellent but nobody finds transcendent might. This helps explain why contest winners sometimes feel more immediately readable than the work that genuinely surprised you elsewhere in the same collection.

Order, Fatigue, and the Luck of the Sequence

How an image is experienced depends substantially on where in the sequence it falls — a dynamic that research has documented rigorously across competitive contexts.

Psychology identifies two competing effects in sequential evaluation. A primacy effect means that early entries are judged against an implicit standard that hasn't yet been calibrated by what follows. A recency effect means that later entries benefit from being freshest in judges' memory when final decisions are made. Studies of song competitions, wine tastings, and talent contests consistently find that entries in the middle of a long sequence are at a structural disadvantage: they lose to neither effect. Research suggests that in sequential evaluation, appearing either first or last is measurably preferable to landing in the middle of a long session.

There is also a contrast effect operating throughout the process. Our sensory systems are wired to encode differences rather than absolute values — we judge things relative to what we've just seen, not against a fixed internal standard. A quiet environmental portrait arriving after a run of dramatic action images may suddenly register as powerful. The same image arriving after a sequence of similarly quiet work may disappear. The photograph hasn't changed. The context has. This is not a quirk of photography judging specifically — it is a fundamental property of how human perception operates, documented across domains from taste to visual aesthetics to hiring decisions.

Rarity, Narrative, and What Contests Actually Reward

In wildlife photography, the perceived value of an image is never purely about photographic execution. Rarity matters — both in the ecological sense and in the psychological one.

Research in behavioral economics has consistently shown that people assign greater value to things perceived as scarce or difficult to access. In wildlife photography, this plays out in the gap between an extraordinary image of a familiar species and a compositionally simpler image of a genuinely rare subject or behavior. The two can be genuinely difficult for judges to compare because they aren't measuring the same thing — they're evaluating photographic craft in one case and a form of documentary witness in the other.

This is not a flaw in judging. It's part of what makes wildlife photography compelling. The question of what it means to produce the best image in a contest — the most skillfully made, or the most significant — is never fully resolved, and the tension shows up differently in different competitions.

Wildlife Photographer of the Year explicitly names originality, narrative, and ethical practice as its core criteria, and states that it strongly favors photographs not already recognized in similar competitions. Its jury has said publicly that "an imperfect, never-before-seen moment can stand up against the most beautifully composed photograph." Bird Photographer of the Year places explicit weight on the story behind the image and the behavioral or conservation context the caption provides. These aren't vague aspirations — they are genuine signals about where the weight falls when judges are choosing between images that are all technically strong.

Understanding a competition's stated and unstated priorities is one of the more consistently undervalued strategic decisions a photographer can make.

Trend Cycles and the Feedback Loop Contests Create

One of the more counterintuitive dynamics in contest photography is that rewarding a visual style can eventually undermine its own impact.

The black-background wildlife portrait is a useful case study. When photographers began working seriously in this low-key, chiaroscuro style around 2010, it was genuinely novel — borrowed from studio portrait traditions but rarely applied to wild subjects in the field. Early entrants in this style stood out precisely because the approach hadn't been seen in competition. Recognition followed. Other photographers studied those results and began exploring similar techniques. The style spread, becoming one of the more commonly entered approaches in bird and mammal portrait categories. What had once felt arresting began to feel familiar, and eventually predictable — not because the technique became less skillful, but because the visual territory became crowded.

This is the feedback loop that contest ecosystems reliably produce. Strong results create visible benchmarks. Photographers absorb those benchmarks and internalize the visual language. The language spreads through the community. Saturation builds, and aesthetic fatigue sets in — often without anyone consciously deciding it has happened. The judges who once found the approach striking now encounter it dozens of times per session, and the processing fluency that once felt like immediate power begins to feel like familiarity instead.

Trend awareness, in this sense, is a genuinely useful analytical tool — not about chasing what's current, but about understanding which visual territories still have room to register as fresh within the current competitive landscape, and which have become crowded enough that even strong execution may struggle to stand out.

Different Contests, Different Conversations

Competitions do not all reward the same qualities, and the gap between what different contests value can be wide enough that the same image performs very differently depending on where it's entered.

Some competitions consistently favor emotionally dramatic wildlife behavior. Others lean graphic or artistic. Some explicitly weight conservation storytelling and the broader significance of what was documented. WPY's Impact Award is reserved specifically for images that recognize a conservation success or a story of hope — a category that doesn't exist at all in competitions focused purely on photographic craft. These structural differences in what each contest is trying to celebrate shape outcomes in ways that have nothing to do with absolute image quality.

Recognizing this reframes rejection in a useful way. Sometimes an image fails not because it is weak, but because it encountered a panel whose priorities did not align with what the image was offering. That is genuinely different information from "this photograph is not good enough," and it points toward a different response: not reworking the image, but finding the competition whose judging culture is actually a better fit for it.

Studying a contest's judging history across multiple years is often more useful preparation than trying to produce universally successful images. Understanding what a specific competition has consistently rewarded — and what it seems to have moved past — is a form of strategic intelligence that serious competitors develop over time.

What This Means in Practice

A few conclusions follow that are practically useful rather than merely interesting.

A single result carries limited information. Contest outcomes are shaped by panel composition, comparison sets, judging order, stylistic fit, and timing — all largely invisible to the photographer. Strong conclusions drawn from one acceptance or rejection are unreliable. Meaningful patterns only emerge across many contests and many years.

Slow-reading images face a structural disadvantage in early rounds. Work whose strengths reveal themselves gradually is most exposed to the processing fluency effect — it needs viewing time that early-round judging often can't provide. This doesn't mean such work shouldn't be entered. It means identifying which competitions create conditions where that kind of image has a real chance to be seen properly, and prioritizing those.

The fundamentals still determine most outcomes. The psychological dynamics described here operate mostly at the margins between already strong images. None of them substitute for fieldcraft, patience, behavioral knowledge, or the ability to consistently produce compelling photographs. The vast majority of entries are separated from the winning work by factors that have nothing to do with panel dynamics or sequence effects.

Your own judgment matters independently. A photograph that fails in competition is not automatically a failed photograph. Contest results represent the outcome of a particular evaluative process involving a specific group of people under specific conditions on a specific day. That is useful information — but it is not the same thing as an objective verdict on the image.

A Final Thought

The longer I spend around photography contests, the less I see them as objective rankings of photographic quality and the more I see them as complex, layered conversations — between images, judges, traditions, expectations, psychology, and timing.

What the research makes clear is that human evaluation of visual work is far less transparent than it feels from the inside. Judges form genuine preferences and defend them with genuine conviction. But those preferences are shaped by forces — sequence, context, viewing time, group dynamics, trend saturation — that are mostly invisible to everyone in the room, judges included.

Strong work does rise over time. Fresh perspectives do reshape the field. But the process is often less linear and more contingent than photographers imagine when looking at any single year's results. Understanding that has made me take both success and disappointment a little less personally.

That's probably healthy.