
a theory of lemons that is bananas

  • May 6
  • 9 min read

Updated: May 7

my answers to the questions Fontcuberta didn't ask



FLAMINGONE AI mockup


PetaPixel just published an essay co-authored by Boris Eldagsen and me, which should be an interesting read beyond the photographic community, with implications for anybody who consumes images. The article is a critique of Joan Fontcuberta’s “algorithmic photography,” presented in his latest book. Since we are mentioned in it, Boris and I felt it necessary to respond to an artist and intellectual who theorizes that we're past the distinction between photography and AI imagery. His hypothesis starts with a lemon tree and ends with reality and imagination in a blender, which makes for a smoothie that tastes a bit off and is quite hard to digest.


Read our joint PetaPixel essay, “A Necessary Critique of Fontcuberta’s Algorithmic Photography,” for the full picture, discover Boris' perspective on his website, or look at it through my lens by reading on:




Joan Fontcuberta:

When I got married, some friends gave me a lemon tree […] We planted it and it grew happily. […] after twenty-five years […] the lemon tree began to produce oranges. […] A friend who is an expert in citrus fruits […] gave me a plausible explanation, […] our lemon tree had almost certainly been grafted onto a branch of an orange tree, and over time it began to reveal its true hybrid nature—non-binary and ambivalent. Personally, I preferred to keep thinking that the tree had found the courage to come out of the closet. All the more so because it seemed to me a magnificent metaphor for what is happening to photography today, which is also going through a phase in which it is about to come out. Let me explain. For two centuries, we have attributed to photography a descriptive accuracy of reality that guaranteed absolute documentary fidelity. Now, however, algorithmic photography is blending with optical photography, and we no longer know which way to turn. Immediately, we encounter a semantic and terminological problem. There are photographic images produced by cameras and photo-optical recording systems. And there are others—apparently photographic—produced through generative AI visualization systems. The former are children of chemistry and light; the latter of computing and darkness. We must therefore begin to decide whether both types of image should be considered photographic. If we focus on the processes involved, it is obvious that they are different kinds of images. Yet the difficulty of finding a word capable of classifying photorealistic representations of algorithmic origin weakens the decisiveness of that answer. These are images without a real referent—what we might call nemotypes. Some have proposed the term promptography, because such images originate from a prompt—that is, natural-language instructions given to a system in order to obtain the desired photographic result. There have been other attempts, such as syntography, but none have prevailed. 
When photography was shaken by the arrival of digital technology, it became necessary to specify that there had been a previous form to which a distinguishing adjective was now added: we had analog photography—or photochemical photography—versus digital photography. At that time, there was no need to invent or assign a specific new name, and nothing disastrous happened. Therefore, we could probably proceed in the same way now and still understand one another perfectly.

The lemon is a tricky fruit, linguistically speaking, even beyond Fontcuberta’s allegory. In his home country, Spain, a lemon is called “limón”, whereas in much of Latin America, “limón” means lime. Lemons, limes, oranges: they are all citrus fruits, but likening them is not unlike comparing apples and oranges.

Here’s the thing, plain and simple, all fruits aside: this isn’t about wishy-washy linguistic interpretations of imagery and art; this is about solid scientific fact. Photography is written with light; AI imagery is written with code. The former captures the real world, the latter conjures imaginary worlds.


A linguistic disagreement on terminology does not translate into a scientific dispute around the factual difference between the processes involved in creating images, from paintings to photographs to AI pictures. There is a science to art, and it’s in the process.


The difference between analogue and digital photography could easily be summed up by an adjective because the underlying photographic process (capturing light) hadn’t changed, only the means by which that light was captured and stored (chemically vs. electronically, film vs. sensor). However, to arrive at an AI image, you have to take a completely different procedural route, which deserves a completely different name.


To disregard a giant procedural difference between two mediums rather than come up with one little word to describe the new one is disproportionate and misdirected. That would be like calling every fruit that came after the banana (which came long before oranges and lemons) a banana too, and that would be bananas.





Joan Fontcuberta:

[…] But the debate goes deeper: are we dealing with images belonging to different classes, or simply photographs of different rank? […] It is easy to imagine that everyone dreamed of inventing a technique capable of producing faithful representations independent of human skill—as if nature could represent itself without the mediation of pencil or brush. The camera eventually fulfilled that role, producing rigorous and detailed visual records. Since then, billions of photographs have been produced, and these images now constitute the very material used to train generative neural networks. In fact, AI functions like an ogre forced to devour enormous quantities of images in order to produce plausible results. Thus, algorithmic photographic images, although derived from the visual heritage of the entire history of photography, carry an undeniable photographic DNA. For this reason, they could reasonably be considered second-generation photographs. Roland Barthes once wrote that every photograph awaits a text. Now the situation is reversed: it is the text that generates the photograph.

Reverse engineering Fontcuberta’s example and following his argument that favors rank over class, photographs of paintings “could reasonably be considered” second-generation paintings. But if we started calling that $10 Van Gogh print from the gift shop a painting, we “could reasonably be considered” madder than the Dutch master himself.


When Microsoft had an AI hallucinate “The Next Rembrandt” and a 3D printer imitate the texture of oil on canvas, we couldn’t call the result a “painting” without putting the word in quotation marks. It’s not the real deal. In the same vein, a photorealistic AI image does not become a photograph (just as a photorealistic painting does not become a photograph).


All it takes to stop this purely dialectical carousel around rank and class is common sense—we know intuitively what’s what: paintings are paintings, photographs are photographs, and AI images are AI images, because they are derived from vastly different processes and intentions.





Joan Fontcuberta:

This terminological issue—behind which lies a deeper ontological question—came to the attention of the media when the work The Electrician, belonging to the series Pseudomnesia by the German photographer Boris Eldagsen, won the Sony World Photography Award 2023 in the “Creative” category. […] The Canadian photographer Miles Astray, specializing in nature and travel photography, reversed the logic of Eldagsen’s action: he submitted a real photograph to the newly created AI-image category of another important competition, the Color Photography Awards. […] Indeed, both cases highlight an uncomfortable but unavoidable reality: the dividing line between human creation and that generated by artificial intelligence is rapidly fading, if it has not already disappeared entirely. […] Their intention was to reveal the unreliability of validation systems in competitions of this kind. These may have been minor infractions, but they pointed toward a much more crucial issue: determining the status and labeling of images, their lineage, their pedigree. Both initiatives might appear as provocations, but in reality, they offered a necessary critique: if a photograph taken with a camera can be mistaken for an image generated by a machine – or vice versa – then we must rethink how we define the boundaries between images, and also concepts of authorship, creativity, and visual truth. Rather than making us victims of deception, these gestures provide a useful conceptual shock.

To correct all the false information in this passage—from my photographic focus and the intentions behind my stunt to the name of the competition I participated in—would go beyond the scope of this rebuttal. But it is important to point out that it is littered with false information. Facts still matter, whether they are captured in imagery or words. In fact, they matter more than ever in this post-truth epoch. If a text on the very topic of “documentary fidelity,” written by an intellectual with the best intentions, is riddled with mistakes, truth is put on its deathbed.


Admittedly, the concept of truth can be vague to begin with. Universal truths are hard to find, and personal truths—tethered to opinions—are abundant. Fontcuberta’s hybrid tree is both a lemon and an orange, depending on how you look at it. Opposing perspectives can coexist. The concept of reality is a little firmer than truth when you squeeze it; nonetheless, it remains foremost a concept as well.


Oranges are not inherently orange: their color is not a fixed physical property but the result of light interacting with their surface, which reflects some wavelengths and absorbs others. Moreover, different animal species perceive different wavelengths of light, experiencing diverging realities while cohabiting the same planet. And if that weren’t enough confusion, reality collapses into a mere probability function at the quantum level.


However, once we return from these meta realms to our human dimension, pragmatism is of the essence. Society frays if we cannot agree on a universal fabric holding it together. If we cannot agree on certain facts, reality becomes optional, with real consequences. Powered by social media and supercharged by AI, the exponential spread of disinformation and misinformation is already starting to erode democracies and societal cohesion around the world.





Joan Fontcuberta:

Despite everything, the fundamental issue that troubles both specialists and the public concerns the credibility of images. Some wonder whether a prompt-generated photograph will one day win the World Press Photo award. But perhaps the question is wrongly framed. What should really be questioned is whether competitions like the World Press Photo still make sense. We now live in a visual regime in which images increasingly construct the world rather than simply represent it. […] Perhaps we should even be grateful for their proliferation, because they remind us of the necessity of doubt. Algorithmic photography reinforces the idea that every image is, inevitably, an illusion and forces us to reconsider the trust we place in images. […] Photography, therefore, has never truly been objective; we simply chose to believe that it was. Today, with AI acting as a new demiurge, documentary photography quietly slips between historical narrative and fabricated illustration. Deepfake technologies have opened Pandora’s box of iconography: thousands of hyperreal scenes and faces created from nothing flood our screens. We no longer look in order to understand—we look in order to doubt. […] Every technology of vision has reshaped how we perceive the world. What we are witnessing today is the transition from optical realism to informational realism—a synthetic realism summoned by commands, texts, and strings of code. From Greek realism, to Renaissance perspective, to Enlightenment aspirations for accuracy, we have suddenly arrived at a condensed synthesis of all these visual regimes. And now a single prompt can generate an image that might once have required centuries of technological evolution.

AI as a new visual undercurrent won’t wash away bedrock institutions like World Press Photo. It’s in the name: world. press. photo. Three pillars AI could never shake. It cannot produce real photos of a real world for real press articles.


Of course, it’s true that photography “has never truly been objective.” A photographer’s choices, like what is left out of a frame and therefore left out of the visual narrative, have always made accuracy an approximation, which is why captions must give context to the documentary images exhibited by World Press Photo.


These are natural limitations that actually increase a photographer’s ambition of documentary accuracy. Doubting the continued relevance of press photos, Fontcuberta diminishes these efforts by shrugging off important distinctions of image creation and equating photographic evidence with illustrative exemplification.


As limited as photography might be in accuracy, AI is technologically incapable of recording actual events. Its only bearing on such photo awards is to reinforce the notion that they are more relevant than ever.


The statement “we no longer look in order to understand—we look in order to doubt” is catchy. Unfortunately, sober facts can look pretty boring next to such sensational one-liners, which is exactly why the press is struggling to compete for attention with viral social media accounts. The boring truth is that we still look in order to understand—hardcoded thinking related to our survival did not change overnight when LLM algorithms hijacked our brainwork in 2022. What changed is that we need to doubt more now. And maybe Fontcuberta is somewhat right when he muses whether that’s a good thing—certainly, we could use more critical thinkers.


But we’re already halfway down a slippery slope here. Historically, the veracity of images was fairly easy to establish. The manipulation of photographs was a cumbersome darkroom process that took time and skill. There were few who mastered it and many who could debunk it. That balance shifted with digital postproduction software, and fully flipped with AI. No matter how many critical thinkers we can raise, no matter how well-trained they are, you don’t stop an unchecked flood of AI slop with critical thoughts alone. Institutional guardrails and entrepreneurial ethics must serve civil society to the same degree we hold governments and the private sector accountable with our voting and purchasing decisions.


If these actors act together, Fontcuberta’s “synthetic realism” remains but a catchy phrase that tries to shrink eons of visual history—from cave paintings to pictorial messages flying through the cosmos aboard our space probes—by squeezing them into one binary contemporary age of catchall imagery.

To depict humanity’s diverse tools and methods of visual creation as culminating in an artificial smoothie is a misrepresentation of their evolution: photography is not an evolutionary progression of painting that replaced its predecessor, and AI does not replace cameras; those mediums, tools, and processes coexist, and will continue to coexist as evolved forms of expression, the same way lemons and oranges coexist as they succeed their common citric ancestor.




The conversation around our essay is already picking up across the art community. For a rebuttal of our rebuttal, check out Grégory Chatonsky’s opinion piece, which is a powerful train of thought, derailed only by the faulty tracks it runs on. I'm inspecting the crash site here.

 
 