In Defense of Audio: Why Podcast Interviews Don’t Need Video by Pat Rullo

In the age of Zoom and YouTube, it sometimes feels as though every conversation must also be a visual event. Cameras on. Lights adjusted. Backgrounds curated. But for podcast interviews, there’s a strong case to be made for something simpler—and often more powerful: audio only.
During my twelve years as a terrestrial radio host, I thrived in the vibrant atmosphere of a radio studio, learning from skilled board operators, sound engineers, and editing staff. Their expertise shaped my understanding of broadcasting. I grew to appreciate the anonymity of audio, where the true essence of storytelling rests solely in the power of words, creating a unique connection with listeners.
This topic, In Defense of Audio, comes to mind as I feel compelled to share my experience from a previous interview. The guest arrived, Zoom camera on, giving me a clear line of sight into their living room. As we engaged in conversation, my attention was repeatedly drawn to an unexpected sight: a small rodent scurrying across the floor. It would dart out from behind the door, race across the carpet, and then swiftly disappear behind a chair, only to reappear moments later. This little creature’s energetic antics persisted throughout our entire discussion. Should I bring it up? Was it a mouse, a rat, or perhaps a vole or a chipmunk? Can a stifle a laugh? Regardless, the rodent’s persistent movements made it incredibly difficult for me to focus on our conversation.
The Freedom of Not Being Seen
When a guest sits down for a video interview, a small part of their attention inevitably shifts toward appearance.
Is the lighting okay?
Is the camera angle flattering?
Is the background distracting?
How’s my hair? Oh my, do I really look like that?
Even if these concerns last only a moment, they introduce a subtle form of self-monitoring. Communication researchers describe this through Social Presence Theory—the idea that when people feel visually observed, they become more aware of how they present themselves.
Remove the camera, and something interesting happens. The pressure eases. Guests stop performing and begin speaking. Answers become less rehearsed and more reflective. Stories take their time. Pauses appear where they naturally belong.
The conversation becomes, quite simply, a conversation.
Listening Becomes the Center of the Experience
Video invites the brain to process multiple streams of information at once: facial expressions, eye contact, posture, gestures, and the environment surrounding the speaker. While this can enrich certain interactions, it also divides attention.
Psychologists describe this through Cognitive Load Theory, developed by John Sweller. The theory suggests that the brain has limited processing capacity. When multiple channels compete for attention, depth of focus can suffer.
Audio simplifies the channel.
When all that remains is the voice, both host and guest listen differently. Tone becomes more noticeable. A moment of hesitation signals uncertainty or thoughtfulness. Laughter carries warmth. A quiet pause can speak volumes.
For the host, this kind of listening often leads to better questions. For the guest, it creates space to think out loud.
The Voice Creates Intimacy
There’s another reason audio interviews have endured for nearly a century: they feel personal.
When listeners hear a voice without visuals, the brain instinctively fills in the gaps. This is a form of “social imagination.” The listener constructs a mental picture of the speaker, the setting, and even the expressions that might accompany the words. Instead of observing a conversation from the outside, the listener feels as though they are inside it.
This is why radio hosts can feel like companions on a long drive, and why audiobook narrators can hold a listener’s attention for hours. The absence of visuals invites imagination—and imagination deepens connection.
Even Video Research Suggests a Limit
As remote work exploded, researchers began studying the strain people experience from constant video calls. One of the most widely cited studies comes from Jeremy Bailenson, who examined the causes of what became known as “Zoom fatigue.”
Among the findings: video communication increases self-awareness and mental load because participants are continually aware of being watched.
Audio conversations avoid much of that pressure. Without the camera, participants relax. The interaction begins to resemble a phone call rather than a performance.
And relaxed conversations are often better conversations.
The Medium the Audience Actually Uses
There’s also a practical reality to consider.
Most podcast listeners aren’t sitting in front of a screen. They’re driving, walking the dog, folding laundry, exercising, or commuting. The podcast accompanies their day rather than demanding their full visual attention.
For these listeners, the strength of the medium isn’t video—it’s voice.
A compelling audio conversation doesn’t compete with their activity; it flows alongside it.
Returning to the Power of the Voice
Video has its place. It can be useful for promotion, discovery, or social media snippets. But when the goal is a thoughtful conversation between a host and an author—or any guest—the simplest format may still be the most effective.
Audio removes performance.
Audio sharpens listening.
Audio invites imagination.
Most importantly, audio restores the focus to what podcasting does best: two voices meeting in conversation, and a listener leaning in to hear what happens next.
So, next time we chat on the Authors on Fire podcast, feel free to wear your mismatched jammies! And invite your gerbils, too!
