How-To Guide

How to Shoot ASMR Content for Social Media

ASMR is the purest audio-first format on social: the triggers are almost entirely sound. So the production starts with the room and the microphone, not the camera. Grounded in PLOS ONE research.

9 min read

By Bell Chen, founder. Last updated May 24, 2026.

How to Shoot ASMR Content for Short-Form Video (Setup Guide) hero image

A 2018 study in PLOS ONE by Giulia Poerio gave ASMR its first rigorous definition: "tingling sensations in the crown of the head, in response to a range of audio-visual triggers such as whispering, tapping, and hand movements," per Poerio et al. (journals.plos.org). The same study found the effect is physiological, not imagined: "ASMR participants showed significantly greater reductions in heart rate after watching both ASMR videos compared to non-ASMR participants," per Poerio et al. The triggers that produce it are almost entirely sound, which is the single most important fact for shooting the format.

That makes ASMR the purest audio-first format on social. Fred van Leeuwen, writing in Fstoppers, put the general principle bluntly: "Technically speaking, you can have a terrible-looking video, but if you have decent sound, there's still a chance you can get away with it," per van Leeuwen (fstoppers.com). For ASMR the principle is absolute: a beautiful-looking video with thin, noisy audio fails completely, while a plain one with clean, intimate sound works. So every production decision starts with the microphone and the room.

What You'll Need

  • A small, quiet, soft-surfaced room
  • A low-self-noise microphone (binaural if possible)
  • A pop filter or foam windscreen

Time: Half a day to set up, then per-video

Why ASMR lives or dies on audio

ASMR creators record sounds at the threshold of hearing: a whisper, a fingernail tapping glass, a brush against fabric. At that level the enemies are the microphone's own self-noise and the room's ambient noise, both inaudible in normal video but roaring underneath a whisper. A phone mic or a noisy room turns delicate triggers into a hissy mess, and no amount of editing recovers it.

So the production order is the opposite of most video. The room and the microphone come first, the visuals come last, and the performance, slow and deliberate, is built to feed the mic rather than the camera. Get the audio intimate and clean and a simple shot works; get it wrong and the best visuals are wasted.

Step by step

  1. 01

    Step 1. Make the room dead quiet

    Record in a small, soft room and silence everything that hums: fans, fridge, air conditioning, phone notifications. Soft surfaces, carpet, curtains, bedding, acoustic panels, absorb the reflections that muddy close-miked sound, which is why a treated bedroom often beats a large studio for ASMR. The room noise you cannot hear at conversation volume becomes obvious the moment you record a whisper.

    Deliverable

    A near-silent, soft-surfaced recording space.

  2. 02

    Step 2. Use a low-self-noise mic, ideally binaural

    Because the sounds are so quiet, the mic's self-noise matters more than for any other format, so aim for the lowest noise floor you can afford. A binaural setup, two microphones spaced roughly at the distance between human ears, recreates the three-dimensional in-your-ears sensation that makes ASMR work in headphones, which is how most of the audience listens. This is the one piece of gear worth prioritizing over the camera.

    Deliverable

    A low-noise mic, binaural if possible, as the primary investment.

  3. 03

    Step 3. Close-mic and control plosives

    Move the source close to the mic for intimacy, and vary the distance to change intensity within a video. A pop filter or foam windscreen tames the plosives and breath blasts that close-miking exaggerates. The goal is sound that feels like it is happening next to the listener's ear, not across a room.

    Deliverable

    Intimate, plosive-free close-mic sound.

  4. 04

    Step 4. Choose triggers the research supports, and perform them slowly

    Poerio's audio-visual triggers, whispering, tapping, and hand movements, are the proven starting set; build a video around a few of them rather than a chaotic mix. Perform each slowly and deliberately, holding a sound long enough to land, because ASMR rewards patience and repetition where normal short-form rewards speed.

    Deliverable

    A small set of clear, slowly performed triggers.

  5. 05

    Step 5. Shoot the visuals to match the sound

    The camera is in service of the audio. Use soft, low light, slow deliberate movements, and tight framing on the hands and objects making the sound, so the picture reinforces the calm the audio is creating. A clean, simple shot that matches the pace of the triggers beats anything busy or fast-cut, which would break the relaxation the format exists to produce.

    Deliverable

    Slow, soft-lit visuals framed on the sound source.

What a working ASMR video does

It produces the response the research describes, the tingles and the measurable drop in heart rate, which on a platform shows up as long watch times and saves, because viewers return to videos that reliably relax them. ASMR competes on audio quality more than on reach tactics: the channels that win are the ones whose sound is consistently clean and intimate, so the audience trusts that pressing play will work.

The platform signal that matters most for the format is the one Adam Mosseri put first for Instagram reach, "Watch time," per Mosseri (instagram.com). ASMR videos are long and, when the audio is right, watched closely to the end, which is exactly what the platforms reward. The audio quality is not a finishing touch; it is the thing that earns the watch time.

The failure modes

Recording in a noisy room. The hum and ambient noise that vanish in speech roar under a whisper.

Relying on a phone or camera mic. The self-noise alone ruins threshold-level sound.

Performing too fast. ASMR rewards slow, held, repeated triggers, not speed.

Prioritizing the visuals. A pretty ASMR video with thin audio fails; the audio is the product.

What to track

Average watch time and completion, since ASMR is judged on how long and how fully it is watched.

Saves, the signal that a viewer means to return to a video that reliably relaxes them.

A headphone listen-back before publishing, the one quality gate that catches the noise speakers hide.

Where a planning-first tool fits

The room, the mic, and the performance are physical craft; no software changes them. Where a planning tool helps is deciding the format: which triggers, in what order, for how long, and how to frame a series so episodes are consistent and findable. A planning-first tool that turns a brand or channel profile into scripts and shot plans is one way to structure that, alongside the audio gear that does the real work. The method is the point; the tool is the speed dial on it. Superdirector is the planning-first tool I built around turning ideas into scripts and shot plans.

Disclosure by Bell Chen, founder of Superdirector: the planning features referenced above are part of the product I build. The procedure on this page is platform-agnostic and the tool choice is a workflow preference, not a quality requirement; the ASMR definition and physiology are sourced from Poerio et al. in PLOS ONE, the audio-first principle from Fred van Leeuwen in Fstoppers, and the watch-time signal from Adam Mosseri, all cited inline.

Frequently asked questions

What microphone is best for ASMR content?

Prioritize a low self-noise floor, because you are recording sounds at the threshold of hearing and a noisy mic drowns them. A binaural pair (two mics at roughly ear distance) recreates the three-dimensional headphone effect that defines the format. Avoid a phone or camera mic; the self-noise alone ruins delicate triggers.

Does ASMR content work for brands?

Yes. Product-focused ASMR (unboxing, texture close-ups, satisfying packaging) can beat standard product video when sound and visual texture are central to the product. Food, beauty, stationery, and craft brands are natural fits, provided the audio is clean and intimate rather than an afterthought.

Why does audio matter more than video for ASMR?

Because the triggers themselves are sound. Peer-reviewed research describes ASMR as tingling sensations produced by audio-visual triggers like whispering and tapping, and most viewers listen on headphones. A plain-looking video with clean, intimate audio works; a beautiful one with thin, noisy audio fails completely.

Start with your brand, product, profile, or video

Analyze viral ASMR content for production techniques and patterns

Generate a campaign brief

More Guides

Related Content