Video editing

How to edit a music video: cutting to the beat and the song

Markers on the beat, performance takes synced to the track, multicam coverage, and cuts that follow the song's energy from intro to drop.

By Hanna Eng·Video editor, Free Conservatory of French Cinema

Updated 3 June 20268 min read
Part of: The craft of editing

To edit a music video, cut to the beat of the track, not to the picture: drop markers on the downbeats, sync every performance take to the song, and match your cutting energy to the structure (calm in the verse, faster in the chorus or drop). Edit to a single reference mix so the timing never drifts.

A music video is the one edit where the soundtrack is fixed before you start, and everything you cut has to answer to it. The track sets the tempo, the structure and the energy, and your job is to make the picture move with it: lips in sync, cuts on the beat, and the visual energy rising and falling exactly where the song does. This guide walks the workflow an editor uses to get there, with the audio-aware detail that comes from also mixing the music.

Song section to editing approach

Song sectionEditing approach
IntroEstablish the artist and the look. Long shots, slow cuts, set the world.
VerseHold shots longer, cut on phrase ends. Let the lyric and performance breathe.
Pre-chorus / buildShorten the cuts progressively. Tighten the rhythm to lift toward the drop.
Chorus / dropCut hardest, on the beat. Best angles, widest energy, the visual payoff.
Bridge / breakdownChange texture: a different location, speed ramp or held shot to reset the eye.
OutroWind the energy down, lengthen shots, resolve to a closing image.

Start from the track, not the footage

Every other edit is built around picture; a music video is built around sound. Before you touch a single shot, you lay the final, approved mix of the song on the timeline and lock it. That reference track is the spine of the whole edit: every cut, every sync point and every energy shift is measured against it.

Use one consistent version of the song from start to finish. If the mix changes later, even by a few frames at the top, every on-beat cut you made can drift out of sync. Get the final master on the timeline first, or at least a version whose timing will not move.

Before the edit: prep that makes a clip cut well

A music video is shaped in the edit, but the cut only sings when the prep upstream is solid. Before a single cut, the strongest clips start from a clear treatment: the artist's intent, the visual idea, and a sense of how the song's energy should map to the picture. As the editor you inherit that intent, so the more of it that reaches you, the tighter the cut.

Two things make an editor's life easier: a shot list or storyboard that says which setups exist, and the final master of the track. Knowing what was shot, and against which part of the song, lets you find the usable performance for every section instead of hunting blind through the rushes. If a treatment or director's note exists, read it before you build the timeline.

Prep also means organising the footage before you cut. Group takes by setup, label the multicam angles, and confirm every performance take was shot to the same master you will edit against. A clip where the takes were filmed to a different mix or a phone playback will drift, and no amount of editing fixes a performance shot out of sync with the track.

Find the tempo and lay down markers

Cutting on the beat is precise work, so give yourself a visible grid to cut against. Find the song's BPM (beats per minute) and place markers on the beats, or at least on the downbeats and the section changes, so the rhythm of the track is laid out on the timeline before you cut anything.

You can drop markers live by tapping a key in time with the music on a first playthrough, then tidy them up, or work to a known BPM. Either way, those markers turn an invisible rhythm into edit points you can snap a cut to, and they mark exactly where the verse becomes the chorus and where the drop lands.

  • Mark the downbeats (the 1 of each bar) as your primary cutting grid.
  • Add markers at every structural change: intro, verse, chorus, drop, bridge, outro.
  • Flag the standout musical moments (a vocal hook, a snare hit, the drop) you will cut hardest to.

Sync the performance takes to the track

On a performance clip, the artist mimes to a playback of the song on set, so every take is already roughly in time. Your job is to make it exact. Line up each performance take against the reference track so the lip movements match the vocal and the played notes match the instrument, then check the sync on a sharp, visible moment such as a hard consonant or a hit.

A waveform is your friend here: the spike of a snare or a plosive in the take should line up with the same spike in the track. Once one take is locked to the song, the others reference the same master, so you can cut freely between angles without the mime drifting off the vocal.

Cut a multicam performance

Most performance videos are shot several times from different angles to the same playback, which makes them a natural fit for a multicam edit. You sync every angle to the reference track once, group them, then switch angles live as the song plays, cutting to a new camera on the beat.

The discipline is to switch for a reason, not just to fill bars. Change angle on a structural moment (the chorus arriving), on a lyric that wants a close-up, or on a strong beat that the cut can land on. A multicam edit makes the mechanics fast; the musicality is still your decision.

  • Sync all angles to the same reference track, then group them as a multicam clip.
  • Favour the close-up on the hook and the vocal, the wide on the instrumental and the energy.
  • Cut angles on the beat, and change angle on section changes so the picture turns with the song.

On-beat and off-beat: rhythmic cutting

The simplest rhythmic cut lands on the beat: the new shot appears exactly on the downbeat, and a run of on-beat cuts feels tight, driving and locked to the track. It is the default for a chorus or a drop, where you want the picture hitting as hard as the music.

But cutting on every beat for a whole song becomes mechanical. Pros vary it: cut on the off-beat or the upbeat for a looser, syncopated feel, hold a shot across several bars to build tension, then snap back to on-beat cutting when the energy lifts. The rule is intent, not metronome: every cut is placed because the music asked for it, not because a beat went by.

Match the cutting energy to the song structure

A song is not flat, and the edit should not be either. The structure of the track (intro, verse, chorus or drop, bridge, outro) is a map of its energy, and your shot lengths should follow that map. Hold shots longer and cut more slowly in the verse, then shorten the cuts and pick up the pace into the chorus or the drop, where the cutting is fastest and the best material lands.

Think of average shot length as a dial you turn with the song: slow in the calm sections, fast in the peaks. A common build is to shorten cuts progressively through a pre-chorus so the edit accelerates into the drop, then open the shots back up in a breakdown to let the viewer breathe before the next peak. The table above maps each section to its typical approach.

Performance, narrative and the mix of both

Music videos fall into two rough types, and knowing which you are cutting changes the approach. A performance video is the artist playing or singing to camera, and lives on energy and sync. A narrative or concept video tells a story or builds a visual idea alongside the song. Most videos blend the two: performance for the chorus, narrative for the verse.

When you intercut performance with story, the track still governs the timing. Land your transitions between the two worlds on section changes or strong beats, so the structure of the song carries the structure of the video. The performance keeps the energy honest; the narrative gives the eye somewhere to go between the hooks.

Edit, then hand off clean for colour and sound

A music video almost always goes to a colour grade, often a bold, stylised one, so the edit you lock has to be ready for it. Keep your timeline organised, avoid baking heavy looks into the cut, and export or hand over a clean, conformed version so the grade starts from your real edit, not a flattened render.

The same discipline applies to the audio. Because the same person here also mixes and masters music, the edit is built knowing the final master is what plays in the video: the timeline references that mix, the sync is checked against it, and the delivered file carries the mastered track at full quality rather than a rough bounce. Picture cut to a finished mix, and a finished mix that matches the cut, is the whole point of an audio-aware edit.

Frequently asked questions

How do you edit a music video?

Lock the final mix of the song on the timeline first, find the BPM and place markers on the beats and section changes, sync your performance takes to that reference track, then cut to the beat with shot lengths that follow the song's energy: slower in the verse, fastest in the chorus or drop.

How do you cut a video to the beat of the music?

Find the song's tempo and drop markers on the downbeats, then place your cuts so a new shot lands on the beat. Snap cuts to the markers for a tight, on-beat feel, and vary it with off-beat cuts or held shots so the editing does not become mechanical.

What is BPM and why does it matter for editing?

BPM is the song's beats per minute, its tempo. Knowing it lets you place markers on every beat and build a visible rhythmic grid on the timeline, so you can cut precisely on the beat and judge how fast the editing should feel in each section.

How do you sync a performance to the music in a video?

The artist mimes to a playback on set, so each take is roughly in time. Line the take up against the final song using the waveform, matching a sharp moment such as a hard consonant or a hit, until the lip movements track the vocal. Sync every angle to the same reference track.

Should every cut land on the beat in a music video?

No. On-beat cuts feel tight and suit a chorus or a drop, but cutting on every beat for a whole song becomes mechanical. Vary the rhythm with off-beat cuts and shots held across several bars, and cut hardest only where the music peaks.

How long should shots be in a music video?

There is no fixed length; shot length should track the song's energy. Hold shots longer in the intro and verses, shorten them through a build, and cut fastest in the chorus or drop. Average shot length is a dial you turn down for calm sections and up for the peaks.

What is the difference between a performance and a narrative music video?

A performance video shows the artist playing or singing to camera and lives on energy and sync. A narrative or concept video tells a story or builds a visual idea alongside the track. Most videos blend the two, using performance for the chorus and narrative for the verse, with the song's structure governing the timing.

Does editing a music video start before the shoot?

The cut happens after the shoot, but the choices that make it possible start earlier. A clear treatment, a shot list tied to the song, and every take filmed to the same final master are what let an editor build a clip that lands. When that prep is missing, the edit spends its time fixing problems instead of building energy.

Do you need a storyboard to edit a music video?

Not strictly, but it helps. A storyboard or shot list tells the editor which setups were filmed against which part of the track, so the right performance is easy to find for every section. Without one, you reconstruct the plan from the rushes, which is slower.

Sources and references

Have a project that needs this done right?

If your mix has to pass a platform spec, let's talk about the deliverables and the timeline.

Start a project