This is the format — a story edit

One clip, told as a storyline.

Not a talking head with captions. Here Dr Mel's voice is the spine, and the screen cuts to a different supporting visual on every beat — cinematic B-roll, motion graphics, and Dr Mel himself — each one visualizing exactly what he's saying. Same idea as your three references.

The cut, beat by beat

Time | Voiceover | Visual | On screen
0–2s | "In Jamaica, we have no problems…" | B-roll | Cinematic Jamaica coast
2–5s | "…we have situations." | Dr Mel | His face, it's him
5–9s | "we call them situations" | Motion gfx | problems ✕ → situations
9–13s | "if your wife catches you…" | Dr Mel | The joke, his delivery
13–16s | "you have a situation" | Motion gfx | problem → situation
16–19s | "a problem… it's not solvable" | B-roll | Locked door, dead end
19–22s | "a situation… there's always a solution" | B-roll | Road opening to sunrise
end |  | Endcard | Academy of Success

What's actually in this 22 seconds