If you're looking for a way to automatically extract this info, tools like ScreenApp's Video Analyzer use AI to transcribe audio and recognize objects within frames to help build out content descriptions.
If you can describe the content of the video (AO-GMSSN1O8O.mp4), I can generate:
A content script: dialogue, voiceover, and visual cues.
A title, description, and tags, if you are planning to upload it to YouTube.