

Video codecs mostly work by tracking movement, predicting which pixels will change, and striving to only encode the pixels that actually change or change dramatically. In other words, compression looks for patterns.
All of that goes out the window when you try to compress static. There are no patterns. It simply can’t be compressed. This isn’t a matter of the algorithms not being good enough. It’s a fundamental limit of information theory.
Anything fancier amounts to embedding the intro into the compressor as a well-known pattern. And at that point, you’re better off just caching a 4K version of the intro as a standalone video file directly in the app.
That’s a good idea. They could probably do something similar for the audio.
They’d have to code around the rest of the animation and audio effects, but the size of that code would certainly be smaller than the rendered audio and video.