This is outside of my typical work, but I'm starting a new job in this realm shortly and I'm very interested in learning more about it for fun. Anyone have good books or resources for this? Thanks.
Associated HN post, although there have been a few: https://news.ycombinator.com/item?id=30710574
More technical look at video codecs in general with sources: https://github.com/leandromoreira/digital_video_introduction...
Also I learned a lot by messing with the ffmpeg source, it's pretty readable.
Sound is a “channel-pitch” stream, which relates a single-, or more different streams through connected pitches. You don’t need any programming to practice! Use a pen and ecofriendly paper: “record” your voice by “strict, composition sheet-music”: A C A Aflat etc. Recompose, rehash, and other operations can turn the sound into all known formats and related sounds!
For video, try to manipulate a single distilled image, use a point-by-point display for convenience—again, on paper. Try to split the image in 2, find which way is about equal, and develop a step-by-step algorithm: this is the most advanced ratio-encoding method. Try to convect an image with another image, which is to blend them into a single square of space, without interfering the images: one is a “C”, and the other an “o”, for example. This is the most useful single-display format, which is how avimpeg-2, hma, ncoa—which is an old vhs format— all were combining different frames. Good luck!
example https://github.com/blakeblackshear/frigate/blob/08f573aaa5ec...
One of my backburner tasks that will probably remain forever is to get a grip on DSP, obtain General/Amateur ham radio license, mess around with it, yadda yadda yadda.
I've always meant to get around to digesting this article:
https://news.ycombinator.com/item?id=9870408 ("How Shazam works")