As you know, the computer only does math. A compiled program is just a sequence of numbers that tells the CPU which operations to perform; the CPU decodes each instruction coming in and does what is asked. Most of the time, though, you won't be dealing with raw binary yourself but with compiled code that runs with the help of the kernel.
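To make the "numbers as instructions" idea concrete, here is a rough sketch of a toy machine in Python. The opcode numbers and the `run` function are made up for illustration and do not match any real CPU; a real processor does this decoding in hardware, with far more instructions and registers.

```python
# Toy machine: these opcode numbers are hypothetical, chosen only for this example.
HALT, LOAD, ADD, MUL = 0, 1, 2, 3

def run(program):
    """Decode and execute a flat list of ints: opcode, operand, opcode, operand, ..."""
    acc = 0  # a single accumulator register
    pc = 0   # program counter: index of the next instruction
    while True:
        opcode = program[pc]
        if opcode == HALT:
            return acc
        operand = program[pc + 1]
        if opcode == LOAD:
            acc = operand
        elif opcode == ADD:
            acc += operand
        elif opcode == MUL:
            acc *= operand
        else:
            raise ValueError(f"unknown opcode {opcode}")
        pc += 2  # each non-HALT instruction occupies two slots

# The "program" is nothing but numbers: LOAD 5; ADD 3; MUL 4; HALT
program = [1, 5, 2, 3, 3, 4, 0]
result = run(program)
print(result)
```

The list `program` has no meaning on its own; it only becomes a computation because `run` agrees on what each number stands for.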
Video, text, and image files are quite different: they are stored in an encoding (a file format) that one of your programs understands, rather than the CPU or the kernel. To the CPU and the kernel, such a file is just a sequence of bytes.
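As a small illustration of that point, the sketch below takes the same four bytes and reads them three different ways. The byte values are arbitrary and the "grayscale pixels" reading is only a stand-in for what an image format might do; real formats add headers, compression, and so on.

```python
import struct

data = bytes([72, 105, 33, 10])  # four arbitrary bytes, as they might sit in a file

# A text editor might decode them as ASCII characters.
as_text = data.decode("ascii")

# Another program might read them as one little-endian 32-bit unsigned integer.
as_uint32 = struct.unpack("<I", data)[0]

# An image viewer might treat each byte as a grayscale intensity from 0 to 255.
as_pixels = list(data)

print(repr(as_text), as_uint32, as_pixels)
```

The bytes never change; only the program's interpretation of them does.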
The question definitely relates to computer science, because most of the time all of this is managed by the kernel, which is itself still software.
It's great that you are thinking this way as a first-year student; many people don't wonder at all and end up missing everything around them.