Chances would be greater for real world data. It's not perfectly random. Server could record the most common byte sequences and report those.
Correct but then again I'm assuming the file is on the server for archival storage to begin with so it's probably compressed in one way or the other which does a pretty good job at removing the common sequences and making it pretty close to random.