Dude, you're going to shit bricks when you realize most computer science jargon is just marketing buzzwords on top of marketing buzzwords and the terms never meant anything more or less it needed to sell a product.
For example, what the hell is big data? What is a scripting language? Is your DB web scale?
datasets large enough that it was impractical to try to store or work with them in a traditional relational database software
How I remember it is that it's not even the whole dataset that is too large, but the individual records. Hadoop for example is not doing anything magic, it's just a software package to extend MySQL to be able to efficiently have pictures (Facebook's original use case, of course it evolved) as records.
I guess big data is what you need it to justify what you want to justify. In one of my gigs' case, it was public funding for a project.
Yeah that’s a fair point, I was dealing with some records that had several 128 bit string columns recently, and it was tougher to process efficiently than a table several times its size made up of ints and bools.
Hadoop was based on MapReduce, and really just provided a framework for distributing storage and processing, there was no actual semantic layer to interact with it built in. It was a pretty foreign land for a SQL Server/Oracle admin. Hive was the data warehousing module on top of Hadoop that added HiveQL for interacting with it in a SQL like way, Spark and SparkSQL/PySpark were popular as well. It really did seem kind of magical at the time, NoSQL was a buzzword there for a while.
Wait, so, the Cloud is actually just a bunch of other computers, called servers, and the only real innovation is basically a load balancing system?
Next youre gonna tell me I wont be able to stream lagless video games and also do competitive multiplayer on my Google Stadia, pff, like youre some kind of expert or something.