Big Data

Big Data Text Stream Simulation

GitHub
A simulation of n concurrent incoming text streams. Inputs are stored and parsed by an ML pipeline in real time. Intended to mimic LLM output streams.
Kafka, Spark, Hive
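
The idea of n concurrent text streams feeding one consumer can be sketched with the Python standard library alone. This is a minimal illustration, not the project's code: the in-memory `asyncio.Queue` stands in for a message broker, the word count stands in for the ML parsing step, and every name here (`VOCAB`, `text_stream`, `consume`, `run_simulation`) is hypothetical.

```python
import asyncio
import random

# Hypothetical vocabulary; the real project's data source is not described.
VOCAB = ["stream", "token", "model", "data", "spark", "kafka", "hive"]


async def text_stream(stream_id, n_messages, queue, rng):
    """Emit n_messages short text chunks, loosely like one LLM response stream."""
    for seq in range(n_messages):
        chunk = " ".join(rng.choice(VOCAB) for _ in range(3))
        await queue.put((stream_id, seq, chunk))
        await asyncio.sleep(0)  # yield control so the streams interleave
    await queue.put((stream_id, None, None))  # end-of-stream marker


async def consume(queue, n_streams):
    """Store each chunk and apply a trivial 'parse' (word count) as it arrives."""
    stored = {}       # stream_id -> list of chunks in arrival order
    word_counts = {}  # stream_id -> total words seen so far
    finished = 0
    while finished < n_streams:
        stream_id, seq, chunk = await queue.get()
        if seq is None:  # this stream has ended
            finished += 1
            continue
        stored.setdefault(stream_id, []).append(chunk)
        word_counts[stream_id] = word_counts.get(stream_id, 0) + len(chunk.split())
    return stored, word_counts


async def simulate(n_streams, n_messages, seed=0):
    rng = random.Random(seed)
    queue = asyncio.Queue()  # assumption: stands in for a broker topic
    producers = [
        asyncio.create_task(text_stream(i, n_messages, queue, rng))
        for i in range(n_streams)
    ]
    result = await consume(queue, n_streams)
    await asyncio.gather(*producers)
    return result


def run_simulation(n_streams=4, n_messages=5, seed=0):
    return asyncio.run(simulate(n_streams, n_messages, seed))


if __name__ == "__main__":
    stored, word_counts = run_simulation()
    for stream_id in sorted(stored):
        print(stream_id, word_counts[stream_id], stored[stream_id][0])
```

Given the tools listed for this project, the queue would presumably correspond to a Kafka topic, the consumer to a Spark job, and the stored chunks to Hive tables, but that mapping is an assumption rather than something the description states.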