Systems

Deep dive into popular distributed systems and databases

Apache Cassandra

Distributed NoSQL database designed for handling large amounts of data across multiple commodity servers with no single point of failure

Apache Flink

Distributed stream processing framework for real-time analytics and event-driven applications

Apache Kafka

Distributed streaming platform for building real-time data pipelines and streaming applications

Apache Spark

Unified analytics engine for large-scale data processing with built-in modules for streaming, SQL, machine learning and graph processing

Apache Storm

Real-time computation system for processing unbounded streams of data with guaranteed message processing

Top Apache Projects

Explore the most influential Apache Software Foundation projects used in modern distributed systems and data processing

Apache Cassandra

Distributed NoSQL database designed for high availability, linear scalability, and handling massive amounts of data across multiple data centers

Amazon DynamoDB

Fully managed NoSQL database service designed for high performance at any scale

Elasticsearch

Distributed search and analytics engine built on Apache Lucene for real-time search, logging, and data analytics

Apache Flink

Distributed stream processing framework for real-time analytics, event-driven applications, and complex event processing

Apache Kafka

Distributed streaming platform designed for high-throughput, real-time data pipelines and event-driven architectures

Redis

In-memory data structure store designed for high-performance caching, sessions, and real-time applications