You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Brooklin is a distributed system intended for streaming data between various heterogeneous source and destination systems with high reliability and throughput at scale. Designed for multitenancy, Brooklin can simultaneously power hundreds of data pipelines across different systems and can easily be extended to support new sources and destinations.
11
11
12
-
== Distinguishing Features
12
+
== Distinguishing features
13
13
14
-
* *Extensible for Any Source and Destination*
14
+
* *Extensible for any source and destination*
15
15
+
16
16
** Brooklin offers a flexible API that can be extended to support a wide variety of source and destination systems. It is not confined to single type of source or destination system.
17
17
** Source and destination systems can be freely mixed and matched. They do not have to be the same.
@@ -21,33 +21,33 @@ Brooklin is a distributed system intended for streaming data between various het
21
21
** Brooklin supports partitioned data streams throughout its core implementation and APIs.
22
22
** Brooklin can be deployed to a cluster of machines (scale out) to support as many data streams as desired.
23
23
24
-
* *Easy to Operate and Manage*
24
+
* *Easy to operate and manage*
25
25
** Brooklin exposes a REST endpoint for managing data streams, that offers a rich set of operations on them in addition to CRUD (e.g. `pause` and `resume`).
26
26
** Brooklin also exposes a diagnostics REST endpoint that enables on-demand querying of a data stream's status.
27
27
28
-
* *Battle-tested at Scale with Kafka*
28
+
* *Battle-tested at scale with Kafka*
29
29
** While it is not limited to any particular system, Brooklin provides capabilities for reading/writing massive amounts of data to/from Kafka with high reliability at scale. You can learn more about this in the <<Use Cases>> section.
30
30
31
-
* *Supports Change Data Capture with Bootstrap*
31
+
* *Supports Change Data Capture with bootstrap*
32
32
** Brooklin supports propagating https://en.wikipedia.org/wiki/Change_data_capture[Change Data Capture] events from data stores, e.g. RDBMS, KV stores ... etc.
33
33
** Brooklin also supports streaming a snapshot of the existing data before propagating change events.
34
34
35
-
== Use Cases
36
-
=== Mirroring Kafka Clusters
35
+
== Use cases
36
+
=== Mirroring Kafka clusters
37
37
38
38
* *Multitenancy*
39
39
+
40
40
A single Brooklin cluster can be used to mirror data across several Kafka clusters.
41
41
42
-
* *Fault Isolation Across Topic Partitions*
42
+
* *Fault isolation across topic partitions*
43
43
+
44
44
One bad partition will not affect an entire Kafka topic. Mirroring will continue for all the other healthy partitions.
45
45
46
-
* *Whitelisting Topics Using Regular Expressions*
46
+
* *Whitelisting topics using regular expressions*
47
47
+
48
48
Select the topics to mirror using regular expression patterns against their names.
49
49
50
-
* *Pausing and Resuming Individual Partitions*
50
+
* *Pausing and resuming individual partitions*
51
51
+
52
52
Through its https://github.com/linkedin/brooklin/wiki/Brooklin-Architecture#rest-endpoints[Datastream Management Service (DMS)], Brooklin exposes REST APIs that allow finer control over replication pipelines, like being able to pause and resume individual partitions of a Kafka topic.
53
53
@@ -58,7 +58,7 @@ Through its https://github.com/linkedin/brooklin/wiki/Brooklin-Architecture#rest
58
58
* Brooklin supports bootstrapping data from a datastore, i.e. streaming a snapshot of the existing data before any change events.
59
59
* MySQL support is currently under development.
60
60
61
-
=== Stream Processing Bridge
61
+
=== Stream processing bridge
62
62
* Brooklin can be used as the underlying streaming infrastructure feeding data to Stream Processing systems, e.g. http://samza.apache.org/[Apache Samza], https://storm.apache.org/[Apache Storm], https://spark.apache.org/[Apache Spark], https://flink.apache.org/[Apache Flink].
0 commit comments