Rafka is a gateway service that exposes Kafka using the Redis protocol.
Using Kafka from languages that lack a reliable client library can be a problem for mission-critical applications. At Skroutz we use Ruby and we faced this problem constantly.
Using Rafka we can:
- Hide low-level Kafka details from the application and provide sane defaults, backed by the excellent librdkafka.
- Use a Redis client instead of a Kafka client. This is particularly useful in languages that lack a proper Kafka driver or do not provide concurrency primitives to implement buffering and other optimizations. Furthermore, writing a Rafka client is much easier than writing a Kafka client. For a Ruby driver, see rafka-rb.
This project is under heavy development and is not recommended for use in production environments until we reach the 1.x series.
- Install librdkafka:
# debian
$ sudo apt-get install librdkafka-dev

# macOS
$ brew install librdkafka
- Install Rafka:
$ go get github.com/skroutz/rafka
- Run it:
$ rafka -k kafka.json.sample
[rafka] 2017/06/26 11:07:23 Spawning Consumer Manager (librdkafka 0.11.0)...
[server] 2017/06/26 11:07:23 Listening on 0.0.0.0:6380
Rafka exposes the Redis protocol and tries to keep Redis semantics where possible. We also try to design the protocol in a way that Rafka can be replaced by a plain Redis instance, so that it's easier to test client code and libraries.
In Kafka, each consumer represents a worker processing messages. The worker sends heartbeats and is removed from its group when it misbehaves. These semantics are preserved in Rafka by using stateful connections: a connection is tied to a set of Kafka consumers. Consumers are not shared between connections and, once the connection closes, its consumers are destroyed too.
Each connection first needs to identify itself by issuing `client setname <group.id>:<name>`, and then it can begin processing messages by issuing `blpop` calls on the desired topics. Each message should be explicitly acknowledged so it can be committed to Kafka. Acks are `rpush`ed to the special `acks` key.
Each client connection is tied to a single Producer. Producers are not shared between connections and once the connection closes, its Producer is also destroyed.
Producers can immediately begin producing using `RPUSHX`, without having to call `CLIENT SETNAME` first.
Since the produced messages are buffered in Rafka and are eventually flushed to Kafka (e.g. when the client connection is closed), `DUMP` can also be used to force a synchronous flush of the messages.
$ vim kafka.json.sample # Add brokers
$ go build && ./rafka -k ./kafka.json.sample -i 10
Producer commands:
- `RPUSHX topics:<topic> <message>`: produce a message
- `DUMP <timeoutMs>`: flush the buffered messages to Kafka. This is a blocking operation; it returns when all buffered messages have been flushed or when the timeout is exceeded.
Example using Redis:
127.0.0.1:6380> rpushx topics:greetings "hello there!"
"OK"
Consumer commands:
- `CLIENT SETNAME <group.id>:<name>`: set the consumer's group and name
- `CLIENT GETNAME`: get the consumer's group and name
- `BLPOP topics:<topic> <timeoutMs>`: consume the next message from the given topic
- `RPUSH acks <topic>:<partition>:<offset>`: commit the offset for the given topic/partition
Example using Redis:
127.0.0.1:6380> client setname myapp:a-consumer
"OK"
127.0.0.1:6380> blpop topics:greetings 1000
1) "topic"
2) "greetings"
3) "partition"
4) (integer) 2
5) "offset"
6) (integer) 10
7) "value"
8) "hello there!"
# ... do some work with the greeting...
127.0.0.1:6380> rpush acks greetings:2:10
"OK"
If this is your first time setting up development on Rafka, ensure that you have all the build dependencies via dep:
$ dep ensure
Running the Go tests:
$ go test
We also have end-to-end tests that run via Docker. Refer here for more information.
Run the tests (you must have run `make spawn` beforehand), perform various static checks and finally build the project:
$ make
List all available commands:
$ make list
Rafka is released under the GNU General Public License version 3. See COPYING.