Power your Kafka Streams application with Amazon MSK and AWS Fargate

Note: This blog post is a reproduction of an article I previously published on AWS Blog. I wanted to share it here for my readers who may not have seen it.

Introduction

Today, companies of all sizes across all verticals design and build event-driven architectures centered around real-time streaming and stream processing. Amazon Managed Streaming for Apache Kafka (Amazon MSK) is a fully managed service that makes it easy for you to build and run applications that use Apache Kafka to process streaming and event data. Apache Kafka is an open-source platform for building real-time streaming data pipelines and applications. With Amazon MSK, you can continue to use native Apache Kafka APIs to build event-driven architectures, stream changes to and from databases, and power machine learning and analytics applications.

Continue reading “Power your Kafka Streams application with Amazon MSK and AWS Fargate”

Apache Hive + bzip2 txt file. Why?

Hi, folks! Several weeks ago I run into illogical behavior of Apache Hive and I couldn’t find any explanations to that, so I decided to write this blog post to share those observations, hoping somebody will benefit from that(I spent half of a day debugging this) or at least could explain “why?”. So here is the story, Continue reading “Apache Hive + bzip2 txt file. Why?”