A couple of weeks ago we organized DIAMeetUp (Data-Intensive Application Meetup) in Irvine, CA, where Cahlen Humphreys and I talked about “High-Speed Data Processing at Scale”.
Continue reading “High-Speed Data Processing at Scale”
Posts
Apache Hive + bzip2 txt file. Why?
Hi, folks! Several weeks ago I ran into some illogical behavior in Apache Hive and couldn’t find any explanation for it, so I decided to write this blog post to share my observations, hoping that somebody will benefit from them (I spent half a day debugging this) or at least can explain “why?”. So here is the story.
Continue reading “Apache Hive + bzip2 txt file. Why?”
Why we need Inner and Inner static classes in Java?
Have you ever wondered why we have 4 different kinds of nested classes? What is the main difference between them? How and where do we use them?
Continue reading “Why we need Inner and Inner static classes in Java?”
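To make the question concrete, here is a minimal sketch that puts the four kinds of nested classes (static nested, inner, local, and anonymous) side by side. The names `NestedKindsDemo`, `Describer`, and `describeAll` are made up for this illustration and do not come from the post itself.

```java
import java.util.ArrayList;
import java.util.List;

class NestedKindsDemo {
    private final String owner;

    NestedKindsDemo(String owner) {
        this.owner = owner;
    }

    // Hypothetical helper interface used by the local and anonymous classes below.
    interface Describer {
        String describe();
    }

    // 1. Static nested class: no implicit reference to an outer instance.
    static class StaticNested {
        String describe() {
            return "static nested";
        }
    }

    // 2. Inner (non-static) class: tied to an outer instance and can read its fields.
    class Inner {
        String describe() {
            return "inner of " + owner;
        }
    }

    List<String> describeAll() {
        List<String> out = new ArrayList<>();
        out.add(new StaticNested().describe());
        out.add(new Inner().describe());

        // 3. Local class: declared inside a method and visible only there.
        class Local implements Describer {
            @Override
            public String describe() {
                return "local in " + owner;
            }
        }
        out.add(new Local().describe());

        // 4. Anonymous class: a one-off implementation without a name.
        Describer anonymous = new Describer() {
            @Override
            public String describe() {
                return "anonymous in " + owner;
            }
        };
        out.add(anonymous.describe());
        return out;
    }

    public static void main(String[] args) {
        NestedKindsDemo demo = new NestedKindsDemo("demo");
        demo.describeAll().forEach(System.out::println);

        // From outside the outer class, an inner class is created through an outer instance.
        NestedKindsDemo.Inner inner = demo.new Inner();
        System.out.println(inner.describe());
    }
}
```

The practical difference shows up in the constructors: `new StaticNested()` works without any `NestedKindsDemo` object, while `Inner` always carries a hidden reference to the instance that created it.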
Elasticsearch optimistic lock(Java)
Elasticsearch is an excellent search engine, but it has a few complications. One of them is updating documents/records in the index, because Elasticsearch does not support ACID transactions. Elasticsearch treats all its documents as immutable.
Continue reading “Elasticsearch optimistic lock(Java)”
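The general idea of an optimistic lock can be sketched without a cluster: read a document together with its version, and write the new copy only if the version has not changed in the meantime. The code below is an in-memory simulation under that assumption, not the Elasticsearch client API. The class `OptimisticLockSketch` and its methods are hypothetical names; in Elasticsearch itself, recent versions expose a comparable conditional write through the `if_seq_no` and `if_primary_term` request parameters.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.UnaryOperator;

class OptimisticLockSketch {
    // Immutable snapshot of a document plus the version it was written with.
    static final class Versioned {
        final long version;
        final String body;

        Versioned(long version, String body) {
            this.version = version;
            this.body = body;
        }
    }

    // Hypothetical in-memory stand-in for an index; not a real Elasticsearch client.
    private final Map<String, Versioned> index = new ConcurrentHashMap<>();

    void create(String id, String body) {
        index.put(id, new Versioned(1, body));
    }

    Versioned get(String id) {
        return index.get(id);
    }

    // Writes a new immutable copy only if the stored version still equals expectedVersion.
    boolean indexIfVersionMatches(String id, long expectedVersion, String newBody) {
        Versioned current = index.get(id);
        if (current == null || current.version != expectedVersion) {
            return false; // somebody else updated (or removed) the document first
        }
        // Atomic swap: fails if another thread replaced the entry after our read.
        return index.replace(id, current, new Versioned(expectedVersion + 1, newBody));
    }

    // Read, modify, conditional write; on a version conflict, re-read and try again.
    boolean update(String id, UnaryOperator<String> change, int maxRetries) {
        for (int attempt = 0; attempt <= maxRetries; attempt++) {
            Versioned seen = get(id);
            if (seen == null) {
                return false;
            }
            if (indexIfVersionMatches(id, seen.version, change.apply(seen.body))) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        OptimisticLockSketch store = new OptimisticLockSketch();
        store.create("doc-1", "a");
        System.out.println(store.update("doc-1", s -> s + "b", 3));
        System.out.println(store.get("doc-1").body);
    }
}
```

The retry loop is the part that replaces a transaction: a writer never blocks others, it simply detects that it lost the race and starts over from a fresh read.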
Kafka Test Suite (Java)
One of the common issues I had with Kafka was integration testing (Java). I searched the web for a library that would include not only a Kafka broker but also Confluent Schema Registry. I didn’t find one, so I went to kafka-streams-examples and extracted the classes needed for running a Kafka cluster programmatically.
Continue reading “Kafka Test Suite (Java)”