Connecting RDBs and Search Engines — Chapter 4 Part 2
Chapter 4 (Part 2): Integrating Kafka CDC Data with OpenSearch Using Flink In this chapter, we use Flink SQL to process CDC data in Debezium JSON format from Kafka and write it to OpenSearch. 1. Architecture Overview graph TD subgraph Source PG[PostgreSQL] end subgraph CDC DBZ[Debezium Connect] end subgraph Stream TOPIC[Kafka Topic] end subgraph Flink FLINK[Flink SQL Kafka Source] OS_SINK[Flink OpenSearch Sink] end subgraph Search OS[OpenSearch] end PG --> DBZ --> TOPIC --> FLINK --> OS_SINK --> OS Flink consumes CDC events from Kafka, transforms them, and upserts the data into OpenSearch. 2. Prerequisites Ensure the following OSS components are running (e.g., via Docker Compose): PostgreSQL Apache Kafka Kafka Connect (Debezium) ZooKeeper Flink (1.19) OpenSearch (2.13)

Chapter 4 (Part 2): Integrating Kafka CDC Data with OpenSearch Using Flink
In this chapter, we use Flink SQL to process CDC data in Debezium JSON format from Kafka and write it to OpenSearch.
1. Architecture Overview
graph TD
subgraph Source
PG[PostgreSQL]
end
subgraph CDC
DBZ[Debezium Connect]
end
subgraph Stream
TOPIC[Kafka Topic]
end
subgraph Flink
FLINK[Flink SQL Kafka Source]
OS_SINK[Flink OpenSearch Sink]
end
subgraph Search
OS[OpenSearch]
end
PG --> DBZ --> TOPIC --> FLINK --> OS_SINK --> OS
Flink consumes CDC events from Kafka, transforms them, and upserts the data into OpenSearch.
2. Prerequisites
Ensure the following OSS components are running (e.g., via Docker Compose):
- PostgreSQL
- Apache Kafka
- Kafka Connect (Debezium)
- ZooKeeper
- Flink (1.19)
- OpenSearch (2.13)