[sql_server] Initial implementation for Source Rendering #32133
Conversation
let (mut client, connection) =
    mz_sql_server_util::Client::connect(connection_config).await?;
// TODO(sql_server1): Move the connection into its own future.
mz_ore::task::spawn(|| "sql_server-connection", async move { connection.await });
What does the TODO refer to? It seems to be its own future already. Also, the async block can be reduced to just `connection`:
-mz_ore::task::spawn(|| "sql_server-connection", async move { connection.await });
+mz_ore::task::spawn(|| "sql_server-connection", connection);
The TODO refers to just cleaning up this API a bit: everywhere we use it we have to spawn this task, and instead it would be great if the task were just spawned internally. Also, the returned `Connection` type implements `IntoFuture`, not `Future`, so it needs the `async move { connection.await }` block.
.map(|output| {
    (
        Arc::clone(&output.capture_instance),
        (output.partition_index, Arc::clone(&output.decoder)),
Using a partition index and then splitting the stream into multiple ones using `.partition(..)` is done in the other sources because the `SourceRender` trait didn't support multiple outputs directly. This is now possible, so since this is a greenfield source it would be nice to directly create as many outputs as necessary and drive them directly here. This will allow us to manipulate the frontiers of each output separately.
Chatted offline, I have a local branch that does what Petros describes, I'll make this change in a follow-up PR
// Decode the SQL Server row into an MZ one.
let Some((partition_idx, decoder)) = capture_instances.get(&capture_instance)
else {
    let definite_error = DefiniteError::ProgrammingError(format!(
What do you mean here by `ProgrammingError`? Is this condition only hit through a bug? If so, I wouldn't make it a definite error, since that will permanently break the source.
Normally here you would do a `.expect(...)`, but in case of a bug I don't want to panic `clusterd`. Switched to a `TransientError`!
    return Ok(());
};

// Failing to decode data is a permanent failure.
It's not a permanent failure if the problematic row is later retracted, so what we want to do here is emit the error in the output and continue as if nothing happened. The error message should contain enough data for the user to locate the problematic row (perhaps a raw representation).
Done! I also reworked decoding to return a new `SqlServerDecodeError`, whose aim is to be a stable error type.
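The pattern discussed above, emitting a decode error into the output rather than halting the source, can be sketched as follows. This `SqlServerDecodeError` struct and `decode` function are simplified hypothetical stand-ins, not Materialize's actual types; the point is that bad rows produce error values alongside good rows, so the source keeps making progress.

```rust
// Hypothetical stand-in for a stable decode-error type: it carries enough
// detail for the user to locate the problematic upstream row.
#[derive(Debug, PartialEq)]
struct SqlServerDecodeError {
    /// A raw representation of the row that failed to decode.
    raw: String,
    /// Why decoding failed.
    reason: String,
}

// Toy decoder: parse a string as an integer, capturing failures as values.
fn decode(raw: &str) -> Result<i64, SqlServerDecodeError> {
    raw.parse::<i64>().map_err(|e| SqlServerDecodeError {
        raw: raw.to_string(),
        reason: e.to_string(),
    })
}

fn main() {
    // Split decode results into an "ok" stream and an "error" stream,
    // instead of aborting on the first failure.
    let (oks, errs): (Vec<_>, Vec<_>) = ["1", "two", "3"]
        .iter()
        .map(|raw| decode(raw))
        .partition(Result::is_ok);
    let oks: Vec<i64> = oks.into_iter().map(Result::unwrap).collect();
    let errs: Vec<SqlServerDecodeError> =
        errs.into_iter().map(Result::unwrap_err).collect();

    // The bad row becomes data in the error output; good rows still flow.
    assert_eq!(oks, vec![1, 3]);
    assert_eq!(errs[0].raw, "two");
    println!("oks = {oks:?}, errs = {errs:?}");
}
```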
src/sql-server-util/src/cdc.rs
Outdated
@@ -310,60 +356,114 @@ pub enum CdcError {
/// function.
///
/// [`sys.fn_cdc_increment_lsn`](https://learn.microsoft.com/en-us/sql/relational-databases/system-functions/sys-fn-cdc-increment-lsn-transact-sql?view=sql-server-ver16)
This doc comment block needs some updates, since LSNs are not treated as opaque blobs anymore and we increment them without calling this function.
Updated!
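Incrementing an LSN locally, rather than round-tripping through sys.fn_cdc_increment_lsn, amounts to treating the 10-byte LSN as a big-endian integer and adding one with carry. The sketch below is a hypothetical stand-in for Materialize's actual `Lsn` type, shown only to illustrate the carry logic.

```rust
/// Increment a 10-byte SQL Server LSN, viewed as a big-endian integer.
/// (Hypothetical stand-in for the real Lsn type; sys.fn_cdc_increment_lsn
/// performs the equivalent operation server-side.)
fn increment_lsn(mut lsn: [u8; 10]) -> [u8; 10] {
    // Walk from the least-significant byte, carrying on overflow.
    for byte in lsn.iter_mut().rev() {
        let (next, overflowed) = byte.overflowing_add(1);
        *byte = next;
        if !overflowed {
            break;
        }
    }
    lsn
}

fn main() {
    // Simple increment of the lowest byte.
    assert_eq!(increment_lsn([0u8; 10])[9], 1);

    // Carry propagates across byte boundaries.
    let mut max_low = [0u8; 10];
    max_low[9] = 0xFF;
    let next = increment_lsn(max_low);
    assert_eq!((next[8], next[9]), (1, 0));
    println!("ok");
}
```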
src/sql-server-util/src/cdc.rs
Outdated
// the LSN from progressing, we want to wait a bit for SQL Server to
// become ready.
let (_client, lsn_result) = mz_ore::retry::Retry::default()
    .max_duration(std::time::Duration::from_secs(10))
Does this give the query 10 seconds to come back? The default might be too tight, and it's definitely worth a dyncfg.
It does. This should only really matter when CDC is first initialized for the entire DB, but it's something I ran into in tests. I made this configurable via a dyncfg and updated the max wait to 30 seconds.
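The retry-with-deadline pattern used above can be sketched in plain std Rust. The real code uses `mz_ore::retry::Retry` with `.max_duration(..)`; `retry_for` below is a hypothetical simplified version, assuming a synchronous operation and exponential backoff, to show the shape of the behavior.

```rust
use std::time::{Duration, Instant};

// Hypothetical sketch of retry-until-deadline: keep retrying a fallible
// operation until it succeeds or `max_duration` of wall-clock time elapses,
// backing off exponentially between attempts.
fn retry_for<T, E>(
    max_duration: Duration,
    mut op: impl FnMut() -> Result<T, E>,
) -> Result<T, E> {
    let deadline = Instant::now() + max_duration;
    let mut backoff = Duration::from_millis(10);
    loop {
        match op() {
            Ok(v) => return Ok(v),
            // Out of time: surface the last error to the caller.
            Err(e) if Instant::now() >= deadline => return Err(e),
            Err(_) => {
                // Never sleep past the deadline.
                let remaining = deadline.saturating_duration_since(Instant::now());
                std::thread::sleep(backoff.min(remaining));
                backoff = backoff.saturating_mul(2);
            }
        }
    }
}

fn main() {
    // An operation that succeeds on its third attempt, e.g. SQL Server
    // reporting an initial LSN only once CDC has finished initializing.
    let mut attempts = 0;
    let result = retry_for(Duration::from_secs(5), || {
        attempts += 1;
        if attempts < 3 { Err("lsn not yet available") } else { Ok(attempts) }
    });
    assert_eq!(result, Ok(3));
    println!("succeeded after {attempts} attempts");
}
```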
let next_lsn = crate::inspect::increment_lsn(self.client, new_lsn).await?;
max_observed_lsn = db_max_lsn;
// TODO(sql_server2): We should occasionally check to see how close the LSN we
// generate is to the LSN returned from incrementing via SQL Server itself.
Out of curiosity, what will we do with this information?
Not much, but it might be interesting to observe how the LSN of the upstream advances, just for our own understanding.
@@ -175,7 +201,7 @@ impl<'a> CdcStream<'a> {
     self.client,
     &*instance,
     *instance_lsn,
-    new_lsn,
+    db_max_lsn,
     RowFilterOption::AllUpdateOld,
 )
 // TODO(sql_server3): Make this chunk size configurable.
Not part of this PR, but the code below seems to be doing some grouping by LSNs. Timely doesn't care about the order of LSNs in the data stream as long as progress statements are correct, so we can directly iterate and emit the data as they come.
That said, in which case does SQL Server send us data out of order?
Good to know!
SQL Server shouldn't be sending data out of order, but the type this `Stream` yields is:

    Data {
        /// The capture instance these changes are for.
        capture_instance: Arc<str>,
        /// The LSN that this change occurred at.
        lsn: Lsn,
        /// The change itself.
        changes: Vec<Operation>,
    },
So we're grouping by LSN just so we can emit an entire set of `Operation`s all at once instead of `(Arc<str>, Lsn, Operation)` tuples. Happy to refactor this in a follow-up if you'd prefer!
Emitting a sequence of `(Arc<str>, Lsn, Operation)` is definitely more timely-like, so I highly suggest that we do that, and it's totally fine as a follow-up. Separately, it would be nice if we could get rid of that `Arc<str>` in favor of the table id, which I presume is some small copyable type that doesn't need refcount updates all the time.
src/sql-server-util/src/cdc.rs
Outdated
// event in case we haven't already.
//
// Note: Generally this only ever occurs when starting a stream and the LSN
// we're starting with matches the current max of the DB.
I think this else block can be removed entirely and instead just emit a progress event at the beginning before entering the replication loop. Or don't emit it at all and let the caller assume it
Good thought, refactored to emit this once at the start!
* introduces a new SqlServerDecodeError which documents the stability requirements
* updates the SqlServerDecoder to return this new error type
* changes decoding from a definite error to committing a SourceError
…nd LSN implementation
* refactor a few methods on the Lsn struct
* change the CdcHandle::into_stream method to emit a single progress event at start
* add dyncfgs for the CDC poll rate and the amount of time we'll wait for SQL Server to report an initial LSN
Thanks for the quick turnaround on those comments!
Thank you for the quick review!
This PR adds an MVP implementation of the SourceRender trait for the SQL Server source. It also adds a testdrive test that exercises replicating data from SQL Server into Materialize.

TODO(parkmycar): I need to add some more detail here
Motivation
Progress towards https://github.com/MaterializeInc/database-issues/issues/8762
Checklist
$T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.