[adapters] Fix samply on MacOS. by ryzhyk · Pull Request #6321 · feldera/feldera

ryzhyk · 2026-05-26T15:43:45Z

Trying to run samply via the /samply_profile endpoint caused the process to deadlock on macOS. Apparently, something on macOS prevents a process from owning its own profiler. As a workaround, we change the way we launch samply on macOS: instead of starting samply directly, we run a small shell script that launches samply and prints its PID. We use this PID to monitor the profiler and send signals to it.

The new code only affects macOS; on Linux we still use the simpler, well-tested approach of launching samply directly.
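
For readers who want to see the shape of the workaround, here is a minimal std-only sketch of the "launch through a shell, read back the PID" pattern. It is not the code from this PR: `spawn_detached` is a hypothetical helper, the command's output goes to /dev/null rather than to a log file, and the caller is assumed to have quoted `cmd` already.

```rust
use std::process::{Command, Stdio};

/// Hypothetical helper: run `cmd` in the background via `sh` and return the
/// PID that the shell reports for it. `cmd` is passed to the shell verbatim,
/// so the caller is responsible for quoting.
fn spawn_detached(cmd: &str) -> std::io::Result<u32> {
    // The subshell backgrounds the command and prints its PID (`$!`).
    // Redirecting the command's stdout/stderr means it does not keep our
    // pipes open, so `output()` returns as soon as the subshell exits.
    let script = format!("( {cmd} > /dev/null 2>&1 & echo $! )");
    let out = Command::new("sh")
        .arg("-c")
        .arg(&script)
        .stdin(Stdio::null())
        .output()?;

    // The only thing written to stdout is the PID followed by a newline.
    String::from_utf8_lossy(&out.stdout)
        .trim()
        .parse::<u32>()
        .map_err(|e| std::io::Error::new(std::io::ErrorKind::InvalidData, e))
}
```

Because the returned PID does not belong to a `std::process::Child`, the caller cannot `wait()` on it and has to monitor it by PID, which is why the PR polls for liveness instead.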

mihaibudiu · 2026-05-26T15:51:15Z

+    let log_file = tempfile::Builder::new()
+        .prefix("samply_log_")
+        .suffix(".log")
+        .rand_bytes(10)


what are these bytes?

mythical-fred

macOS detached-subshell workaround is a clean way to dodge task_for_pid self-deadlock, and the Linux/Unix path is preserved with a tight cfg. sh_quote test covers the embedded-single-quote case. Polling at 100ms with a 30s grace + SIGKILL fallback is sensible. LGTM.
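
The `sh_quote` helper itself is not shown in this thread. As an illustration of the embedded-single-quote case mentioned above, a common way to write such a helper looks roughly like this; treat it as an assumed reimplementation, not the PR's actual code.

```rust
/// Quote `s` so that a POSIX shell treats it as a single literal word.
/// Assumed reimplementation for illustration only.
fn sh_quote(s: &str) -> String {
    // Inside single quotes nothing is special except the single quote
    // itself, so each `'` becomes: close quote, escaped quote, reopen quote.
    format!("'{}'", s.replace('\'', r"'\''"))
}
```

With this sketch, a path such as `/tmp/it's.log` becomes `'/tmp/it'\''s.log'`, which the shell reads back as the original string.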

mythical-fred · 2026-05-26T16:00:25Z

+    if profile.is_empty() {
+        bail!("samply profile is empty; samply log: `{}`", log.trim());
+    }
+


log.contains("Error:") is a fragile failure signal — it relies on samply's specific log formatting and false-positives if the profiled program prints Error: to stdout (it doesn't here, but only because we redirect samply's own pipes). Since you already check profile.is_empty() above, this branch is mostly a belt-and-suspenders extra. Either drop it or capture samply's exit status (the launcher subshell could write $! and later wait $!; echo $? to a second tempfile).

mythical-fred · 2026-05-26T16:00:25Z

+    while process_exists(pid).await {
+        tokio::time::sleep(Duration::from_millis(100)).await;
+    }
+}


Nit: 100ms × N polls for process liveness is fine in practice, but on macOS kill(pid, 0) returns Ok for zombies too. If the subshell exits before samply does, samply gets reparented to launchd and you're polling a launchd-owned PID, which is fine; it's just worth a one-line comment that this is intentionally PID-based rather than pgid-based.
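
To make the zombie caveat concrete, here is a small std-only sketch of PID-based liveness polling. It swaps the PR's nix `kill(pid, None)` call for the shell's `kill -0` so that it needs no extra crates; `process_exists` and `wait_for_exit` here are illustrative stand-ins, and the blocking `std::thread::sleep` replaces the async tokio sleep used in the PR.

```rust
use std::process::{Command, Stdio};
use std::time::{Duration, Instant};

/// Returns true if signal 0 can be delivered to `pid`.
/// A zombie (exited but not yet reaped) still counts as existing, and a
/// process owned by another user is reported as missing (permission error).
fn process_exists(pid: u32) -> bool {
    Command::new("sh")
        .arg("-c")
        .arg(format!("kill -0 {pid}"))
        .stdout(Stdio::null())
        .stderr(Stdio::null())
        .status()
        .map(|s| s.success())
        .unwrap_or(false)
}

/// Poll every 100 ms until `pid` disappears or `timeout` elapses.
/// Returns true if the process went away within the timeout.
fn wait_for_exit(pid: u32, timeout: Duration) -> bool {
    let deadline = Instant::now() + timeout;
    while Instant::now() < deadline {
        if !process_exists(pid) {
            return true;
        }
        std::thread::sleep(Duration::from_millis(100));
    }
    false
}
```

The check only says that some process currently has this PID, so it is subject to PID reuse and says nothing about the process group.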

blp · 2026-05-26T16:31:59Z

+    tokio::task::spawn_blocking(move || kill(Pid::from_raw(pid as i32), None).is_ok())
+        .await
+        .unwrap_or(false)
+}


spawn_blocking doesn't hurt but I'm surprised that it's useful. The kill system call shouldn't block AFAIK.

blp · 2026-05-26T16:34:31Z

+    let deadline = tokio::time::Instant::now() + timeout;
+    while tokio::time::Instant::now() < deadline {
+        if !process_exists(pid).await {
+            return true;
+        }
+        tokio::time::sleep(Duration::from_millis(100)).await;
+    }
+    false
+}


I think this can be reduced to something like tokio::time::timeout(timeout, wait_for_process_exit_unbounded(pid)).await.map_or(false, |_| true)

(Current approach is OK too.)

blp · 2026-05-26T16:38:14Z

+        .context("failed to convert samply log path to str")?;
+
+    let sh_cmd = format!(
+        "( samply record -p {target_pid} -o {} --save-only --presymbolicate > {} 2>&1 & echo $! )",


The additional subshell (the "()" around the shell commands) looks weird, but I assume it's necessary?

When making many small transactions, the default polling interval of 1s can be too long. This commit makes the interval configurable, while reducing the default to 0.5s. Signed-off-by: Leonid Ryzhyk <ryzhyk@gmail.com>

mythical-fred

Re-APPROVE on a18859a3 (poll_interval follow-up).

Clean addition: poll_interval_s parameter threaded through Pipeline.commit_transaction and FelderaClient.commit_transaction with default 0.5s (was hard-coded 1.0s sleep). Default tightening is appropriate for the small-transaction workload. Docstrings updated in both layers; the debug log substitutes the actual interval.

Unrelated to the samply MacOS fix — PR title now slightly underdescribes the scope, but not blocking.

No new tests for the parameter itself; it's a passthrough to time.sleep.

ryzhyk requested a review from blp May 26, 2026 15:43

ryzhyk added the connectors label (Issues related to the adapters/connectors crate) May 26, 2026

mihaibudiu approved these changes May 26, 2026

mythical-fred approved these changes May 26, 2026

blp approved these changes May 26, 2026

ryzhyk added 2 commits May 26, 2026 11:50

[py] Configurable polling interval for transaction commit.

a18859a

ryzhyk force-pushed the samply-deadlock-macos branch from 5a459ae to a18859a May 26, 2026 18:51

mythical-fred approved these changes May 27, 2026

ryzhyk added this pull request to the merge queue May 28, 2026

Merged via the queue into main with commit 358e702 May 28, 2026
1 check passed

ryzhyk deleted the samply-deadlock-macos branch May 28, 2026 19:17
