Page MenuHomePhorge

Fix podman timeout at 20mins
Needs ReviewPublic

Authored by tusooa on Fri, Dec 5, 9:08 PM.

Details

Reviewers
None
Group Reviewers
O2: lilybuild
Summary

This fixes a problem that will lead to podman run exiting prematurely at about 20mins in certain conditions.

The bug happens when:

  • The original podman-helper is used
  • The system podman sock is proxied via haproxy

It turns out that podman run without detach will wait for the container by calling the wait endpoint. Also, haproxy does not like it because it hangs for too long without producing any results, so it just terminates the connection. This leaves us in a situation where the container is still running, but podman run exits already. We cannot reproduce this problem when using podman locally or podman-remote via the unix socket directly.

Type: fix

Test Plan

Run a pipeline that has foo=0; while [ "$foo" -lt 3600 ]; do sleep 180; foo=$((foo + 180)); echo "$foo"; done. Verify that that pipeline does not fail.

Diff Detail

Repository
rB lilybuild
Lint
No Lint Coverage
Unit
No Test Coverage
Build Status
Buildable 490
Build 964: Invoke BuildbotBuildbot build #586
Build 963: arc lint + arc unit

Event Timeline

tusooa requested review of this revision.Fri, Dec 5, 9:12 PM

Fix unable to run podman for 20mins

tusooa retitled this revision from Set run script timeout to 2h to Fix podman timeout at 20mins.Mon, Dec 8, 8:53 PM
tusooa edited the summary of this revision. (Show Details)
tusooa edited the test plan for this revision. (Show Details)

Check start container status

Display error message on podman run