Can AI handle 22-year-old code?
Benchmark of LLMs on real open-source projects against dependency hell, legacy toolchains, and complex build systems. Compare top models by success rate, cost or speed.
Last update: 17th Sep 2025
Read the blogpost: CompileBench: Can AI Compile 22-year-old Code? Short intro to the benchmark and key results.

LLMs can vibe-code and win coding contests, but can they handle real-world software issues like dependency hell, legacy toolchains or weird compile errors?

We gave 19 state-of-the-art LLMs the unmodified source code of open-source projects such as curl (an HTTP client) and jq (a command-line JSON processor), and tested them on 15 real-world tasks.

The goal is simple: build a working binary from source - but getting there is hard. The toughest challenges include cross-compiling to Windows or ARM64 and resurrecting source code from 2003 on modern systems. Agents sometimes need 135 commands and 15 minutes to produce a working binary.

# Model, pass@1 / pass@3
1 claude-opus-4.1-thinking-16k 80% / 100%
2 claude-sonnet-4-thinking-16k 91% / 93%
3 gpt-5-high 87% / 93%
4 grok-4 71% / 87%
5 claude-sonnet-4 78% / 80%
6 gpt-5-mini-high 76% / 80%
7 deepseek-v3.1 64% / 80%
8 gpt-5-minimal 58% / 80%
9 kimi-k2-0905 58% / 80%
10 grok-code-fast-1 64% / 73%
11 glm-4.5 49% / 73%
12 qwen3-max 53% / 67%
13 gpt-4.1-mini 47% / 67%
14 gemini-2.5-pro 53% / 60%
15 gpt-4.1 53% / 60%
16 gpt-oss-120b-high 42% / 60%
17 gemini-2.5-flash 40% / 60%
18 gemini-2.5-flash-thinking 44% / 53%
19 gpt-5-mini-minimal 29% / 47%
pass@1: success within a single attempt
pass@3: success within 3 attempts
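
One way to read the two columns, given that each task is attempted three times: pass@1 is the share of individual attempts that succeeded, and pass@3 is the share of tasks with at least one successful attempt. This is an interpretation rather than a formula stated on this page, but the sketch below reproduces the 80% / 100% shown for claude-opus-4.1-thinking-16k when fed that model's outcomes from the run list further down. The function names are illustrative.

```python
from typing import Dict, List

# Outcomes for claude-opus-4.1-thinking-16k, copied from the run list on this
# page (True = Success, False = Failure), three attempts per task.
ATTEMPTS: Dict[str, List[bool]] = {
    "coreutils": [True, True, True],
    "coreutils-old-version": [True, True, True],
    "coreutils-old-version-alpine": [False, True, True],
    "coreutils-static": [True, True, True],
    "coreutils-static-alpine": [True, True, True],
    "cowsay": [True, True, True],
    "curl": [True, True, True],
    "curl-ssl": [True, True, True],
    "curl-ssl-arm64-static": [False, False, True],
    "curl-ssl-arm64-static2": [False, False, True],
    "jq": [True, True, True],
    "jq-static": [True, True, True],
    "jq-static-musl": [False, True, False],
    "jq-windows": [False, True, False],
    "jq-windows2": [True, True, True],
}


def pass_at_1(attempts: Dict[str, List[bool]]) -> float:
    """Share of individual attempts that succeeded (assumed reading of pass@1)."""
    outcomes = [ok for runs in attempts.values() for ok in runs]
    return sum(outcomes) / len(outcomes)


def pass_at_k(attempts: Dict[str, List[bool]]) -> float:
    """Share of tasks with at least one success among the recorded attempts."""
    return sum(any(runs) for runs in attempts.values()) / len(attempts)


print(f"pass@1 = {pass_at_1(ATTEMPTS):.0%}")  # pass@1 = 80%
print(f"pass@3 = {pass_at_k(ATTEMPTS):.0%}")  # pass@3 = 100%
```

The gap between the two numbers shows how often a model needs a retry: every task was solved at least once, yet one attempt in five failed.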

Each CompileBench task gives the agent:

  • Source code of an open‑source project (e.g., curl)
  • An interactive Linux terminal (Docker)
  • A clear build objective

The agent figures out the build system, patches if needed, resolves headers/libs, and picks compiler/linker flags; we then verify the binary works. Tasks range from easy builds to reviving 2003‑era code and cross‑compiling to Windows or ARM64, using projects like curl, GNU Coreutils, and jq.
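
Several failures in the run list read "is not statically linked" or "is not aarch64 architecture". The benchmark's actual checks are not shown on this page, so the following is only a sketch of how such properties could be read from an ELF header. It assumes a little-endian ELF64 image and treats the absence of a PT_INTERP segment as a heuristic for static linking. `inspect_elf64` and `fake_elf64` are hypothetical helper names, and `fake_elf64` builds a header-only image purely for demonstration.

```python
import struct
from typing import Dict, List

EM_X86_64 = 62    # e_machine value for x86-64
EM_AARCH64 = 183  # e_machine value for ARM64
PT_LOAD = 1       # loadable segment
PT_INTERP = 3     # segment naming the dynamic loader


def inspect_elf64(data: bytes) -> Dict[str, object]:
    """Report the machine type and whether a PT_INTERP segment is present."""
    if data[:4] != b"\x7fELF":
        raise ValueError("not an ELF file")
    if data[4] != 2 or data[5] != 1:
        raise ValueError("this sketch handles only little-endian ELF64")
    (machine,) = struct.unpack_from("<H", data, 18)         # e_machine
    (phoff,) = struct.unpack_from("<Q", data, 32)           # e_phoff
    phentsize, phnum = struct.unpack_from("<HH", data, 54)  # e_phentsize, e_phnum
    segment_types = [
        struct.unpack_from("<I", data, phoff + i * phentsize)[0]
        for i in range(phnum)
    ]
    return {"machine": machine, "has_interp": PT_INTERP in segment_types}


def fake_elf64(machine: int, segment_types: List[int]) -> bytes:
    """Build a header-only ELF64 image for demonstration (not a runnable binary)."""
    ident = b"\x7fELF" + bytes([2, 1, 1]) + bytes(9)  # 64-bit, little-endian
    header = ident + struct.pack(
        "<HHIQQQIHHHHHH",
        2,                   # e_type: ET_EXEC
        machine,             # e_machine
        1,                   # e_version
        0,                   # e_entry
        64,                  # e_phoff: program headers follow the header
        0,                   # e_shoff
        0,                   # e_flags
        64,                  # e_ehsize
        56,                  # e_phentsize
        len(segment_types),  # e_phnum
        0, 0, 0,             # e_shentsize, e_shnum, e_shstrndx
    )
    phdrs = b"".join(struct.pack("<I", t) + bytes(52) for t in segment_types)
    return header + phdrs


static_arm64 = inspect_elf64(fake_elf64(EM_AARCH64, [PT_LOAD]))
dynamic_x86 = inspect_elf64(fake_elf64(EM_X86_64, [PT_LOAD, PT_INTERP]))
print(static_arm64)  # {'machine': 183, 'has_interp': False}
print(dynamic_x86)   # {'machine': 62, 'has_interp': True}
```

For a real binary, pass the bytes read from the file to `inspect_elf64`. A cross-compiled static ARM64 build would be expected to report machine 183 and no PT_INTERP segment.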

In this section we compare each model's total cost across the tasks it managed to complete.
Pareto frontier (best price for each accuracy target):
# Min. accuracy Best model for price Total price Relative
1 ≥ 100% claude-opus-4.1-thinking-16k $21.72 799.3x
2 ≥ 93% gpt-5-high $1.99 73.3x
3 ≥ 80% gpt-5-mini-high $0.27 10.1x
4 ≥ 73% grok-code-fast-1 $0.10 3.5x
5 ≥ 47% gpt-5-mini-minimal $0.03 1x
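
As a rough illustration of how a table like this can be derived, the sketch below computes a price/accuracy Pareto frontier: a model stays on the frontier only if no cheaper model matches or beats its accuracy. The five real entries come from the table above, with accuracy taken as the pass@3 score. The entry named `hypothetical-model` is invented to show a dominated point being dropped, and the `Result` type and function names are assumptions.

```python
from typing import List, NamedTuple


class Result(NamedTuple):
    model: str
    accuracy: float  # pass@3 as a fraction
    price: float     # total price in USD


RESULTS = [
    Result("claude-opus-4.1-thinking-16k", 1.00, 21.72),
    Result("gpt-5-high", 0.93, 1.99),
    Result("gpt-5-mini-high", 0.80, 0.27),
    Result("grok-code-fast-1", 0.73, 0.10),
    Result("gpt-5-mini-minimal", 0.47, 0.03),
    Result("hypothetical-model", 0.60, 0.50),  # invented, dominated entry
]


def pareto_frontier(results: List[Result]) -> List[Result]:
    """Return results that no cheaper result matches or beats on accuracy."""
    frontier = []
    best_accuracy = -1.0
    # Walk from cheapest to most expensive; on a price tie, higher accuracy first.
    for r in sorted(results, key=lambda r: (r.price, -r.accuracy)):
        if r.accuracy > best_accuracy:
            frontier.append(r)
            best_accuracy = r.accuracy
    return frontier


def cheapest_for(results: List[Result], min_accuracy: float) -> Result:
    """Cheapest result meeting the accuracy target (raises ValueError if none)."""
    return min((r for r in results if r.accuracy >= min_accuracy),
               key=lambda r: r.price)


for r in pareto_frontier(RESULTS):
    print(f"{r.accuracy:.0%} {r.model} ${r.price:.2f}")
print(cheapest_for(RESULTS, 0.80).model)  # gpt-5-mini-high
```

The invented entry costs more than gpt-5-mini-high while scoring lower, so it never appears in the output.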

In this section we compare each model's total time across the tasks it managed to complete. We measure end-to-end time to finish tasks (LLM inference time plus terminal command execution time). This is not just raw tokens per second: it also reflects how many commands and iterations the model needed to complete the tasks.
Pareto frontier (best speed for each accuracy target):
# Min. accuracy Best model for speed Total time Relative
1 ≥ 100% claude-opus-4.1-thinking-16k 58m48s 7.2x
2 ≥ 93% claude-sonnet-4-thinking-16k 45m40s 5.6x
3 ≥ 80% gpt-5-minimal 17m16s 2.1x
4 ≥ 67% qwen3-max 11m18s 1.4x
5 ≥ 60% gpt-4.1 8m10s 1x
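
The "Relative" column appears to be each total time divided by the fastest entry. The sketch below checks that reading by parsing the duration strings from the table above and recomputing the ratios. `parse_duration` is a hypothetical helper, not part of the benchmark.

```python
import re

# Total times copied from the speed table above.
TIMES = {
    "claude-opus-4.1-thinking-16k": "58m48s",
    "claude-sonnet-4-thinking-16k": "45m40s",
    "gpt-5-minimal": "17m16s",
    "qwen3-max": "11m18s",
    "gpt-4.1": "8m10s",
}


def parse_duration(text: str) -> int:
    """Convert strings such as '1h01m09s' or '58m48s' to seconds."""
    match = re.fullmatch(r"(?:(\d+)h)?(?:(\d+)m)?(?:(\d+)s)?", text)
    if not match or not any(match.groups()):
        raise ValueError(f"unrecognised duration: {text!r}")
    hours, minutes, seconds = (int(g) if g else 0 for g in match.groups())
    return hours * 3600 + minutes * 60 + seconds


seconds = {model: parse_duration(t) for model, t in TIMES.items()}
fastest = min(seconds.values())
relative = {model: round(s / fastest, 1) for model, s in seconds.items()}

for model, ratio in relative.items():
    print(f"{model}: {ratio:.1f}x")
# claude-opus-4.1-thinking-16k: 7.2x
# claude-sonnet-4-thinking-16k: 5.6x
# gpt-5-minimal: 2.1x
# qwen3-max: 1.4x
# gpt-4.1: 1.0x
```

The recomputed ratios agree with the table, which supports reading "1x" as the fastest model on the frontier.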
Across all tasks, the benchmark spent $299.98, sent 20460 LLM requests, and ran for 48h30m22s in total: 27h13m22s of model inference time and 19h28m34s spent in the terminal, executing 19820 commands. "Total" means we added up every attempt across tasks. Per‑task averages and details live on the task pages.
# Model
1 gpt-5-mini-minimal · Cost: $0.33 · Time: 1h39m47s · LLM inference time: 28m23s · Command execution time: 1h01m09s · Tokens: 376k
2 grok-code-fast-1 · Cost: $0.71 · Time: 2h39m11s · LLM inference time: 49m19s · Command execution time: 1h48m45s · Tokens: 771k
3 gemini-2.5-flash · Cost: $2.27 · Time: 1h43m36s · LLM inference time: 21m23s · Command execution time: 1h21m02s · Tokens: 905k
4 gpt-oss-120b-high · Cost: $2.30 · Time: 2h09m57s · LLM inference time: 45m33s · Command execution time: 1h05m04s · Tokens: 726k
5 gpt-5-minimal · Cost: $2.47 · Time: 1h25m48s · LLM inference time: 43m19s · Command execution time: 41m21s · Tokens: 602k
6 gpt-5-mini-high · Cost: $2.73 · Time: 4h51m51s · LLM inference time: 3h46m03s · Command execution time: 49m56s · Tokens: 1.4M
7 gpt-4.1-mini · Cost: $3.78 · Time: 1h40m39s · LLM inference time: 34m03s · Command execution time: 1h00m14s · Tokens: 761k
8 glm-4.5 · Cost: $3.86 · Time: 1h23m46s · LLM inference time: 45m59s · Command execution time: 36m42s · Tokens: 527k
9 gemini-2.5-flash-thinking · Cost: $6.41 · Time: 3h51m58s · LLM inference time: 1h18m03s · Command execution time: 2h31m22s · Tokens: 1.2M
10 gpt-5-high · Cost: $8.38 · Time: 3h59m26s · LLM inference time: 3h07m56s · Command execution time: 50m26s · Tokens: 1.1M
11 gemini-2.5-pro · Cost: $11.30 · Time: 2h12m20s · LLM inference time: 1h17m53s · Command execution time: 53m28s · Tokens: 716k
12 gpt-4.1 · Cost: $11.71 · Time: 1h08m19s · LLM inference time: 35m16s · Command execution time: 32m03s · Tokens: 708k
13 claude-sonnet-4 · Cost: $20.04 · Time: 2h33m55s · LLM inference time: 1h28m29s · Command execution time: 49m32s · Tokens: 1.2M
14 deepseek-v3.1 · Cost: $21.28 · Time: 1h50m33s · LLM inference time: 1h09m17s · Command execution time: 40m13s · Tokens: 1M
15 claude-sonnet-4-thinking-16k · Cost: $22.46 · Time: 2h58m12s · LLM inference time: 2h05m18s · Command execution time: 51m48s · Tokens: 1.3M
16 kimi-k2-0905 · Cost: $26.42 · Time: 2h09m14s · LLM inference time: 1h01m47s · Command execution time: 51m40s · Tokens: 723k
17 grok-4 · Cost: $28.25 · Time: 4h59m49s · LLM inference time: 3h26m13s · Command execution time: 1h23m19s · Tokens: 1M
18 qwen3-max · Cost: $55.08 · Time: 1h53m57s · LLM inference time: 1h00m53s · Command execution time: 51m44s · Tokens: 1.1M
19 claude-opus-4.1-thinking-16k · Cost: $70.20 · Time: 3h18m02s · LLM inference time: 2h28m14s · Command execution time: 48m45s · Tokens: 958k
Total · Cost: $299.98 · Time: 48h30m22s · LLM inference time: 27h13m22s · Command execution time: 19h28m34s · Tokens: 17.1M
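
As a quick consistency check on the "Total" row, the per-model costs listed above can be summed. The sketch below uses `Decimal` so that cents add up exactly; the dictionary is copied from the table and the variable names are illustrative.

```python
from decimal import Decimal

# Per-model total cost in USD, copied from the table above.
COSTS = {
    "gpt-5-mini-minimal": Decimal("0.33"),
    "grok-code-fast-1": Decimal("0.71"),
    "gemini-2.5-flash": Decimal("2.27"),
    "gpt-oss-120b-high": Decimal("2.30"),
    "gpt-5-minimal": Decimal("2.47"),
    "gpt-5-mini-high": Decimal("2.73"),
    "gpt-4.1-mini": Decimal("3.78"),
    "glm-4.5": Decimal("3.86"),
    "gemini-2.5-flash-thinking": Decimal("6.41"),
    "gpt-5-high": Decimal("8.38"),
    "gemini-2.5-pro": Decimal("11.30"),
    "gpt-4.1": Decimal("11.71"),
    "claude-sonnet-4": Decimal("20.04"),
    "deepseek-v3.1": Decimal("21.28"),
    "claude-sonnet-4-thinking-16k": Decimal("22.46"),
    "kimi-k2-0905": Decimal("26.42"),
    "grok-4": Decimal("28.25"),
    "qwen3-max": Decimal("55.08"),
    "claude-opus-4.1-thinking-16k": Decimal("70.20"),
}

total = sum(COSTS.values(), Decimal("0"))
print(f"${total}")  # $299.98
```

The sum matches the reported total of $299.98, consistent with "Total" meaning every attempt across every task and model.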
A complete list of every run across models and tasks. Click any row to open the full attempt report with logs, commands, and outputs.
Model Task Status Error
claude-opus-4.1-thinking-16k coreutils Success -
claude-opus-4.1-thinking-16k coreutils Success -
claude-opus-4.1-thinking-16k coreutils Success -
claude-opus-4.1-thinking-16k coreutils-old-version Success -
claude-opus-4.1-thinking-16k coreutils-old-version Success -
claude-opus-4.1-thinking-16k coreutils-old-version Success -
claude-opus-4.1-thinking-16k coreutils-old-version-alpine Failure task failed: install missing at /home/peter/result/install or not executable
claude-opus-4.1-thinking-16k coreutils-old-version-alpine Success -
claude-opus-4.1-thinking-16k coreutils-old-version-alpine Success -
claude-opus-4.1-thinking-16k coreutils-static Success -
claude-opus-4.1-thinking-16k coreutils-static Success -
claude-opus-4.1-thinking-16k coreutils-static Success -
claude-opus-4.1-thinking-16k coreutils-static-alpine Success -
claude-opus-4.1-thinking-16k coreutils-static-alpine Success -
claude-opus-4.1-thinking-16k coreutils-static-alpine Success -
claude-opus-4.1-thinking-16k cowsay Success -
claude-opus-4.1-thinking-16k cowsay Success -
claude-opus-4.1-thinking-16k cowsay Success -
claude-opus-4.1-thinking-16k curl Success -
claude-opus-4.1-thinking-16k curl Success -
claude-opus-4.1-thinking-16k curl Success -
claude-opus-4.1-thinking-16k curl-ssl Success -
claude-opus-4.1-thinking-16k curl-ssl Success -
claude-opus-4.1-thinking-16k curl-ssl Success -
claude-opus-4.1-thinking-16k curl-ssl-arm64-static Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-opus-4.1-thinking-16k curl-ssl-arm64-static Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-opus-4.1-thinking-16k curl-ssl-arm64-static Success -
claude-opus-4.1-thinking-16k curl-ssl-arm64-static2 Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-opus-4.1-thinking-16k curl-ssl-arm64-static2 Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-opus-4.1-thinking-16k curl-ssl-arm64-static2 Success -
claude-opus-4.1-thinking-16k jq Success -
claude-opus-4.1-thinking-16k jq Success -
claude-opus-4.1-thinking-16k jq Success -
claude-opus-4.1-thinking-16k jq-static Success -
claude-opus-4.1-thinking-16k jq-static Success -
claude-opus-4.1-thinking-16k jq-static Success -
claude-opus-4.1-thinking-16k jq-static-musl Failure task failed: jq is not statically linked
claude-opus-4.1-thinking-16k jq-static-musl Success -
claude-opus-4.1-thinking-16k jq-static-musl Failure task failed: jq is not statically linked
claude-opus-4.1-thinking-16k jq-windows Failure task failed: jq help does not contain expected string
claude-opus-4.1-thinking-16k jq-windows Success -
claude-opus-4.1-thinking-16k jq-windows Failure task failed: jq help does not contain expected string
claude-opus-4.1-thinking-16k jq-windows2 Success -
claude-opus-4.1-thinking-16k jq-windows2 Success -
claude-opus-4.1-thinking-16k jq-windows2 Success -
claude-sonnet-4 coreutils Success -
claude-sonnet-4 coreutils Success -
claude-sonnet-4 coreutils Success -
claude-sonnet-4 coreutils-old-version Success -
claude-sonnet-4 coreutils-old-version Success -
claude-sonnet-4 coreutils-old-version Success -
claude-sonnet-4 coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
claude-sonnet-4 coreutils-old-version-alpine Failure task failed: df missing at /home/peter/result/df or not executable
claude-sonnet-4 coreutils-old-version-alpine Failure task failed: No success reported by script: all-utils-exists.sh
claude-sonnet-4 coreutils-static Success -
claude-sonnet-4 coreutils-static Success -
claude-sonnet-4 coreutils-static Success -
claude-sonnet-4 coreutils-static-alpine Success -
claude-sonnet-4 coreutils-static-alpine Success -
claude-sonnet-4 coreutils-static-alpine Success -
claude-sonnet-4 cowsay Success -
claude-sonnet-4 cowsay Success -
claude-sonnet-4 cowsay Success -
claude-sonnet-4 curl Success -
claude-sonnet-4 curl Success -
claude-sonnet-4 curl Success -
claude-sonnet-4 curl-ssl Success -
claude-sonnet-4 curl-ssl Success -
claude-sonnet-4 curl-ssl Success -
claude-sonnet-4 curl-ssl-arm64-static Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-sonnet-4 curl-ssl-arm64-static Failure task failed: curl-arm64 is not statically linked
claude-sonnet-4 curl-ssl-arm64-static Failure task failed: curl-arm64 is not statically linked
claude-sonnet-4 curl-ssl-arm64-static2 Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-sonnet-4 curl-ssl-arm64-static2 Failure task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported
claude-sonnet-4 curl-ssl-arm64-static2 Failure task failed: curl-arm64 is not statically linked
claude-sonnet-4 jq Success -
claude-sonnet-4 jq Success -
claude-sonnet-4 jq Success -
claude-sonnet-4 jq-static Success -
claude-sonnet-4 jq-static Success -
claude-sonnet-4 jq-static Success -
claude-sonnet-4 jq-static-musl Success -
claude-sonnet-4 jq-static-musl Success -
claude-sonnet-4 jq-static-musl Success -
claude-sonnet-4 jq-windows Success -
claude-sonnet-4 jq-windows Success -
claude-sonnet-4 jq-windows Failure task failed: jq help does not contain expected string
claude-sonnet-4 jq-windows2 Success -
claude-sonnet-4 jq-windows2 Success -
claude-sonnet-4 jq-windows2 Success -
claude-sonnet-4-thinking-16k coreutils Success -
claude-sonnet-4-thinking-16k coreutils Success -
claude-sonnet-4-thinking-16k coreutils Success -
claude-sonnet-4-thinking-16k coreutils-old-version Success -
claude-sonnet-4-thinking-16k coreutils-old-version Success -
claude-sonnet-4-thinking-16k coreutils-old-version Success -
claude-sonnet-4-thinking-16k coreutils-old-version-alpine Success -
claude-sonnet-4-thinking-16k coreutils-old-version-alpine Success -
claude-sonnet-4-thinking-16k coreutils-old-version-alpine Success -
claude-sonnet-4-thinking-16k coreutils-static Success -
claude-sonnet-4-thinking-16k coreutils-static Success -
claude-sonnet-4-thinking-16k coreutils-static Success -
claude-sonnet-4-thinking-16k coreutils-static-alpine Success -
claude-sonnet-4-thinking-16k coreutils-static-alpine Success -
claude-sonnet-4-thinking-16k coreutils-static-alpine Success -
claude-sonnet-4-thinking-16k cowsay Success -
claude-sonnet-4-thinking-16k cowsay Success -
claude-sonnet-4-thinking-16k cowsay Success -
claude-sonnet-4-thinking-16k curl Success -
claude-sonnet-4-thinking-16k curl Success -
claude-sonnet-4-thinking-16k curl Success -
claude-sonnet-4-thinking-16k curl-ssl Success -
claude-sonnet-4-thinking-16k curl-ssl Success -
claude-sonnet-4-thinking-16k curl-ssl Success -
claude-sonnet-4-thinking-16k curl-ssl-arm64-static Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-sonnet-4-thinking-16k curl-ssl-arm64-static Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-sonnet-4-thinking-16k curl-ssl-arm64-static Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-sonnet-4-thinking-16k curl-ssl-arm64-static2 Success -
claude-sonnet-4-thinking-16k curl-ssl-arm64-static2 Failure task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
claude-sonnet-4-thinking-16k curl-ssl-arm64-static2 Success -
claude-sonnet-4-thinking-16k jq Success -
claude-sonnet-4-thinking-16k jq Success -
claude-sonnet-4-thinking-16k jq Success -
claude-sonnet-4-thinking-16k jq-static Success -
claude-sonnet-4-thinking-16k jq-static Success -
claude-sonnet-4-thinking-16k jq-static Success -
claude-sonnet-4-thinking-16k jq-static-musl Success -
claude-sonnet-4-thinking-16k jq-static-musl Success -
claude-sonnet-4-thinking-16k jq-static-musl Success -
claude-sonnet-4-thinking-16k jq-windows Success -
claude-sonnet-4-thinking-16k jq-windows Success -
claude-sonnet-4-thinking-16k jq-windows Success -
claude-sonnet-4-thinking-16k jq-windows2 Success -
claude-sonnet-4-thinking-16k jq-windows2 Success -
claude-sonnet-4-thinking-16k jq-windows2 Success -
deepseek-v3.1 coreutils Success -
deepseek-v3.1 coreutils Success -
deepseek-v3.1 coreutils Success -
deepseek-v3.1 coreutils-old-version Success -
deepseek-v3.1 coreutils-old-version Success -
deepseek-v3.1 coreutils-old-version Failure task failed: install missing at /home/peter/result/install or not executable
deepseek-v3.1 coreutils-old-version-alpine Failure exceeded max tool calls (200)
deepseek-v3.1 coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
deepseek-v3.1 coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
deepseek-v3.1 coreutils-static Failure task failed: install missing at /home/peter/result/install or not executable
deepseek-v3.1 coreutils-static Success -
deepseek-v3.1 coreutils-static Success -
deepseek-v3.1 coreutils-static-alpine Success -
deepseek-v3.1 coreutils-static-alpine Success -
deepseek-v3.1 coreutils-static-alpine Failure task failed: sha1sum binary does not exist
deepseek-v3.1 cowsay Failure task failed: Cowsay binary does not exist
deepseek-v3.1 cowsay Success -
deepseek-v3.1 cowsay Success -
deepseek-v3.1 curl Success -
deepseek-v3.1 curl Success -
deepseek-v3.1 curl Success -
deepseek-v3.1 curl-ssl Success -
deepseek-v3.1 curl-ssl Success -
deepseek-v3.1 curl-ssl Success -
deepseek-v3.1 curl-ssl-arm64-static Failure task failed: curl-arm64 is not aarch64 architecture
deepseek-v3.1 curl-ssl-arm64-static Failure task failed: curl-arm64 is not statically linked
deepseek-v3.1 curl-ssl-arm64-static Failure task failed: curl-arm64 is not aarch64 architecture
deepseek-v3.1 curl-ssl-arm64-static2 Failure task failed: curl-arm64 is not statically linked
deepseek-v3.1 curl-ssl-arm64-static2 Failure task failed: curl-arm64 is not statically linked
deepseek-v3.1 curl-ssl-arm64-static2 Failure task failed: curl binary does not exist
deepseek-v3.1 jq Success -
deepseek-v3.1 jq Success -
deepseek-v3.1 jq Success -
deepseek-v3.1 jq-static Success -
deepseek-v3.1 jq-static Success -
deepseek-v3.1 jq-static Success -
deepseek-v3.1 jq-static-musl Success -
deepseek-v3.1 jq-static-musl Success -
deepseek-v3.1 jq-static-musl Success -
deepseek-v3.1 jq-windows Success -
deepseek-v3.1 jq-windows Failure task failed: jq help does not contain expected string
deepseek-v3.1 jq-windows Success -
deepseek-v3.1 jq-windows2 Success -
deepseek-v3.1 jq-windows2 Failure task failed: jq help does not contain expected string
deepseek-v3.1 jq-windows2 Failure task failed: jq help does not contain expected string
gemini-2.5-flash coreutils Success -
gemini-2.5-flash coreutils Failure task failed: sha1sum binary does not exist
gemini-2.5-flash coreutils Success -
gemini-2.5-flash coreutils-old-version Failure exceeded max tool calls (90)
gemini-2.5-flash coreutils-old-version Failure task failed: chroot missing at /home/peter/result/chroot or not executable
gemini-2.5-flash coreutils-old-version Failure task failed: sha1sum binary does not exist
gemini-2.5-flash coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
gemini-2.5-flash coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
gemini-2.5-flash coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
gemini-2.5-flash coreutils-static Success -
gemini-2.5-flash coreutils-static Success -
gemini-2.5-flash coreutils-static Failure task failed: sha1sum binary does not exist
gemini-2.5-flash coreutils-static-alpine Success -
gemini-2.5-flash coreutils-static-alpine Success -
gemini-2.5-flash coreutils-static-alpine Success -
gemini-2.5-flash cowsay Failure task failed: Cowsay does not contain expected string (eyes)
gemini-2.5-flash cowsay Success -
gemini-2.5-flash cowsay Success -
gemini-2.5-flash curl Success -
gemini-2.5-flash curl Failure task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported
gemini-2.5-flash curl Success -
gemini-2.5-flash curl-ssl Success -
gemini-2.5-flash curl-ssl Success -
gemini-2.5-flash curl-ssl Success -
gemini-2.5-flash curl-ssl-arm64-static Failure task failed: curl binary does not exist
gemini-2.5-flash curl-ssl-arm64-static Failure task failed: curl binary does not exist
gemini-2.5-flash curl-ssl-arm64-static Failure task failed: curl-arm64 is not aarch64 architecture
gemini-2.5-flash curl-ssl-arm64-static2 Failure task failed: curl binary does not exist
gemini-2.5-flash curl-ssl-arm64-static2 Failure task failed: curl-arm64 is not aarch64 architecture
gemini-2.5-flash curl-ssl-arm64-static2 Failure task failed: curl binary does not exist
gemini-2.5-flash jq Success -
gemini-2.5-flash jq Failure task failed: jq binary does not exist
gemini-2.5-flash jq Failure task failed: jq binary does not exist
gemini-2.5-flash jq-static Success -
gemini-2.5-flash jq-static Success -
gemini-2.5-flash jq-static Failure task failed: jq is not statically linked
gemini-2.5-flash jq-static-musl Failure task failed: jq binary does not exist
gemini-2.5-flash jq-static-musl Success -
gemini-2.5-flash jq-static-musl Failure task failed: jq binary does not exist
gemini-2.5-flash jq-windows Failure task failed: jq help does not contain expected string
gemini-2.5-flash jq-windows Failure task failed: jq.exe binary does not exist
gemini-2.5-flash jq-windows Failure task failed: jq.exe binary does not exist
gemini-2.5-flash jq-windows2 Failure task failed: jq.exe binary does not exist
gemini-2.5-flash jq-windows2 Failure task failed: jq help does not contain expected string
gemini-2.5-flash jq-windows2 Failure task failed: jq.exe binary does not exist
gemini-2.5-flash-thinking coreutils Success -
gemini-2.5-flash-thinking coreutils Success -
gemini-2.5-flash-thinking coreutils Success -
gemini-2.5-flash-thinking coreutils-old-version Failure task failed: df missing at /home/peter/result/df or not executable
gemini-2.5-flash-thinking coreutils-old-version Failure context timeout: context deadline exceeded
gemini-2.5-flash-thinking coreutils-old-version Failure context timeout: context deadline exceeded
gemini-2.5-flash-thinking coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
gemini-2.5-flash-thinking coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
gemini-2.5-flash-thinking coreutils-old-version-alpine Failure task failed: sha1sum binary does not exist
gemini-2.5-flash-thinking coreutils-static Success -
gemini-2.5-flash-thinking coreutils-static Success -
gemini-2.5-flash-thinking coreutils-static Success -
gemini-2.5-flash-thinking coreutils-static-alpine Success -
gemini-2.5-flash-thinking coreutils-static-alpine Success -
gemini-2.5-flash-thinking coreutils-static-alpine Failure LLM call failed: context deadline exceeded
gemini-2.5-flash-thinking cowsay Success -
gemini-2.5-flash-thinking cowsay Success -
gemini-2.5-flash-thinking cowsay Success -
gemini-2.5-flash-thinking curl Success -
gemini-2.5-flash-thinking curl Failure task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported
gemini-2.5-flash-thinking curl Success -
gemini-2.5-flash-thinking curl-ssl Success -
gemini-2.5-flash-thinking curl-ssl Success -
gemini-2.5-flash-thinking curl-ssl Success -
gemini-2.5-flash-thinking curl-ssl-arm64-static Failure task failed: curl binary does not exist
gemini-2.5-flash-thinking curl-ssl-arm64-static Failure task failed: curl binary does not exist
gemini-2.5-flash-thinking curl-ssl-arm64-static Failure task failed: curl-arm64 is not statically linked
gemini-2.5-flash-thinking curl-ssl-arm64-static2 Failure exceeded max tool calls (150)
gemini-2.5-flash-thinking curl-ssl-arm64-static2 Failure task failed: curl binary does not exist
gemini-2.5-flash-thinking curl-ssl-arm64-static2 Failure exceeded max tool calls (150)
gemini-2.5-flash-thinking jq Success -
gemini-2.5-flash-thinking jq Success -
gemini-2.5-flash-thinking jq Success -
gemini-2.5-flash-thinking jq-static Failure task failed: jq is not statically linked
gemini-2.5-flash-thinking jq-static Failure exceeded max tool calls (50)
gemini-2.5-flash-thinking jq-static Failure task failed: jq binary does not exist
gemini-2.5-flash-thinking jq-static-musl Failure task failed: jq is not statically linked
gemini-2.5-flash-thinking jq-static-musl Failure task failed: jq binary does not exist
gemini-2.5-flash-thinking jq-static-musl Success -
gemini-2.5-flash-thinking jq-windows Failure task failed: jq.exe binary does not exist
gemini-2.5-flash-thinking jq-windows Failure task failed: jq help does not contain expected string
gemini-2.5-flash-thinking jq-windows Failure task failed: jq help does not contain expected string
gemini-2.5-flash-thinking jq-windows2 Failure task failed: jq.exe binary does not exist
gemini-2.5-flash-thinking jq-windows2 Failure task failed: jq help does not contain expected string
gemini-2.5-flash-thinking jq-windows2 Failure task failed: jq.exe binary does not exist
gemini-2.5-pro | coreutils | Success | -
gemini-2.5-pro | coreutils | Success | -
gemini-2.5-pro | coreutils | Success | -
gemini-2.5-pro | coreutils-old-version | Failure | exceeded max tool calls (90)
gemini-2.5-pro | coreutils-old-version | Failure | task failed: chroot missing at /home/peter/result/chroot or not executable
gemini-2.5-pro | coreutils-old-version | Failure | task failed: chroot missing at /home/peter/result/chroot or not executable
gemini-2.5-pro | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gemini-2.5-pro | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gemini-2.5-pro | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gemini-2.5-pro | coreutils-static | Success | -
gemini-2.5-pro | coreutils-static | Success | -
gemini-2.5-pro | coreutils-static | Success | -
gemini-2.5-pro | coreutils-static-alpine | Success | -
gemini-2.5-pro | coreutils-static-alpine | Success | -
gemini-2.5-pro | coreutils-static-alpine | Success | -
gemini-2.5-pro | cowsay | Success | -
gemini-2.5-pro | cowsay | Success | -
gemini-2.5-pro | cowsay | Success | -
gemini-2.5-pro | curl | Success | -
gemini-2.5-pro | curl | Success | -
gemini-2.5-pro | curl | Success | -
gemini-2.5-pro | curl-ssl | Success | -
gemini-2.5-pro | curl-ssl | Success | -
gemini-2.5-pro | curl-ssl | Success | -
gemini-2.5-pro | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gemini-2.5-pro | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gemini-2.5-pro | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gemini-2.5-pro | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gemini-2.5-pro | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gemini-2.5-pro | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gemini-2.5-pro | jq | Success | -
gemini-2.5-pro | jq | Success | -
gemini-2.5-pro | jq | Success | -
gemini-2.5-pro | jq-static | Failure | task failed: jq is not statically linked
gemini-2.5-pro | jq-static | Failure | task failed: jq is not statically linked
gemini-2.5-pro | jq-static | Failure | task failed: jq is not statically linked
gemini-2.5-pro | jq-static-musl | Failure | task failed: jq binary does not exist
gemini-2.5-pro | jq-static-musl | Failure | task failed: jq is not statically linked
gemini-2.5-pro | jq-static-musl | Failure | task failed: jq is not statically linked
gemini-2.5-pro | jq-windows | Success | -
gemini-2.5-pro | jq-windows | Failure | task failed: jq help does not contain expected string
gemini-2.5-pro | jq-windows | Failure | task failed: jq.exe binary does not exist
gemini-2.5-pro | jq-windows2 | Success | -
gemini-2.5-pro | jq-windows2 | Success | -
gemini-2.5-pro | jq-windows2 | Failure | task failed: jq help does not contain expected string
glm-4.5 | coreutils | Success | -
glm-4.5 | coreutils | Success | -
glm-4.5 | coreutils | Success | -
glm-4.5 | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
glm-4.5 | coreutils-old-version | Success | -
glm-4.5 | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
glm-4.5 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
glm-4.5 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
glm-4.5 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
glm-4.5 | coreutils-static | Success | -
glm-4.5 | coreutils-static | Success | -
glm-4.5 | coreutils-static | Failure | task failed: sha1sum binary does not exist
glm-4.5 | coreutils-static-alpine | Success | -
glm-4.5 | coreutils-static-alpine | Success | -
glm-4.5 | coreutils-static-alpine | Success | -
glm-4.5 | cowsay | Success | -
glm-4.5 | cowsay | Success | -
glm-4.5 | cowsay | Success | -
glm-4.5 | curl | Success | -
glm-4.5 | curl | Failure | task failed: curl binary does not exist
glm-4.5 | curl | Failure | task failed: curl binary does not exist
glm-4.5 | curl-ssl | Failure | task failed: curl binary does not exist
glm-4.5 | curl-ssl | Success | -
glm-4.5 | curl-ssl | Success | -
glm-4.5 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
glm-4.5 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
glm-4.5 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
glm-4.5 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
glm-4.5 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
glm-4.5 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
glm-4.5 | jq | Success | -
glm-4.5 | jq | Success | -
glm-4.5 | jq | Failure | task failed: jq binary does not exist
glm-4.5 | jq-static | Success | -
glm-4.5 | jq-static | Success | -
glm-4.5 | jq-static | Failure | task failed: jq is not statically linked
glm-4.5 | jq-static-musl | Success | -
glm-4.5 | jq-static-musl | Failure | task failed: jq binary does not exist
glm-4.5 | jq-static-musl | Success | -
glm-4.5 | jq-windows | Failure | task failed: jq help does not contain expected string
glm-4.5 | jq-windows | Failure | task failed: jq help does not contain expected string
glm-4.5 | jq-windows | Success | -
glm-4.5 | jq-windows2 | Failure | task failed: jq help does not contain expected string
glm-4.5 | jq-windows2 | Failure | task failed: jq help does not contain expected string
glm-4.5 | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-4.1 | coreutils | Success | -
gpt-4.1 | coreutils | Success | -
gpt-4.1 | coreutils | Success | -
gpt-4.1 | coreutils-old-version | Failure | task failed: install missing at /home/peter/result/install or not executable
gpt-4.1 | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
gpt-4.1 | coreutils-old-version | Failure | task failed: groups missing at /home/peter/result/groups or not executable
gpt-4.1 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-4.1 | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable
gpt-4.1 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-4.1 | coreutils-static | Success | -
gpt-4.1 | coreutils-static | Success | -
gpt-4.1 | coreutils-static | Success | -
gpt-4.1 | coreutils-static-alpine | Success | -
gpt-4.1 | coreutils-static-alpine | Failure | task failed: kill missing at /home/peter/result/kill or not executable
gpt-4.1 | coreutils-static-alpine | Success | -
gpt-4.1 | cowsay | Success | -
gpt-4.1 | cowsay | Success | -
gpt-4.1 | cowsay | Success | -
gpt-4.1 | curl | Success | -
gpt-4.1 | curl | Success | -
gpt-4.1 | curl | Success | -
gpt-4.1 | curl-ssl | Success | -
gpt-4.1 | curl-ssl | Success | -
gpt-4.1 | curl-ssl | Success | -
gpt-4.1 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-4.1 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-4.1 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-4.1 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-4.1 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-4.1 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-4.1 | jq | Success | -
gpt-4.1 | jq | Success | -
gpt-4.1 | jq | Success | -
gpt-4.1 | jq-static | Success | -
gpt-4.1 | jq-static | Failure | task failed: jq is not statically linked
gpt-4.1 | jq-static | Success | -
gpt-4.1 | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-4.1 | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-4.1 | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-4.1 | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-4.1 | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-4.1 | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-4.1 | jq-windows2 | Success | -
gpt-4.1 | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-4.1 | jq-windows2 | Success | -
gpt-4.1-mini | coreutils | Success | -
gpt-4.1-mini | coreutils | Success | -
gpt-4.1-mini | coreutils | Success | -
gpt-4.1-mini | coreutils-old-version | Failure | exceeded max tool calls (90)
gpt-4.1-mini | coreutils-old-version | Success | -
gpt-4.1-mini | coreutils-old-version | Failure | exceeded max tool calls (90)
gpt-4.1-mini | coreutils-old-version-alpine | Failure | exceeded max tool calls (200)
gpt-4.1-mini | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-4.1-mini | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-4.1-mini | coreutils-static | Failure | task failed: install missing at /home/peter/result/install or not executable
gpt-4.1-mini | coreutils-static | Success | -
gpt-4.1-mini | coreutils-static | Success | -
gpt-4.1-mini | coreutils-static-alpine | Success | -
gpt-4.1-mini | coreutils-static-alpine | Success | -
gpt-4.1-mini | coreutils-static-alpine | Success | -
gpt-4.1-mini | cowsay | Failure | task failed: Cowsay does not contain expected string (eyes)
gpt-4.1-mini | cowsay | Success | -
gpt-4.1-mini | cowsay | Failure | context timeout: context deadline exceeded
gpt-4.1-mini | curl | Success | -
gpt-4.1-mini | curl | Success | -
gpt-4.1-mini | curl | Success | -
gpt-4.1-mini | curl-ssl | Success | -
gpt-4.1-mini | curl-ssl | Success | -
gpt-4.1-mini | curl-ssl | Success | -
gpt-4.1-mini | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-4.1-mini | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-4.1-mini | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-4.1-mini | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-4.1-mini | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-4.1-mini | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-4.1-mini | jq | Success | -
gpt-4.1-mini | jq | Success | -
gpt-4.1-mini | jq | Success | -
gpt-4.1-mini | jq-static | Failure | task failed: jq is not statically linked
gpt-4.1-mini | jq-static | Failure | task failed: jq is not statically linked
gpt-4.1-mini | jq-static | Failure | task failed: jq is not statically linked
gpt-4.1-mini | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-4.1-mini | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-4.1-mini | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-4.1-mini | jq-windows | Success | -
gpt-4.1-mini | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-4.1-mini | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-4.1-mini | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-4.1-mini | jq-windows2 | Success | -
gpt-4.1-mini | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-5-high | coreutils | Success | -
gpt-5-high | coreutils | Success | -
gpt-5-high | coreutils | Success | -
gpt-5-high | coreutils-old-version | Success | -
gpt-5-high | coreutils-old-version | Success | -
gpt-5-high | coreutils-old-version | Success | -
gpt-5-high | coreutils-old-version-alpine | Success | -
gpt-5-high | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable
gpt-5-high | coreutils-old-version-alpine | Success | -
gpt-5-high | coreutils-static | Success | -
gpt-5-high | coreutils-static | Success | -
gpt-5-high | coreutils-static | Success | -
gpt-5-high | coreutils-static-alpine | Success | -
gpt-5-high | coreutils-static-alpine | Success | -
gpt-5-high | coreutils-static-alpine | Failure | exceeded max tool calls (50)
gpt-5-high | cowsay | Success | -
gpt-5-high | cowsay | Success | -
gpt-5-high | cowsay | Success | -
gpt-5-high | curl | Success | -
gpt-5-high | curl | Success | -
gpt-5-high | curl | Success | -
gpt-5-high | curl-ssl | Success | -
gpt-5-high | curl-ssl | Success | -
gpt-5-high | curl-ssl | Success | -
gpt-5-high | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-high | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-high | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-high | curl-ssl-arm64-static2 | Success | -
gpt-5-high | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-high | curl-ssl-arm64-static2 | Success | -
gpt-5-high | jq | Success | -
gpt-5-high | jq | Success | -
gpt-5-high | jq | Success | -
gpt-5-high | jq-static | Success | -
gpt-5-high | jq-static | Success | -
gpt-5-high | jq-static | Success | -
gpt-5-high | jq-static-musl | Success | -
gpt-5-high | jq-static-musl | Success | -
gpt-5-high | jq-static-musl | Success | -
gpt-5-high | jq-windows | Success | -
gpt-5-high | jq-windows | Success | -
gpt-5-high | jq-windows | Success | -
gpt-5-high | jq-windows2 | Success | -
gpt-5-high | jq-windows2 | Success | -
gpt-5-high | jq-windows2 | Success | -
gpt-5-mini-high | coreutils | Success | -
gpt-5-mini-high | coreutils | Success | -
gpt-5-mini-high | coreutils | Success | -
gpt-5-mini-high | coreutils-old-version | Success | -
gpt-5-mini-high | coreutils-old-version | Success | -
gpt-5-mini-high | coreutils-old-version | Success | -
gpt-5-mini-high | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-high | coreutils-old-version-alpine | Failure | task failed: No success reported by script: all-utils-exists.sh
gpt-5-mini-high | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable
gpt-5-mini-high | coreutils-static | Success | -
gpt-5-mini-high | coreutils-static | Success | -
gpt-5-mini-high | coreutils-static | Success | -
gpt-5-mini-high | coreutils-static-alpine | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-high | coreutils-static-alpine | Success | -
gpt-5-mini-high | coreutils-static-alpine | Success | -
gpt-5-mini-high | cowsay | Success | -
gpt-5-mini-high | cowsay | Success | -
gpt-5-mini-high | cowsay | Success | -
gpt-5-mini-high | curl | Success | -
gpt-5-mini-high | curl | Success | -
gpt-5-mini-high | curl | Success | -
gpt-5-mini-high | curl-ssl | Success | -
gpt-5-mini-high | curl-ssl | Success | -
gpt-5-mini-high | curl-ssl | Success | -
gpt-5-mini-high | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-mini-high | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-mini-high | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gpt-5-mini-high | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-mini-high | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-mini-high | curl-ssl-arm64-static2 | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
gpt-5-mini-high | jq | Success | -
gpt-5-mini-high | jq | Success | -
gpt-5-mini-high | jq | Success | -
gpt-5-mini-high | jq-static | Success | -
gpt-5-mini-high | jq-static | Success | -
gpt-5-mini-high | jq-static | Success | -
gpt-5-mini-high | jq-static-musl | Success | -
gpt-5-mini-high | jq-static-musl | Success | -
gpt-5-mini-high | jq-static-musl | Success | -
gpt-5-mini-high | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-5-mini-high | jq-windows | Success | -
gpt-5-mini-high | jq-windows | Success | -
gpt-5-mini-high | jq-windows2 | Success | -
gpt-5-mini-high | jq-windows2 | Success | -
gpt-5-mini-high | jq-windows2 | Success | -
gpt-5-mini-minimal | coreutils | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-minimal | coreutils | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-minimal | coreutils | Success | -
gpt-5-mini-minimal | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-minimal | coreutils-old-version | Success | -
gpt-5-mini-minimal | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-minimal | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-minimal | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-minimal | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-5-mini-minimal | coreutils-static | Success | -
gpt-5-mini-minimal | coreutils-static | Failure | task failed: sha1sum is not statically linked
gpt-5-mini-minimal | coreutils-static | Success | -
gpt-5-mini-minimal | coreutils-static-alpine | Failure | task failed: kill missing at /home/peter/result/kill or not executable
gpt-5-mini-minimal | coreutils-static-alpine | Success | -
gpt-5-mini-minimal | coreutils-static-alpine | Success | -
gpt-5-mini-minimal | cowsay | Success | -
gpt-5-mini-minimal | cowsay | Failure | context timeout: context deadline exceeded
gpt-5-mini-minimal | cowsay | Failure | context timeout: context deadline exceeded
gpt-5-mini-minimal | curl | Failure | task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported
gpt-5-mini-minimal | curl | Failure | task failed: curl binary does not exist
gpt-5-mini-minimal | curl | Failure | task failed: curl binary does not exist
gpt-5-mini-minimal | curl-ssl | Success | -
gpt-5-mini-minimal | curl-ssl | Success | -
gpt-5-mini-minimal | curl-ssl | Success | -
gpt-5-mini-minimal | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-5-mini-minimal | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
gpt-5-mini-minimal | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not aarch64 architecture
gpt-5-mini-minimal | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-5-mini-minimal | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-5-mini-minimal | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-5-mini-minimal | jq | Success | -
gpt-5-mini-minimal | jq | Success | -
gpt-5-mini-minimal | jq | Success | -
gpt-5-mini-minimal | jq-static | Failure | task failed: jq is not statically linked
gpt-5-mini-minimal | jq-static | Failure | task failed: jq binary does not exist
gpt-5-mini-minimal | jq-static | Failure | task failed: jq is not statically linked
gpt-5-mini-minimal | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-5-mini-minimal | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-5-mini-minimal | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-5-mini-minimal | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-5-mini-minimal | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-5-mini-minimal | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-5-mini-minimal | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-5-mini-minimal | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-5-mini-minimal | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-5-minimal | coreutils | Failure | task failed: cat missing at /home/peter/result/cat or not executable
gpt-5-minimal | coreutils | Success | -
gpt-5-minimal | coreutils | Success | -
gpt-5-minimal | coreutils-old-version | Success | -
gpt-5-minimal | coreutils-old-version | Success | -
gpt-5-minimal | coreutils-old-version | Success | -
gpt-5-minimal | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-5-minimal | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-5-minimal | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable
gpt-5-minimal | coreutils-static | Success | -
gpt-5-minimal | coreutils-static | Failure | task failed: install missing at /home/peter/result/install or not executable
gpt-5-minimal | coreutils-static | Failure | task failed: install missing at /home/peter/result/install or not executable
gpt-5-minimal | coreutils-static-alpine | Failure | task failed: kill missing at /home/peter/result/kill or not executable
gpt-5-minimal | coreutils-static-alpine | Failure | task failed: install missing at /home/peter/result/install or not executable
gpt-5-minimal | coreutils-static-alpine | Success | -
gpt-5-minimal | cowsay | Success | -
gpt-5-minimal | cowsay | Success | -
gpt-5-minimal | cowsay | Failure | task failed: Cowsay help does not contain expected string
gpt-5-minimal | curl | Success | -
gpt-5-minimal | curl | Success | -
gpt-5-minimal | curl | Success | -
gpt-5-minimal | curl-ssl | Success | -
gpt-5-minimal | curl-ssl | Success | -
gpt-5-minimal | curl-ssl | Success | -
gpt-5-minimal | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gpt-5-minimal | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gpt-5-minimal | curl-ssl-arm64-static | Failure | task failed: curl-arm64 binary does not exist
gpt-5-minimal | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
gpt-5-minimal | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
gpt-5-minimal | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not aarch64 architecture
gpt-5-minimal | jq | Success | -
gpt-5-minimal | jq | Success | -
gpt-5-minimal | jq | Success | -
gpt-5-minimal | jq-static | Success | -
gpt-5-minimal | jq-static | Failure | task failed: jq is not statically linked
gpt-5-minimal | jq-static | Success | -
gpt-5-minimal | jq-static-musl | Success | -
gpt-5-minimal | jq-static-musl | Failure | task failed: jq is not statically linked
gpt-5-minimal | jq-static-musl | Success | -
gpt-5-minimal | jq-windows | Success | -
gpt-5-minimal | jq-windows | Success | -
gpt-5-minimal | jq-windows | Success | -
gpt-5-minimal | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-5-minimal | jq-windows2 | Failure | task failed: jq.exe is not an amd64 Windows executable
gpt-5-minimal | jq-windows2 | Success | -
gpt-oss-120b-high | coreutils | Success | -
gpt-oss-120b-high | coreutils | Success | -
gpt-oss-120b-high | coreutils | Success | -
gpt-oss-120b-high | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
gpt-oss-120b-high | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
gpt-oss-120b-high | coreutils-old-version | Failure | task failed: sha1sum binary does not exist
gpt-oss-120b-high | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-oss-120b-high | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-oss-120b-high | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
gpt-oss-120b-high | coreutils-static | Success | -
gpt-oss-120b-high | coreutils-static | Success | -
gpt-oss-120b-high | coreutils-static | Success | -
gpt-oss-120b-high | coreutils-static-alpine | Success | -
gpt-oss-120b-high | coreutils-static-alpine | Failure | task failed: sha1sum binary does not exist
gpt-oss-120b-high | coreutils-static-alpine | Success | -
gpt-oss-120b-high | cowsay | Failure | task failed: Cowsay binary does not exist
gpt-oss-120b-high | cowsay | Success | -
gpt-oss-120b-high | cowsay | Success | -
gpt-oss-120b-high | curl | Failure | task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported
gpt-oss-120b-high | curl | Success | -
gpt-oss-120b-high | curl | Failure | task failed: curl binary does not exist
gpt-oss-120b-high | curl-ssl | Success | -
gpt-oss-120b-high | curl-ssl | Failure | task failed: curl binary does not exist
gpt-oss-120b-high | curl-ssl | Success | -
gpt-oss-120b-high | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gpt-oss-120b-high | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gpt-oss-120b-high | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
gpt-oss-120b-high | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-oss-120b-high | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
gpt-oss-120b-high | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
gpt-oss-120b-high | jq | Success | -
gpt-oss-120b-high | jq | Success | -
gpt-oss-120b-high | jq | Success | -
gpt-oss-120b-high | jq-static | Success | -
gpt-oss-120b-high | jq-static | Failure | task failed: jq is not statically linked
gpt-oss-120b-high | jq-static | Failure | task failed: jq binary does not exist
gpt-oss-120b-high | jq-static-musl | Success | -
gpt-oss-120b-high | jq-static-musl | Success | -
gpt-oss-120b-high | jq-static-musl | Failure | task failed: jq binary does not exist
gpt-oss-120b-high | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-oss-120b-high | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-oss-120b-high | jq-windows | Failure | task failed: jq help does not contain expected string
gpt-oss-120b-high | jq-windows2 | Failure | task failed: jq help does not contain expected string
gpt-oss-120b-high | jq-windows2 | Failure | context timeout: context deadline exceeded
gpt-oss-120b-high | jq-windows2 | Failure | task failed: jq.exe binary does not exist
grok-4 | coreutils | Success | -
grok-4 | coreutils | Success | -
grok-4 | coreutils | Success | -
grok-4 | coreutils-old-version | Success | -
grok-4 | coreutils-old-version | Success | -
grok-4 | coreutils-old-version | Success | -
grok-4 | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable
grok-4 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
grok-4 | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable
grok-4 | coreutils-static | Success | -
grok-4 | coreutils-static | Success | -
grok-4 | coreutils-static | Success | -
grok-4 | coreutils-static-alpine | Success | -
grok-4 | coreutils-static-alpine | Success | -
grok-4 | coreutils-static-alpine | Failure | context timeout: context deadline exceeded
grok-4 | cowsay | Success | -
grok-4 | cowsay | Success | -
grok-4 | cowsay | Success | -
grok-4 | curl | Success | -
grok-4 | curl | Success | -
grok-4 | curl | Success | -
grok-4 | curl-ssl | Success | -
grok-4 | curl-ssl | Success | -
grok-4 | curl-ssl | Success | -
grok-4 | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
grok-4 | curl-ssl-arm64-static | Failure | task failed: curl HTTPS request to google.com did not return content-type: text/html but instead: } [2 bytes data] * SSL...
grok-4 | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
grok-4 | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
grok-4 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
grok-4 | curl-ssl-arm64-static2 | Success | -
grok-4 | jq | Success | -
grok-4 | jq | Success | -
grok-4 | jq | Success | -
grok-4 | jq-static | Success | -
grok-4 | jq-static | Success | -
grok-4 | jq-static | Success | -
grok-4 | jq-static-musl | Failure | task failed: jq binary does not exist
grok-4 | jq-static-musl | Success | -
grok-4 | jq-static-musl | Failure | task failed: jq binary does not exist
grok-4 | jq-windows | Success | -
grok-4 | jq-windows | Failure | task failed: jq help does not contain expected string
grok-4 | jq-windows | Success | -
grok-4 | jq-windows2 | Success | -
grok-4 | jq-windows2 | Failure | task failed: jq help does not contain expected string
grok-4 | jq-windows2 | Success | -
grok-code-fast-1 | coreutils | Success | -
grok-code-fast-1 | coreutils | Success | -
grok-code-fast-1 | coreutils | Success | -
grok-code-fast-1 | coreutils-old-version | Success | -
grok-code-fast-1 | coreutils-old-version | Failure | task failed: install missing at /home/peter/result/install or not executable
grok-code-fast-1 | coreutils-old-version | Success | -
grok-code-fast-1 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
grok-code-fast-1 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
grok-code-fast-1 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
grok-code-fast-1 | coreutils-static | Success | -
grok-code-fast-1 | coreutils-static | Success | -
grok-code-fast-1 | coreutils-static | Success | -
grok-code-fast-1 | coreutils-static-alpine | Success | -
grok-code-fast-1 | coreutils-static-alpine | Success | -
grok-code-fast-1 | coreutils-static-alpine | Success | -
grok-code-fast-1 | cowsay | Success | -
grok-code-fast-1 | cowsay | Success | -
grok-code-fast-1 | cowsay | Success | -
grok-code-fast-1 | curl | Success | -
grok-code-fast-1 | curl | Failure | task failed: curl did not download the expected local file content, but instead: curl: (1) Protocol "file" not supported
grok-code-fast-1 | curl | Success | -
grok-code-fast-1 | curl-ssl | Success | -
grok-code-fast-1 | curl-ssl | Success | -
grok-code-fast-1 | curl-ssl | Success | -
grok-code-fast-1 | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
grok-code-fast-1 | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
grok-code-fast-1 | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
grok-code-fast-1 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
grok-code-fast-1 | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
grok-code-fast-1 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
grok-code-fast-1 | jq | Success | -
grok-code-fast-1 | jq | Success | -
grok-code-fast-1 | jq | Success | -
grok-code-fast-1 | jq-static | Success | -
grok-code-fast-1 | jq-static | Success | -
grok-code-fast-1 | jq-static | Success | -
grok-code-fast-1 | jq-static-musl | Success | -
grok-code-fast-1 | jq-static-musl | Failure | task failed: jq is not statically linked
grok-code-fast-1 | jq-static-musl | Success | -
grok-code-fast-1 | jq-windows | Failure | task failed: jq help does not contain expected string
grok-code-fast-1 | jq-windows | Failure | task failed: jq help does not contain expected string
grok-code-fast-1 | jq-windows | Failure | task failed: jq help does not contain expected string
grok-code-fast-1 | jq-windows2 | Failure | task failed: jq help does not contain expected string
grok-code-fast-1 | jq-windows2 | Success | -
grok-code-fast-1 | jq-windows2 | Success | -
kimi-k2-0905 | coreutils | Success | -
kimi-k2-0905 | coreutils | Success | -
kimi-k2-0905 | coreutils | Failure | task failed: sha1sum binary does not exist
kimi-k2-0905 | coreutils-old-version | Success | -
kimi-k2-0905 | coreutils-old-version | Failure | task failed: chroot missing at /home/peter/result/chroot or not executable
kimi-k2-0905 | coreutils-old-version | Success | -
kimi-k2-0905 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
kimi-k2-0905 | coreutils-old-version-alpine | Failure | task failed: No success reported by script: all-utils-exists.sh
kimi-k2-0905 | coreutils-old-version-alpine | Failure | task failed: sha1sum binary does not exist
kimi-k2-0905 | coreutils-static | Failure | task failed: sha1sum binary does not exist
kimi-k2-0905 | coreutils-static | Failure | task failed: sha1sum binary does not exist
kimi-k2-0905 | coreutils-static | Success | -
kimi-k2-0905 | coreutils-static-alpine | Success | -
kimi-k2-0905 | coreutils-static-alpine | Success | -
kimi-k2-0905 | coreutils-static-alpine | Success | -
kimi-k2-0905 | cowsay | Success | -
kimi-k2-0905 | cowsay | Success | -
kimi-k2-0905 | cowsay | Success | -
kimi-k2-0905 | curl | Success | -
kimi-k2-0905 | curl | Success | -
kimi-k2-0905 | curl | Success | -
kimi-k2-0905 | curl-ssl | Success | -
kimi-k2-0905 | curl-ssl | Success | -
kimi-k2-0905 | curl-ssl | Success | -
kimi-k2-0905 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
kimi-k2-0905 | curl-ssl-arm64-static | Failure | task failed: curl binary does not exist
kimi-k2-0905 | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not aarch64 architecture
kimi-k2-0905 | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
kimi-k2-0905 | curl-ssl-arm64-static2 | Failure | task failed: curl binary does not exist
kimi-k2-0905 | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
kimi-k2-0905 | jq | Failure | task failed: jq binary does not exist
kimi-k2-0905 | jq | Success | -
kimi-k2-0905 | jq | Success | -
kimi-k2-0905 | jq-static | Success | -
kimi-k2-0905 | jq-static | Success | -
kimi-k2-0905 | jq-static | Success | -
kimi-k2-0905 | jq-static-musl | Failure | task failed: jq is not statically linked
kimi-k2-0905 | jq-static-musl | Failure | task failed: jq is not statically linked
kimi-k2-0905 | jq-static-musl | Success | -
kimi-k2-0905 | jq-windows | Failure | task failed: jq help does not contain expected string
kimi-k2-0905 | jq-windows | Success | -
kimi-k2-0905 | jq-windows | Success | -
kimi-k2-0905 | jq-windows2 | Failure | task failed: jq help does not contain expected string
kimi-k2-0905 | jq-windows2 | Failure | task failed: jq help does not contain expected string
kimi-k2-0905 | jq-windows2 | Success | -
qwen3-max | coreutils | Success | -
qwen3-max | coreutils | Success | -
qwen3-max | coreutils | Success | -
qwen3-max | coreutils-old-version | Failure | exceeded max cost dollars (max=$3.00, current=3.01)
qwen3-max | coreutils-old-version | Success | -
qwen3-max | coreutils-old-version | Success | -
qwen3-max | coreutils-old-version-alpine | Failure | task failed: df missing at /home/peter/result/df or not executable
qwen3-max | coreutils-old-version-alpine | Failure | exceeded max cost dollars (max=$10.00, current=10.25)
qwen3-max | coreutils-old-version-alpine | Failure | exceeded max cost dollars (max=$10.00, current=10.35)
qwen3-max | coreutils-static | Success | -
qwen3-max | coreutils-static | Failure | task failed: sha1sum is not statically linked
qwen3-max | coreutils-static | Failure | task failed: sha1sum is not statically linked
qwen3-max | coreutils-static-alpine | Success | -
qwen3-max | coreutils-static-alpine | Success | -
qwen3-max | coreutils-static-alpine | Success | -
qwen3-max | cowsay | Success | -
qwen3-max | cowsay | Success | -
qwen3-max | cowsay | Failure | task failed: Cowsay does not contain expected string (eyes)
qwen3-max | curl | Success | -
qwen3-max | curl | Success | -
qwen3-max | curl | Success | -
qwen3-max | curl-ssl | Success | -
qwen3-max | curl-ssl | Success | -
qwen3-max | curl-ssl | Success | -
qwen3-max | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not aarch64 architecture
qwen3-max | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
qwen3-max | curl-ssl-arm64-static | Failure | task failed: curl-arm64 is not statically linked
qwen3-max | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
qwen3-max | curl-ssl-arm64-static2 | Failure | task failed: curl-arm64 is not statically linked
qwen3-max | curl-ssl-arm64-static2 | Failure | exceeded max cost dollars (max=$10.00, current=10.32)
qwen3-max | jq | Success | -
qwen3-max | jq | Success | -
qwen3-max | jq | Success | -
qwen3-max | jq-static | Failure | task failed: jq is not statically linked
qwen3-max | jq-static | Failure | task failed: jq is not statically linked
qwen3-max | jq-static | Failure | task failed: jq is not statically linked
qwen3-max | jq-static-musl | Success | -
qwen3-max | jq-static-musl | Success | -
qwen3-max | jq-static-musl | Success | -
qwen3-max | jq-windows | Success | -
qwen3-max | jq-windows | Failure | task failed: jq help does not contain expected string
qwen3-max | jq-windows | Failure | task failed: jq help does not contain expected string
qwen3-max | jq-windows2 | Failure | task failed: jq.exe binary does not exist
qwen3-max | jq-windows2 | Failure | task failed: jq.exe binary does not exist
qwen3-max | jq-windows2 | Failure | task failed: jq.exe binary does not exist
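
The per-attempt rows above can be rolled up into the leaderboard's two scores. The following is a minimal sketch, not the benchmark's own scoring code. It assumes that pass@1 is the share of all individual attempts that succeeded and that pass@3 is the share of tasks with at least one success across their three attempts. The success counts are transcribed from the qwen3-max rows, and the names `successes`, `pass_at_1` and `pass_at_3` are hypothetical.

```python
# Number of successful attempts (out of 3) per task, transcribed from the
# qwen3-max rows in the attempts table. Names here are illustrative only.
ATTEMPTS_PER_TASK = 3

successes = {
    "coreutils": 3,
    "coreutils-old-version": 2,
    "coreutils-old-version-alpine": 0,
    "coreutils-static": 1,
    "coreutils-static-alpine": 3,
    "cowsay": 2,
    "curl": 3,
    "curl-ssl": 3,
    "curl-ssl-arm64-static": 0,
    "curl-ssl-arm64-static2": 0,
    "jq": 3,
    "jq-static": 0,
    "jq-static-musl": 3,
    "jq-windows": 1,
    "jq-windows2": 0,
}


def pass_at_1(successes, attempts_per_task=ATTEMPTS_PER_TASK):
    """Assumed definition: fraction of all individual attempts that succeeded."""
    total_attempts = len(successes) * attempts_per_task
    return sum(successes.values()) / total_attempts


def pass_at_3(successes):
    """Assumed definition: fraction of tasks with at least one successful attempt."""
    solved = sum(1 for count in successes.values() if count > 0)
    return solved / len(successes)


print(f"pass@1 = {pass_at_1(successes):.0%}")  # 24 of 45 attempts
print(f"pass@3 = {pass_at_3(successes):.0%}")  # 10 of 15 tasks
```

Under these assumptions the sketch gives 53% and 67%, which agrees with the 53% / 67% listed for qwen3-max in the ranking at the top of the page.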