Package evaluation of LoopManagers on Julia 1.13.0-DEV.860 (6ddb3d6410*) started at 2025-07-13T14:28:06.886 ################################################################################ # Set-up # Installing PkgEval dependencies (TestEnv)... Set-up completed after 7.72s ################################################################################ # Installation # Installing LoopManagers... Resolving package versions... Updating `~/.julia/environments/v1.13/Project.toml` [d33d4f76] + LoopManagers v0.2.0 Updating `~/.julia/environments/v1.13/Manifest.toml` [79e6a3ab] + Adapt v4.3.0 [4fba245c] + ArrayInterface v7.19.0 [62783981] + BitTwiddlingConvenienceFunctions v0.1.6 [2a0fbf3d] + CPUSummary v0.2.6 [fb6a15b2] + CloseOpenIntervals v0.1.13 [f70d9fcc] + CommonWorldInvalidations v1.0.0 [34da2185] + Compat v4.17.0 [adafc99b] + CpuId v0.3.1 [3e5b6fbb] + HostCPUFeatures v0.1.17 [615f187c] + IfElse v0.1.1 [10f19ff3] + LayoutPointers v0.1.17 [d33d4f76] + LoopManagers v0.2.0 [1914dd2f] + MacroTools v0.5.16 [36a4119d] + ManagedLoops v0.1.13 [d125e4d3] + ManualMemory v0.1.8 [65ce6f38] + PackageExtensionCompat v1.0.2 [f517fe37] + Polyester v0.7.18 [1d0040c9] + PolyesterWeave v0.2.2 [aea7be01] + PrecompileTools v1.3.2 [21216c6a] + Preferences v1.4.3 [ae029012] + Requires v1.3.1 [fdea26ae] + SIMD v3.7.1 [94e857df] + SIMDTypes v0.1.0 [aedffcd0] + Static v1.2.0 [0d7ed370] + StaticArrayInterface v1.8.0 [7792a7ef] + StrideArraysCore v0.5.7 [8290d209] + ThreadingUtilities v0.5.5 [56f22d72] + Artifacts v1.11.0 [2a0f44e3] + Base64 v1.11.0 [ade2ca70] + Dates v1.11.0 [ac6e5ff7] + JuliaSyntaxHighlighting v1.12.0 [8f399da3] + Libdl v1.11.0 [37e2e46d] + LinearAlgebra v1.12.0 [d6f4376e] + Markdown v1.11.0 [de0858da] + Printf v1.11.0 [9a3f8284] + Random v1.11.0 [ea8e919c] + SHA v0.7.0 [f489334b] + StyledStrings v1.11.0 [fa267f1f] + TOML v1.0.3 [cf7118a7] + UUIDs v1.11.0 [4ec0a83e] + Unicode v1.11.0 [e66e0078] + CompilerSupportLibraries_jll v1.3.0+1 [4536629a] + OpenBLAS_jll v0.3.29+0 [8e850b90] + libblastrampoline_jll v5.13.1+0 Installation completed after 3.78s ################################################################################ # Precompilation # Precompiling PkgEval dependencies... Precompiling package dependencies... Precompilation completed after 44.54s ################################################################################ # Testing # Testing LoopManagers Status `/tmp/jl_tICf7B/Project.toml` [0ca39b1e] Chairmarks v1.3.1 [63c18a36] KernelAbstractions v0.9.37 [d33d4f76] LoopManagers v0.2.0 [36a4119d] ManagedLoops v0.1.13 [d22a7203] SIMDMathFunctions v0.1.3 [811555cd] ThreadPinning v1.0.2 [b77e0a4c] InteractiveUtils v1.11.0 [8dfed614] Test v1.11.0 Status `/tmp/jl_tICf7B/Manifest.toml` [79e6a3ab] Adapt v4.3.0 [4fba245c] ArrayInterface v7.19.0 [a9b6321e] Atomix v1.1.1 [62783981] BitTwiddlingConvenienceFunctions v0.1.6 [fa961155] CEnum v0.5.0 [2a0fbf3d] CPUSummary v0.2.6 [0ca39b1e] Chairmarks v1.3.1 [fb6a15b2] CloseOpenIntervals v0.1.13 [f70d9fcc] CommonWorldInvalidations v1.0.0 [34da2185] Compat v4.17.0 [adafc99b] CpuId v0.3.1 [8bb1440f] DelimitedFiles v1.9.1 [3e5b6fbb] HostCPUFeatures v0.1.17 [0e44f5e4] Hwloc v3.3.0 [615f187c] IfElse v0.1.1 [692b3bcd] JLLWrappers v1.7.0 [63c18a36] KernelAbstractions v0.9.37 [10f19ff3] LayoutPointers v0.1.17 [d33d4f76] LoopManagers v0.2.0 [1914dd2f] MacroTools v0.5.16 [36a4119d] ManagedLoops v0.1.13 [d125e4d3] ManualMemory v0.1.8 [65ce6f38] PackageExtensionCompat v1.0.2 [f517fe37] Polyester v0.7.18 [1d0040c9] PolyesterWeave v0.2.2 [aea7be01] PrecompileTools v1.3.2 [21216c6a] Preferences v1.4.3 [ae029012] Requires v1.3.1 [fdea26ae] SIMD v3.7.1 [d22a7203] SIMDMathFunctions v0.1.3 [94e857df] SIMDTypes v0.1.0 [476501e8] SLEEFPirates v0.6.43 [91464d47] StableTasks v0.1.7 [aedffcd0] Static v1.2.0 [0d7ed370] StaticArrayInterface v1.8.0 [90137ffa] StaticArrays v1.9.13 [1e83bf80] StaticArraysCore v1.4.3 [7792a7ef] StrideArraysCore v0.5.7 [90a7ee08] SysInfo v0.3.0 [811555cd] ThreadPinning v1.0.2 [6f48bc29] ThreadPinningCore v0.4.5 [8290d209] ThreadingUtilities v0.5.5 [013be700] UnsafeAtomics v0.3.0 [3d5dd08c] VectorizationBase v0.21.71 [e33a78d0] Hwloc_jll v2.12.1+0 [56f22d72] Artifacts v1.11.0 [2a0f44e3] Base64 v1.11.0 [ade2ca70] Dates v1.11.0 [b77e0a4c] InteractiveUtils v1.11.0 [ac6e5ff7] JuliaSyntaxHighlighting v1.12.0 [8f399da3] Libdl v1.11.0 [37e2e46d] LinearAlgebra v1.12.0 [56ddb016] Logging v1.11.0 [d6f4376e] Markdown v1.11.0 [a63ad114] Mmap v1.11.0 [de0858da] Printf v1.11.0 [9a3f8284] Random v1.11.0 [ea8e919c] SHA v0.7.0 [9e88b42a] Serialization v1.11.0 [f489334b] StyledStrings v1.11.0 [fa267f1f] TOML v1.0.3 [8dfed614] Test v1.11.0 [cf7118a7] UUIDs v1.11.0 [4ec0a83e] Unicode v1.11.0 [e66e0078] CompilerSupportLibraries_jll v1.3.0+1 [4536629a] OpenBLAS_jll v0.3.29+0 [8e850b90] libblastrampoline_jll v5.13.1+0 Testing Running tests... Precompiling packages... 26998.7 ms ✓ SIMDMathFunctions 1 dependency successfully precompiled in 27 seconds. 30 already precompiled. Precompiling packages... 2491.6 ms ✓ StaticArrayInterface → StaticArrayInterfaceStaticArraysExt 1 dependency successfully precompiled in 4 seconds. 20 already precompiled. Precompiling packages... 969.4 ms ✓ ManagedLoops → ManagedLoopsAdaptExt 1 dependency successfully precompiled in 1 seconds. 6 already precompiled. Precompiling packages... 3229.9 ms ✓ ThreadPinningCore 11310.4 ms ✓ SysInfo Info Given ThreadPinning was explicitly requested, output will be shown live  ┌ Warning: Error return code │ ret = -22 └ @ ThreadPinningCore.Internals ~/.julia/packages/ThreadPinningCore/fdkhT/src/internals.jl:173 15642.0 ms ✓ ThreadPinning 3 dependencies successfully precompiled in 30 seconds. 14 already precompiled. 1 dependency had output during precompilation: ┌ ThreadPinning │ [Output was shown above] └ Hostname: LoopManagers-primary-UI9FneRl CPU(s): 2 x AMD EPYC 7502 32-Core Processor CPU target: znver2 Cores: 64 (128 CPU-threads due to 2-way SMT) NUMA domains: 2 (32 cores each) Julia threads: 1 CPU socket 1 0,64, 1,65, 2,66, 3,67, 4,68, 5,69, 6,70, 7,71, 8,72, 9,73, 10,74, 11,75, 12,76, 13,77, 14,78, 15,79, 16,80, 17,81, 18,82, 19,83, 20,84, 21,85, 22,86, 23,87, 24,88, 25,89, 26,90, 27,91, 28,92, 29,93, 30,94, 31,95 CPU socket 2 32,96, 33,97, 34,98, 35,99, 36,100, 37,101, 38,102, 39,103, 40,104, 41,105, 42,106, 43,107, 44,108, 45,109, 46,110, 47,111, 48,112, 49,113, 50,114, 51,115, 52,116, 53,117, 54,118, 55,119, 56,120, 57,121, 58,122, 59,123, 60,124, 61,125, 62,126, 63,127 # = Julia thread, # = Julia thread on HT, # = >1 Julia thread (Mapping: 1 => 0,) Julia Version 1.13.0-DEV.860 Commit 6ddb3d6410* (2025-07-12 21:53 UTC) Platform Info: OS: Linux (x86_64-linux-gnu) CPU: 128 × AMD EPYC 7502 32-Core Processor WORD_SIZE: 64 LLVM: libLLVM-20.1.2 (ORCJIT, znver2) GC: Built with stock GC Threads: 1 default, 0 interactive, 1 GC (on 1 virtual cores) Environment: JULIA_CPU_THREADS = 1 JULIA_NUM_PRECOMPILE_TASKS = 1 JULIA_PKG_PRECOMPILE_AUTO = 0 JULIA_PKGEVAL = true JULIA_DEPOT_PATH = /home/pkgeval/.julia:/usr/local/share/julia: JULIA_NUM_THREADS = 1 JULIA_LOAD_PATH = @:/tmp/jl_tICf7B WARNING: llvmcall with integer pointers is deprecated. Use actual pointers instead, replacing i32 or i64 with i8* or ptr in myfun(Any) at /home/pkgeval/.julia/packages/LoopManagers/RRavr/test/runtests.jl [ Info: ====== Multi-thread scaling: compute-bound loop ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.020090077 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_1 vlen=4 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.00059482275 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_2 vlen=4 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.0009050209 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_3 vlen=4 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.0010950123 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_1 vlen=16 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.00029862835 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_2 vlen=16 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.00029698445 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_3 vlen=16 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.0003713707 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_1 vlen=64 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.00020733764 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_2 vlen=64 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.00020820292 100.0% 100.0% [ Info: ====== Multi-thread scaling: reverse cumsum_3 vlen=64 ====== [ Info: Threads elapsed speedup efficiency [ Info: 1 0.00019143173 100.0% 100.0% [ Info: Testing MainThread with 10 threads. [ Info: Worker 1 [ Info: Worker 2 [ Info: Worker 3 [ Info: Worker 4 [ Info: Worker 5 [ Info: Worker 6 [ Info: Worker 7 [ Info: Worker 8 [ Info: Worker 9 [ Info: Worker 10 Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Thread 1 has drawn -0.22327894001693138. Test Summary: | Pass Total Time OpenMP-like manager | 1 1 1.8s [ Info: PlainCPU() Benchmark: 23 samples with 1 evaluation min 44.140 ms median 44.575 ms mean 44.981 ms max 47.920 ms [ Info: VectorizedCPU(8) Benchmark: 81 samples with 1 evaluation min 11.866 ms median 12.322 ms mean 12.341 ms max 14.222 ms [ Info: MultiThread(PlainCPU(), 1) Benchmark: 23 samples with 1 evaluation min 43.528 ms median 44.056 ms mean 44.445 ms max 48.749 ms [ Info: MultiThread(VectorizedCPU(8), 1) Benchmark: 82 samples with 1 evaluation min 11.858 ms median 12.156 ms mean 12.205 ms max 14.877 ms [ Info: MultiThread(VectorizedCPU(16), 1) Benchmark: 99 samples with 1 evaluation min 9.987 ms median 10.153 ms mean 10.182 ms max 10.676 ms [ Info: MultiThread(VectorizedCPU(32), 1) Benchmark: 87 samples with 1 evaluation min 11.116 ms median 11.496 ms mean 11.488 ms max 11.893 ms [ Info: MainThread(VectorizedCPU(8)) Benchmark: 84 samples with 1 evaluation min 11.644 ms median 11.956 ms mean 11.946 ms max 12.248 ms [ Info: tune([PlainCPU(),VectorizedCPU(8),MultiThread(PlainCPU(), 1),MultiThread(VectorizedCPU(8), 1),MultiThread(VectorizedCPU(16), 1),MultiThread(VectorizedCPU(32), 1)]) Benchmark: 7 samples with 1 evaluation min 12.161 ms (4 allocs: 128 bytes) median 45.456 ms (8 allocs: 256 bytes) mean 149.146 ms (38957.29 allocs: 2.146 MiB, 38.91% compile time) max 392.929 ms (165699 allocs: 9.177 MiB, 96.93% compile time) Test Summary: | Pass Total Time SIMD, multithread and auto-tuned managers | 8 8 34.8s Test Summary: | Pass Total Time Managed broadcasting | 16 16 21.0s Testing LoopManagers tests passed Testing completed after 232.43s PkgEval succeeded after 307.92s