Package evaluation to test SIMD on Julia 1.14.0-DEV.1601 (79ea5eb99c*) started at 2026-01-25T12:50:47.113

################################################################################
# Set-up
#

Installing PkgEval dependencies (TestEnv)...
  Activating project at `~/.julia/environments/v1.14`
Set-up completed after 7.45s

################################################################################
# Installation
#

Installing SIMD...
   Resolving package versions...
    Updating `~/.julia/environments/v1.14/Project.toml`
  [fdea26ae] + SIMD v3.7.2
    Updating `~/.julia/environments/v1.14/Manifest.toml`
  [aea7be01] + PrecompileTools v1.3.3
  [21216c6a] + Preferences v1.5.1
  [fdea26ae] + SIMD v3.7.2
  [ade2ca70] + Dates v1.11.0
  [de0858da] + Printf v1.11.0
  [fa267f1f] + TOML v1.0.3
  [4ec0a83e] + Unicode v1.11.0
Installation completed after 0.67s

################################################################################
# Precompilation
#

Precompiling PkgEval dependencies...
Precompiling package dependencies...
Precompiling packages...
   1162.4 ms  ✓ OpenCL_Headers_jll
   1158.4 ms  ✓ SPIRV_LLVM_Translator_jll
   1239.0 ms  ✓ SPIRV_Tools_jll
   1601.0 ms  ✓ LLD_unified_jll
   1141.0 ms  ✓ SPIRV_LLVM_Backend_jll
   1259.6 ms  ✓ OpenCL_jll
   1227.6 ms  ✓ Clang_unified_jll
   8774.7 ms  ✓ SPIRVIntrinsics
  13240.6 ms  ✓ GPUArrays
   1283.7 ms  ✓ pocl_jll
   8569.3 ms  ✓ SPIRVIntrinsics → SPIRVIntrinsicsSIMDExt
  11035.1 ms  ✓ OpenCL
   6741.7 ms  ✓ Atomix → AtomixOpenCLExt
  13 dependencies successfully precompiled in 60 seconds. 79 already precompiled.
Precompilation completed after 74.78s

################################################################################
# Testing
#

     Testing SIMD
      Status `/tmp/jl_TZPl4i/Project.toml`
  [63c18a36] KernelAbstractions v0.9.39
  [08131aa3] OpenCL v0.10.8
  [fdea26ae] SIMD v3.7.2
  [627d6b7a] pocl_jll v7.1.0+1
  [b77e0a4c] InteractiveUtils v1.11.0
  [8dfed614] Test v1.11.0
      Status `/tmp/jl_TZPl4i/Manifest.toml`
  [79e6a3ab] Adapt v4.4.0
  [a9b6321e] Atomix v1.1.2
  [fa961155] CEnum v0.5.0
  [ffbed154] DocStringExtensions v0.9.5
  [e2ba6199] ExprTools v0.1.10
  [0c68f7d7] GPUArrays v11.3.4
  [46192b85] GPUArraysCore v0.2.0
  [61eb1bfa] GPUCompiler v1.8.1
  [096a3bc2] GPUToolbox v1.0.0
  [076d061b] HashArrayMappedTries v0.2.0
  [92d709cd] IrrationalConstants v0.2.6
  [692b3bcd] JLLWrappers v1.7.1
  [63c18a36] KernelAbstractions v0.9.39
  [929cbde3] LLVM v9.4.4
  [2ab3a3ac] LogExpFunctions v0.3.29
  [1914dd2f] MacroTools v0.5.16
  [08131aa3] OpenCL v0.10.8
  [aea7be01] PrecompileTools v1.3.3
  [21216c6a] Preferences v1.5.1
  [74087812] Random123 v1.7.1
  [e6cf234a] RandomNumbers v1.6.0
  [189a3867] Reexport v1.2.2
  [ae029012] Requires v1.3.1
  [fdea26ae] SIMD v3.7.2
  [71d1d633] SPIRVIntrinsics v0.5.6
  [7e506255] ScopedValues v1.5.0
  [6c6a2e73] Scratch v1.3.0
  [276daf66] SpecialFunctions v2.6.1
  [90137ffa] StaticArrays v1.9.16
  [1e83bf80] StaticArraysCore v1.4.4
  [10745b16] Statistics v1.11.1
  [e689c965] Tracy v0.1.6
  [013be700] UnsafeAtomics v0.3.0
  [ffc816e1] Clang_unified_jll v0.1.1+0
  [e33a78d0] Hwloc_jll v2.12.2+0
  [fbc507ec] LLD_unified_jll v0.1.0+4
  [dad2f222] LLVMExtra_jll v0.0.38+0
  [ad6e5548] LibTracyClient_jll v0.13.1+0
  [94ce4f54] Libiconv_jll v1.18.0+0
  [a7aa756b] OpenCL_Headers_jll v2025.6.13+0
  [6cb37087] OpenCL_jll v2024.10.24+1
  [efe28fd5] OpenSpecFun_jll v0.5.6+0
⌅ [4376b9bf] SPIRV_LLVM_Backend_jll v20.1.5+3
⌅ [4a5d46fc] SPIRV_LLVM_Translator_jll v20.1.0+6
  [6ac6d60f] SPIRV_Tools_jll v2025.4.0+0
⌅ [02c8fc9c] XML2_jll v2.13.9+0
  [a65dc6b1] Xorg_libpciaccess_jll v0.18.1+0
  [627d6b7a] pocl_jll v7.1.0+1
  [0dad84c5] ArgTools v1.1.2
  [56f22d72] Artifacts v1.11.0
  [2a0f44e3] Base64 v1.11.0
  [ade2ca70] Dates v1.11.0
  [f43a241f] Downloads v1.7.0
  [7b1f6079] FileWatching v1.11.0
  [b77e0a4c] InteractiveUtils v1.11.0
  [ac6e5ff7] JuliaSyntaxHighlighting v1.13.0
  [4af54fe1] LazyArtifacts v1.11.0
  [b27032c2] LibCURL v1.0.0
  [76f85450] LibGit2 v1.11.0
  [8f399da3] Libdl v1.11.0
  [37e2e46d] LinearAlgebra v1.13.0
  [56ddb016] Logging v1.11.0
  [d6f4376e] Markdown v1.11.0
  [ca575930] NetworkOptions v1.3.0
  [44cfe95a] Pkg v1.14.0
  [de0858da] Printf v1.11.0
  [9a3f8284] Random v1.11.0
  [ea8e919c] SHA v1.0.0
  [9e88b42a] Serialization v1.11.0
  [2f01184e] SparseArrays v1.13.0
  [f489334b] StyledStrings v1.13.0
  [fa267f1f] TOML v1.0.3
  [a4e569a6] Tar v1.10.0
  [8dfed614] Test v1.11.0
  [cf7118a7] UUIDs v1.11.0
  [4ec0a83e] Unicode v1.11.0
  [e66e0078] CompilerSupportLibraries_jll v1.3.0+1
  [deac9b47] LibCURL_jll v8.18.0+0
  [e37daf67] LibGit2_jll v1.9.2+0
  [29816b5a] LibSSH2_jll v1.11.3+1
  [14a3606d] MozillaCACerts_jll v2025.12.2
  [4536629a] OpenBLAS_jll v0.3.29+0
  [05823500] OpenLibm_jll v0.8.7+0
  [458c3c95] OpenSSL_jll v3.5.4+0
  [efcefdf7] PCRE2_jll v10.47.0+0
  [bea87d4a] SuiteSparse_jll v7.10.1+0
  [83775a58] Zlib_jll v1.3.1+2
  [3161d3a3] Zstd_jll v1.5.7+1
  [8f36deef] libLLVM_jll v20.1.8+0
  [8e850b90] libblastrampoline_jll v5.15.0+0
  [8e850ede] nghttp2_jll v1.68.0+1
  [3f19e933] p7zip_jll v17.7.0+0
        Info Packages marked with ⌅ have new versions available but compatibility constraints restrict them from upgrading.
     Testing Running tests...
Test Summary:   | Pass  Total  Time
Type properties |   10     10  0.5s
Test Summary: | Pass  Total  Time
Errors        |    2      2  0.8s
Test Summary:   | Pass  Total  Time
Type conversion |    8      8  0.5s
Test Summary:                   | Pass  Total  Time
Conversion and reinterpretation |    4      4  0.6s
Test Summary:                | Pass  Total  Time
convert(target_type, source) |  144    144  8.6s
Test Summary:       | Pass  Total  Time
Element-wise access |   22     22  0.3s
Test Summary:                | Pass  Total  Time
Integer arithmetic functions |  116    116  3.0s
Test Summary: | Pass  Total  Time
bswap         |    6      6  0.3s
Test Summary: | Pass  Total  Time
saturation    |    4      4  0.2s
Test Summary:       | Pass  Total  Time
overflow arithmetic |   24     24  3.0s
Test Summary:                       | Pass  Total   Time
Floating point arithmetic functions |  106    106  23.3s
Test Summary:  | Pass  Total  Time
Type promotion |   88     88  1.1s
Test Summary:        | Pass  Total  Time
Reduction operations |   21     21  1.1s
Test Summary:            | Pass  Total  Time
Load and store functions |   66     66  1.1s
Test Summary:                | Pass  Total  Time
Load and store with pointers |  168    168  2.7s
Test Summary: | Pass  Total  Time
fastmath      |    6      6  0.2s
Test Summary:               | Pass  Total  Time
Gather and scatter function |  174    174  5.3s
Test Summary: | Pass  Total  Time
expandload    |   10     10  0.1s
Test Summary: | Pass  Total  Time
compressstore |   10     10  0.4s
Test Summary:          | Pass  Total  Time
Index-based load/store |  112    112  4.0s
; Function Signature: vsum(Base.SubArray{Float64, 1, Array{Float64, 1}, Tuple{Base.UnitRange{Int64}}, true}, Type{SIMD.Vec{4, Float64}})
;  @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:866 within `vsum`
define double @julia_vsum_18895(ptr nocapture noundef nonnull readonly align 8 dereferenceable(40) %"xs::SubArray", ptr nocapture noundef nonnull readonly align 8 dereferenceable(8) %.roots.xs) local_unnamed_addr #0 {
top:
  %gcframe1 = alloca [3 x ptr], align 16
  call void @llvm.memset.p0.i64(ptr align 16 %gcframe1, i8 0, i64 24, i1 true)
  %thread_ptr = call ptr asm "movq %fs:0, $0", "=r"() #12
  %tls_ppgcstack = getelementptr inbounds i8, ptr %thread_ptr, i64 -8
  %tls_pgcstack = load
ptr, ptr %tls_ppgcstack, align 8 store i64 4, ptr %gcframe1, align 8 %frame.prev = getelementptr inbounds nuw i8, ptr %gcframe1, i64 8 %task.gcstack = load ptr, ptr %tls_pgcstack, align 8 store ptr %task.gcstack, ptr %frame.prev, align 8 store ptr %gcframe1, ptr %tls_pgcstack, align 8 ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:867 within `vsum` ; ┌ @ essentials.jl:11 within `length` ; │┌ @ subarray.jl:65 within `size` ; ││┌ @ subarray.jl:512 within `axes` ; │││┌ @ Base_compiler.jl:57 within `getproperty` %0 = getelementptr inbounds nuw i8, ptr %"xs::SubArray", i64 8 ; │││└ ; │││┌ @ subarray.jl:517 within `_indices_sub` ; ││││┌ @ abstractarray.jl:102 within `axes` ; │││││┌ @ range.jl:692 within `size` ; ││││││┌ @ range.jl:782 within `length` ; │││││││┌ @ range.jl:863 within `last` ; ││││││││┌ @ Base_compiler.jl:57 within `getproperty` %.stop_ptr = getelementptr inbounds nuw i8, ptr %"xs::SubArray", i64 16 ; │││││││└└ ; │││││││ @ range.jl:785 within `length` ; │││││││┌ @ int.jl:86 within `-` %.stop_ptr.unbox = load i64, ptr %.stop_ptr, align 8 %.unbox = load i64, ptr %0, align 8 %1 = sub i64 %.stop_ptr.unbox, %.unbox ; │││││││└ ; │││││││┌ @ int.jl:87 within `+` %2 = add i64 %1, 1 ; └└└└└└└└ ; ┌ @ promotion.jl:637 within `==` %3 = and i64 %2, 3 %.not = icmp eq i64 %3, 0 ; └ br i1 %.not, label %L10, label %L125 L10: ; preds = %top ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:870 within `vsum` ; ┌ @ range.jl:22 within `Colon` ; │┌ @ range.jl:24 within `_colon` ; ││┌ @ range.jl:393 within `StepRange` @ range.jl:338 ; │││┌ @ range.jl:353 within `steprange_last` ; ││││┌ @ promotion.jl:637 within `==` %.not30 = icmp eq i64 %.stop_ptr.unbox, %.unbox ; ││││└ br i1 %.not30, label %L69.preheader, label %L22 L22: ; preds = %L10 ; ││││ @ range.jl:356 within `steprange_last` ; ││││┌ @ operators.jl:424 within `>` ; │││││┌ @ int.jl:83 within `<` %4 = icmp sgt i64 %2, 1 ; ││││└└ br i1 %4, label %L55, label %L122 L55: ; preds = %L22 ; ││││ @ 
range.jl:369 within `steprange_last` ; ││││┌ @ int.jl:342 within `rem` %5 = and i64 %1, 3 ; ││││└ ; ││││ @ range.jl:372 within `steprange_last` %6 = sub nsw i64 %2, %5 ; └└└└ ; ┌ @ range.jl:921 within `iterate` ; │┌ @ range.jl:694 within `isempty` ; ││┌ @ bool.jl:40 within `&` %7 = icmp slt i64 %6, 1 ; └└└ br i1 %7, label %L122, label %L69.preheader L69.preheader: ; preds = %L55, %L10 %value_phi34 = phi i64 [ %6, %L55 ], [ 1, %L10 ] %8 = load ptr, ptr %.roots.xs, align 8 %9 = getelementptr inbounds nuw i8, ptr %"xs::SubArray", i64 24 %.unbox12 = load i64, ptr %9, align 8 %10 = load ptr, ptr %8, align 8 ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:872 within `vsum` %invariant.gep = getelementptr i8, ptr %10, i64 -8 br label %L69 L69: ; preds = %L69, %L69.preheader %value_phi6 = phi i64 [ %14, %L69 ], [ 1, %L69.preheader ] %value_phi8 = phi <4 x double> [ %13, %L69 ], [ zeroinitializer, %L69.preheader ] ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:871 within `vsum` ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:307 within `getindex` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:295 within `_pointer` ; ││┌ @ subarray.jl:498 within `pointer` ; │││┌ @ int.jl:87 within `+` %11 = add i64 %.unbox12, %value_phi6 ; │││└ ; │││ @ subarray.jl:498 within `pointer` @ abstractarray.jl:1276 ; │││┌ @ abstractarray.jl:1280 within `_memory_offset` ; ││││┌ @ int.jl:88 within `*` %12 = shl i64 %11, 3 ; │││└└ ; │││┌ @ pointer.jl:314 within `+` %gep = getelementptr i8, ptr %invariant.gep, i64 %12 ; │└└└ ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 within `vload` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:51 ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:477 within `load` ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:486 within `macro expansion` %res.i = load <4 x double>, 
ptr %gep, align 8 ; └└└└ ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:264 within `+` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:227 within `fadd` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:227 ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:236 within `macro expansion` %13 = fadd <4 x double> %value_phi8, %res.i ; └└└ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:872 within `vsum` ; ┌ @ range.jl:925 within `iterate` ; │┌ @ promotion.jl:637 within `==` %.not31 = icmp eq i64 %value_phi6, %value_phi34 ; │└ %14 = add i64 %value_phi6, 4 ; └ br i1 %.not31, label %L122, label %L69 L122: ; preds = %L69, %L55, %L22 %value_phi16 = phi <4 x double> [ zeroinitializer, %L55 ], [ zeroinitializer, %L22 ], [ %13, %L69 ] ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:873 within `vsum` ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:498 within `sum` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1077 within `reduce_fadd` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1093 within `macro expansion` %res.i29 = call reassoc double @llvm.vector.reduce.fadd.v4f64(double 0.000000e+00, <4 x double> %value_phi16) %frame.prev41 = load ptr, ptr %frame.prev, align 8 store ptr %frame.prev41, ptr %tls_pgcstack, align 8 ret double %res.i29 L125: ; preds = %top ; └└└ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:867 within `vsum` %15 = call [1 x ptr] @j_AssertionError_18904(ptr nonnull @"jl_global#18905.jit") %gc_slot_addr_0 = getelementptr inbounds nuw i8, ptr %gcframe1, i64 16 %16 = extractvalue [1 x ptr] %15, 0 store ptr %16, ptr %gc_slot_addr_0, align 8 %ptls_field = getelementptr inbounds nuw i8, ptr %tls_pgcstack, i64 16 %ptls_load = load ptr, ptr %ptls_field, align 8 %"box::AssertionError" = call noalias nonnull align 8 dereferenceable(16) ptr @ijl_gc_small_alloc(ptr %ptls_load, i32 360, i32 
16, i64 140585071995952) #8 %"box::AssertionError.tag_addr" = getelementptr inbounds i8, ptr %"box::AssertionError", i64 -8 store atomic i64 140585071995952, ptr %"box::AssertionError.tag_addr" unordered, align 8 store ptr %16, ptr %"box::AssertionError", align 8 store ptr null, ptr %gc_slot_addr_0, align 8 call void @ijl_throw(ptr nonnull %"box::AssertionError") unreachable } .file "vsum" .section .ltext,"axl",@progbits .globl julia_vsum_18948 # -- Begin function julia_vsum_18948 .p2align 4 .type julia_vsum_18948,@function julia_vsum_18948: # @julia_vsum_18948 ; Function Signature: vsum(Base.SubArray{Float64, 1, Array{Float64, 1}, Tuple{Base.UnitRange{Int64}}, true}, Type{SIMD.Vec{4, Float64}}) ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:866 within `vsum` # %bb.0: # %top #DEBUG_VALUE: vsum:xs <- [$rdi+0] push rbp mov rbp, rsp push r14 push rbx sub rsp, 32 vxorpd xmm0, xmm0, xmm0 #APP mov rax, qword ptr fs:[0] #NO_APP lea rcx, [rbp - 48] vmovapd xmmword ptr [rbp - 48], xmm0 mov qword ptr [rbp - 32], 0 mov r14, qword ptr [rax - 8] mov qword ptr [rbp - 48], 4 mov rax, qword ptr [r14] mov qword ptr [rbp - 40], rax mov qword ptr [r14], rcx ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:867 within `vsum` ; │┌ @ essentials.jl:11 within `length` ; ││┌ @ subarray.jl:65 within `size` ; │││┌ @ subarray.jl:512 within `axes` ; ││││┌ @ subarray.jl:517 within `_indices_sub` ; │││││┌ @ abstractarray.jl:102 within `axes` ; ││││││┌ @ range.jl:692 within `size` ; │││││││┌ @ range.jl:785 within `length` ; ││││││││┌ @ int.jl:86 within `-` mov r9, qword ptr [rdi + 16] mov r8, qword ptr [rdi + 8] mov rcx, r9 sub rcx, r8 ; ││││││││└ ; ││││││││┌ @ int.jl:87 within `+` lea rdx, [rcx + 1] ; │└└└└└└└└ ; │┌ @ promotion.jl:637 within `==` test dl, 3 ; │└ jne .LBB0_7 # %bb.1: # %L10 mov eax, 1 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:870 within `vsum` ; │┌ @ range.jl:22 within `Colon` ; ││┌ @ range.jl:24 within `_colon` ; │││┌ @ 
range.jl:393 within `StepRange` @ range.jl:338 ; ││││┌ @ range.jl:353 within `steprange_last` ; │││││┌ @ promotion.jl:637 within `==` cmp r9, r8 ; │││││└ je .LBB0_4 # %bb.2: # %L22 vxorpd xmm0, xmm0, xmm0 ; │││││ @ range.jl:356 within `steprange_last` ; │││││┌ @ operators.jl:424 within `>` ; ││││││┌ @ int.jl:83 within `<` cmp rdx, 2 ; │││││└└ jl .LBB0_6 # %bb.3: # %L55 ; │││││ @ range.jl:369 within `steprange_last` ; │││││┌ @ int.jl:342 within `rem` and ecx, 3 ; │└└└└└ ; │┌ @ range.jl:921 within `iterate` ; ││┌ @ range.jl:694 within `isempty` ; │││┌ @ bool.jl:40 within `&` sub rdx, rcx mov rax, rdx ; │└└└ jle .LBB0_6 .LBB0_4: # %L69.preheader mov rcx, qword ptr [rdi + 24] mov rdx, qword ptr [rsi] ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:872 within `vsum` add rax, 3 vxorpd xmm0, xmm0, xmm0 shl rcx, 3 add rcx, qword ptr [rdx] xor edx, edx .p2align 4 .LBB0_5: # %L69 # =>This Inner Loop Header: Depth=1 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:871 within `vsum` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:264 within `+` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:227 within `fadd` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:227 ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:236 within `macro expansion` vaddpd ymm0, ymm0, ymmword ptr [rcx + 8*rdx] ; │└└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:872 within `vsum` ; │┌ @ range.jl:925 within `iterate` ; ││┌ @ promotion.jl:637 within `==` add rdx, 4 cmp rax, rdx ; │└└ jne .LBB0_5 .LBB0_6: # %L122 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:873 within `vsum` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:498 within `sum` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1077 within `reduce_fadd` ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1093 within `macro expansion` vextractf128 
xmm1, ymm0, 1 mov rax, qword ptr [rbp - 40] vaddpd xmm0, xmm0, xmm1 vshufpd xmm1, xmm0, xmm0, 1 # xmm1 = xmm0[1,0] mov qword ptr [r14], rax vaddsd xmm0, xmm0, xmm1 vxorpd xmm1, xmm1, xmm1 vaddsd xmm0, xmm0, xmm1 add rsp, 32 pop rbx pop r14 pop rbp vzeroupper ret .LBB0_7: # %L125 ; │└└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:867 within `vsum` movabs rdi, offset ".Ljl_global#18958.jit" movabs rax, offset j_AssertionError_18957 call rax mov qword ptr [rbp - 32], rax mov rbx, rax movabs rax, offset ijl_gc_small_alloc mov esi, 360 mov edx, 16 mov rdi, qword ptr [r14 + 16] movabs r14, 140585071995952 mov rcx, r14 call rax movabs rcx, offset ijl_throw mov rdi, rax mov qword ptr [rax - 8], r14 mov qword ptr [rax], rbx mov qword ptr [rbp - 32], 0 call rcx .Lfunc_end0: .size julia_vsum_18948, .Lfunc_end0-julia_vsum_18948 ; └ # -- End function .type ".L+Core.Float64#18950",@object # @"+Core.Float64#18950" .section .lrodata,"al",@progbits .p2align 3, 0x0 ".L+Core.Float64#18950": .quad ".L+Core.Float64#18950.jit" .size ".L+Core.Float64#18950", 8 .set ".Ljl_global#18958.jit", 140584852177424 .size ".Ljl_global#18958.jit", 8 .set ".L+Core.AssertionError#18959.jit", 140585071995952 .size ".L+Core.AssertionError#18959.jit", 8 .set ".L+Core.Float64#18950.jit", 140585143868880 .size ".L+Core.Float64#18950.jit", 8 .section ".note.GNU-stack","",@progbits ; Function Signature: vcompress!(Array{Float64, 1}, Array{Bool, 1}, Array{Float64, 1}) ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:939 within `vcompress!` define nonnull ptr @"japi1_vcompress!_19079"(ptr %"function::Core.Function", ptr noalias nocapture noundef readonly %"args::Any[]", i32 %"nargs::UInt32") local_unnamed_addr #0 { top: %gcframe2 = alloca [3 x ptr], align 16 call void @llvm.memset.p0.i64(ptr align 16 %gcframe2, i8 0, i64 24, i1 true) %stackargs = alloca ptr, align 8 store volatile ptr %"args::Any[]", ptr %stackargs, align 8 %thread_ptr = call ptr asm "movq %fs:0, $0", "=r"() 
#12 %tls_ppgcstack = getelementptr inbounds i8, ptr %thread_ptr, i64 -8 %tls_pgcstack = load ptr, ptr %tls_ppgcstack, align 8 store i64 4, ptr %gcframe2, align 8 %frame.prev = getelementptr inbounds nuw i8, ptr %gcframe2, i64 8 %task.gcstack = load ptr, ptr %tls_pgcstack, align 8 store ptr %task.gcstack, ptr %frame.prev, align 8 store ptr %gcframe2, ptr %tls_pgcstack, align 8 %0 = load ptr, ptr %"args::Any[]", align 8 %1 = getelementptr inbounds nuw i8, ptr %"args::Any[]", i64 16 %2 = load ptr, ptr %1, align 8 ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 ; ┌ @ abstractarray.jl:102 within `axes` ; │┌ @ essentials.jl:10 within `size` %.size_ptr = getelementptr inbounds nuw i8, ptr %2, i64 16 %.size.0.copyload = load i64, ptr %.size_ptr, align 8 %.size_ptr1 = getelementptr inbounds nuw i8, ptr %0, i64 16 %.size2.0.copyload = load i64, ptr %.size_ptr1, align 8 ; └└ ; ┌ @ tuple.jl:540 within `==` ; │┌ @ tuple.jl:544 within `_eq` ; ││┌ @ range.jl:1142 within `==` @ promotion.jl:637 %.not.not = icmp eq i64 %.size2.0.copyload, %.size.0.copyload ; └└└ br i1 %.not.not, label %L27, label %L255 L27: ; preds = %top %3 = getelementptr inbounds nuw i8, ptr %"args::Any[]", i64 8 %4 = load ptr, ptr %3, align 8 ; ┌ @ abstractarray.jl:102 within `axes` ; │┌ @ essentials.jl:10 within `size` %.size_ptr3 = getelementptr inbounds nuw i8, ptr %4, i64 16 %.size4.0.copyload = load i64, ptr %.size_ptr3, align 8 ; └└ ; ┌ @ tuple.jl:540 within `==` ; │┌ @ tuple.jl:544 within `_eq` ; ││┌ @ range.jl:1142 within `==` @ promotion.jl:637 %.not.not109 = icmp eq i64 %.size.0.copyload, %.size4.0.copyload ; └└└ br i1 %.not.not109, label %L29, label %L255 L29: ; preds = %L27 ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:941 ; ┌ @ int.jl:87 within `+` %5 = add i64 %.size.0.copyload, -3 ; └ ; @ 
/home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:945 ; ┌ @ int.jl:560 within `<=` %.not99102 = icmp slt i64 %5, 1 ; └ br i1 %.not99102, label %L189.preheader, label %L37 L189.preheader.loopexit: ; preds = %L37 %.size50.0.copyload.pre = load i64, ptr %.size_ptr, align 8 ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:947 ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:113 within `vstorec` br label %L189.preheader L189.preheader: ; preds = %L189.preheader.loopexit, %L29 %.size50.0.copyload = phi i64 [ %.size.0.copyload, %L29 ], [ %.size50.0.copyload.pre, %L189.preheader.loopexit ] %value_phi10.lcssa = phi i64 [ 1, %L29 ], [ %16, %L189.preheader.loopexit ] %value_phi11.lcssa = phi i64 [ 1, %L29 ], [ %17, %L189.preheader.loopexit ] ; └ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:951 ; ┌ @ int.jl:560 within `<=` %.not100106 = icmp sgt i64 %value_phi11.lcssa, %.size50.0.copyload ; └ br i1 %.not100106, label %L258, label %L209.lr.ph L209.lr.ph: ; preds = %L189.preheader %memoryref_data = load ptr, ptr %4, align 8 %memoryref_data60 = load ptr, ptr %2, align 8 %memoryref_data73 = load ptr, ptr %0, align 8 %invariant.gep = getelementptr i8, ptr %memoryref_data73, i64 -8 br label %L209 L37: ; preds = %L37, %L29 %value_phi11104 = phi i64 [ %17, %L37 ], [ 1, %L29 ] %value_phi10103 = phi i64 [ %16, %L37 ], [ 1, %L29 ] ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:946 ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:307 within `getindex` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:291 within `_pointer` ; ││┌ @ abstractarray.jl:1276 within 
`pointer` @ abstractarray.jl:1273 ; │││┌ @ pointer.jl:63 within `cconvert` %6 = load ptr, ptr %4, align 8 ; │││└ ; │││ @ abstractarray.jl:1276 within `pointer` ; │││┌ @ abstractarray.jl:1280 within `_memory_offset` ; ││││┌ @ int.jl:86 within `-` %7 = add nsw i64 %value_phi11104, -1 ; │││└└ ; │││┌ @ pointer.jl:314 within `+` %8 = getelementptr i8, ptr %6, i64 %7 ; │└└└ ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 within `vload` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:51 ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:477 within `load` ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:486 within `macro expansion` %res.i = load <4 x i8>, ptr %8, align 1 ; └└└└ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:947 ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:307 within `getindex` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:291 within `_pointer` ; ││┌ @ abstractarray.jl:1276 within `pointer` @ abstractarray.jl:1273 ; │││┌ @ pointer.jl:63 within `cconvert` %9 = load ptr, ptr %2, align 8 ; │││└ ; │││ @ abstractarray.jl:1276 within `pointer` ; │││┌ @ abstractarray.jl:1280 within `_memory_offset` ; ││││┌ @ int.jl:88 within `*` %10 = shl i64 %7, 3 ; │││└└ ; │││┌ @ pointer.jl:314 within `+` %11 = getelementptr i8, ptr %9, i64 %10 ; │└└└ ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 within `vload` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:51 ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:477 within `load` ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:486 within `macro expansion` %res.i96 = load <4 x double>, ptr %11, align 8 ; └└└└ ; ┌ @ 
/home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:112 within `vstorec` ; │┌ @ abstractarray.jl:1276 within `pointer` @ abstractarray.jl:1273 ; ││┌ @ pointer.jl:63 within `cconvert` %12 = load ptr, ptr %0, align 8 ; ││└ ; ││ @ abstractarray.jl:1276 within `pointer` ; ││┌ @ abstractarray.jl:1280 within `_memory_offset` ; │││┌ @ int.jl:88 within `*` %13 = shl i64 %value_phi10103, 3 ; ││└└ ; ││┌ @ pointer.jl:314 within `+` %14 = getelementptr i8, ptr %12, i64 %13 %15 = getelementptr i8, ptr %14, i64 -8 ; │└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:113 within `vstorec` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:105 ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:663 within `maskedcompressstore` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:680 within `macro expansion` %mask.i = trunc <4 x i8> %res.i to <4 x i1> call void @llvm.masked.compressstore.v4f64(<4 x double> %res.i96, ptr %15, <4 x i1> %mask.i) ; └└└ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:948 ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:496 within `sum` ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:486 within `reduce` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1099 within `reduce_add` ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1129 within `macro expansion` %maski.i = bitcast <4 x i1> %mask.i to i4 %maskipopcnt.i = call range(i4 0, 5) i4 @llvm.ctpop.i4(i4 %maski.i) %res.i98 = zext nneg i4 %maskipopcnt.i to i64 ; └└└└ ; ┌ @ int.jl:87 within `+` %16 = add i64 %value_phi10103, %res.i98 ; └ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:949 ; ┌ @ int.jl:87 within `+` %17 = add i64 %value_phi11104, 4 ; └ ; @ 
/home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:945 ; ┌ @ int.jl:560 within `<=` %.not99 = icmp sgt i64 %17, %5 ; └ br i1 %.not99, label %L189.preheader.loopexit, label %L37 L209: ; preds = %L251, %L209.lr.ph %value_phi48108 = phi i64 [ %value_phi11.lcssa, %L209.lr.ph ], [ %22, %L251 ] %value_phi47107 = phi i64 [ %value_phi10.lcssa, %L209.lr.ph ], [ %value_phi82, %L251 ] ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:952 ; ┌ @ essentials.jl:965 within `getindex` %memoryref_offset = add i64 %value_phi48108, -1 %memoryref_data55 = getelementptr inbounds i8, ptr %memoryref_data, i64 %memoryref_offset %18 = load i8, ptr %memoryref_data55, align 1 %19 = trunc i8 %18 to i1 ; └ br i1 %19, label %L228, label %L251 L228: ; preds = %L209 ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:953 ; ┌ @ essentials.jl:965 within `getindex` %memoryref_byteoffset63 = shl i64 %memoryref_offset, 3 %memoryref_data68 = getelementptr inbounds i8, ptr %memoryref_data60, i64 %memoryref_byteoffset63 %20 = load double, ptr %memoryref_data68, align 8 ; └ ; ┌ @ array.jl:1020 within `setindex!` ; │┌ @ array.jl:1025 within `_setindex!` %memoryref_offset75 = shl i64 %value_phi47107, 3 %gep = getelementptr i8, ptr %invariant.gep, i64 %memoryref_offset75 store double %20, ptr %gep, align 8 ; └└ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:954 ; ┌ @ int.jl:87 within `+` %21 = add i64 %value_phi47107, 1 br label %L251 L251: ; preds = %L228, %L209 %value_phi82 = phi i64 [ %21, %L228 ], [ %value_phi47107, %L209 ] ; └ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ 
/home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:956 ; ┌ @ int.jl:87 within `+` %22 = add i64 %value_phi48108, 1 ; └ ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:951 ; ┌ @ int.jl:560 within `<=` %.not100 = icmp sgt i64 %22, %.size50.0.copyload ; └ br i1 %.not100, label %L258, label %L209 L255: ; preds = %L27, %top ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 %23 = call [1 x ptr] @j_AssertionError_19093(ptr nonnull @"jl_global#19094.jit") %gc_slot_addr_0 = getelementptr inbounds nuw i8, ptr %gcframe2, i64 16 %24 = extractvalue [1 x ptr] %23, 0 store ptr %24, ptr %gc_slot_addr_0, align 8 %ptls_field = getelementptr inbounds nuw i8, ptr %tls_pgcstack, i64 16 %ptls_load = load ptr, ptr %ptls_field, align 8 %"box::AssertionError" = call noalias nonnull align 8 dereferenceable(16) ptr @ijl_gc_small_alloc(ptr %ptls_load, i32 360, i32 16, i64 140585071995952) #8 %"box::AssertionError.tag_addr" = getelementptr inbounds i8, ptr %"box::AssertionError", i64 -8 store atomic i64 140585071995952, ptr %"box::AssertionError.tag_addr" unordered, align 8 store ptr %24, ptr %"box::AssertionError", align 8 store ptr null, ptr %gc_slot_addr_0, align 8 call void @ijl_throw(ptr nonnull %"box::AssertionError") unreachable L258: ; preds = %L251, %L189.preheader %frame.prev122 = load ptr, ptr %frame.prev, align 8 store ptr %frame.prev122, ptr %tls_pgcstack, align 8 ; @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:958 ret ptr %0 } .file "vcompress!" 
.section .ltext,"axl",@progbits .globl "japi1_vcompress!_19096" # -- Begin function japi1_vcompress!_19096 .p2align 4 .type "japi1_vcompress!_19096",@function "japi1_vcompress!_19096": # @"japi1_vcompress!_19096" ; Function Signature: vcompress!(Array{Float64, 1}, Array{Bool, 1}, Array{Float64, 1}) ; ┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:939 within `vcompress!` # %bb.0: # %top #DEBUG_VALUE: vcompress!:dest <- undef #DEBUG_VALUE: vcompress!:src <- undef push rbp mov rbp, rsp push r14 push rbx sub rsp, 48 vxorps xmm0, xmm0, xmm0 #APP mov rax, qword ptr fs:[0] #NO_APP lea rcx, [rbp - 48] vmovaps xmmword ptr [rbp - 48], xmm0 mov qword ptr [rbp - 32], 0 mov qword ptr [rbp - 56], rsi mov r14, qword ptr [rax - 8] mov qword ptr [rbp - 48], 4 mov rax, qword ptr [r14] mov qword ptr [rbp - 40], rax mov qword ptr [r14], rcx mov rdi, qword ptr [rsi + 16] mov rax, qword ptr [rsi] ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 ; │┌ @ abstractarray.jl:102 within `axes` ; ││┌ @ essentials.jl:10 within `size` mov r9, qword ptr [rdi + 16] ; │└└ ; │┌ @ tuple.jl:540 within `==` ; ││┌ @ tuple.jl:544 within `_eq` ; │││┌ @ range.jl:1142 within `==` @ promotion.jl:637 cmp qword ptr [rax + 16], r9 ; │└└└ jne .LBB0_21 # %bb.1: # %L27 mov rsi, qword ptr [rsi + 8] ; │┌ @ tuple.jl:540 within `==` ; ││┌ @ tuple.jl:544 within `_eq` ; │││┌ @ range.jl:1142 within `==` @ promotion.jl:637 cmp r9, qword ptr [rsi + 16] ; │└└└ jne .LBB0_21 # %bb.2: # %L29 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:941 ; │┌ @ int.jl:87 within `+` lea r8, [r9 - 3] ; │└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:945 ; │┌ @ int.jl:560 within `<=` test r8, r8 ; │└ jle .LBB0_3 # %bb.11: # 
%L37.preheader mov edx, 1 mov ecx, 1 jmp .LBB0_12 .p2align 4 .LBB0_20: # %else17 # in Loop: Header=BB0_12 Depth=1 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:948 ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:496 within `sum` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/simdvec.jl:486 within `reduce` ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1099 within `reduce_add` ; ││││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:1129 within `macro expansion` popcnt r9d, r9d ; │└└└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:949 ; │┌ @ int.jl:87 within `+` add rdx, 4 ; │└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:948 ; │┌ @ int.jl:87 within `+` add rcx, r9 ; │└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:945 ; │┌ @ int.jl:560 within `<=` cmp rdx, r8 ; │└ jg .LBB0_4 .LBB0_12: # %L37 # =>This Inner Loop Header: Depth=1 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:946 ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:307 within `getindex` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:291 within `_pointer` ; │││┌ @ abstractarray.jl:1276 within `pointer` @ abstractarray.jl:1273 ; ││││┌ @ pointer.jl:63 within `cconvert` mov r9, qword ptr [rsi] ; │└└└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:947 ; │┌ @ 
/home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:307 within `getindex` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:291 within `_pointer` ; │││┌ @ abstractarray.jl:1276 within `pointer` @ abstractarray.jl:1273 ; ││││┌ @ pointer.jl:63 within `cconvert` mov r10, qword ptr [rdi] ; │└└└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:946 ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:307 within `getindex` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 within `vload` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:51 ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:477 within `load` ; ││││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:486 within `macro expansion` vpmovzxbd xmm1, dword ptr [r9 + rdx - 1] # xmm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero,mem[2],zero,zero,zero,mem[3],zero,zero,zero ; │└└└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:947 ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:307 within `getindex` ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 within `vload` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:50 @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:51 ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:477 within `load` ; ││││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:486 within `macro expansion` vmovups ymm0, ymmword ptr [r10 + 8*rdx - 8] ; │└└└└ ; │┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:112 within `vstorec` ; ││┌ @ abstractarray.jl:1276 within `pointer` @ abstractarray.jl:1273 ; │││┌ @ pointer.jl:63 within `cconvert` mov r9, qword 
ptr [rax] ; │││└ ; │││ @ abstractarray.jl:1276 within `pointer` ; │││┌ @ pointer.jl:314 within `+` lea r10, [r9 + 8*rcx - 8] ; ││└└ ; ││ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:113 within `vstorec` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/arrayops.jl:105 ; ││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:663 within `maskedcompressstore` ; │││┌ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/src/LLVM_intrinsics.jl:680 within `macro expansion` vpslld xmm1, xmm1, 31 vmovmskps r9d, xmm1 test r9b, 1 je .LBB0_14 # %bb.13: # %cond.store # in Loop: Header=BB0_12 Depth=1 vmovlps qword ptr [r10], xmm0 add r10, 8 .LBB0_14: # %else # in Loop: Header=BB0_12 Depth=1 test r9b, 2 je .LBB0_16 # %bb.15: # %cond.store10 # in Loop: Header=BB0_12 Depth=1 vmovhps qword ptr [r10], xmm0 add r10, 8 .LBB0_16: # %else11 # in Loop: Header=BB0_12 Depth=1 vextractf128 xmm0, ymm0, 1 test r9b, 4 je .LBB0_18 # %bb.17: # %cond.store13 # in Loop: Header=BB0_12 Depth=1 vmovlps qword ptr [r10], xmm0 add r10, 8 .LBB0_18: # %else14 # in Loop: Header=BB0_12 Depth=1 test r9b, 8 je .LBB0_20 # %bb.19: # %cond.store16 # in Loop: Header=BB0_12 Depth=1 vmovhps qword ptr [r10], xmm0 jmp .LBB0_20 .LBB0_4: # %L189.preheader.loopexit mov r9, qword ptr [rdi + 16] ; │└└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:951 ; │┌ @ int.jl:560 within `<=` cmp rdx, r9 ; │└ jg .LBB0_10 jmp .LBB0_6 .LBB0_3: mov ecx, 1 mov edx, 1 ; │┌ @ int.jl:560 within `<=` cmp rdx, r9 ; │└ jle .LBB0_6 .LBB0_10: # %L258 mov rcx, qword ptr [rbp - 40] mov qword ptr [r14], rcx ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:958 add rsp, 48 pop rbx pop r14 pop rbp vzeroupper ret .LBB0_6: # %L209.lr.ph mov rsi, qword ptr [rsi] mov rdi, qword ptr [rdi] mov r8, qword ptr [rax] jmp .LBB0_7 .p2align 4 .LBB0_9: 
# %L251 # in Loop: Header=BB0_7 Depth=1 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:956 ; │┌ @ int.jl:87 within `+` inc rdx ; │└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:951 ; │┌ @ int.jl:560 within `<=` cmp rdx, r9 ; │└ jg .LBB0_10 .LBB0_7: # %L209 # =>This Inner Loop Header: Depth=1 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:952 test byte ptr [rsi + rdx - 1], 1 je .LBB0_9 # %bb.8: # %L228 # in Loop: Header=BB0_7 Depth=1 ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:953 ; │┌ @ essentials.jl:965 within `getindex` vmovsd xmm0, qword ptr [rdi + 8*rdx - 8] # xmm0 = mem[0],zero ; │└ ; │┌ @ array.jl:1020 within `setindex!` ; ││┌ @ array.jl:1025 within `_setindex!` vmovsd qword ptr [r8 + 8*rcx - 8], xmm0 ; │└└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:954 ; │┌ @ int.jl:87 within `+` inc rcx jmp .LBB0_9 .LBB0_21: # %L255 ; │└ ; │ @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 within `vcompress!` @ /home/pkgeval/.julia/packages/SIMD/UiGbs/test/runtests.jl:940 movabs rdi, offset ".Ljl_global#19111.jit" movabs rax, offset j_AssertionError_19110 call rax mov qword ptr [rbp - 32], rax mov rbx, rax movabs rax, offset ijl_gc_small_alloc mov esi, 360 mov edx, 16 mov rdi, qword ptr [r14 + 16] movabs r14, 140585071995952 mov rcx, r14 call rax movabs rcx, offset ijl_throw mov rdi, rax mov qword ptr [rax - 8], r14 mov qword ptr [rax], rbx mov qword ptr [rbp - 32], 0 call rcx .Lfunc_end0: .size "japi1_vcompress!_19096", 
.Lfunc_end0-"japi1_vcompress!_19096"
; └
                                        # -- End function
        .set    ".Ljl_global#19111.jit", 140584955091216
        .size   ".Ljl_global#19111.jit", 8
        .set    ".L+Core.AssertionError#19112.jit", 140585071995952
        .size   ".L+Core.AssertionError#19112.jit", 8
        .section        ".note.GNU-stack","",@progbits
Test Summary:       | Pass  Total  Time
Real-world examples |   17     17  1.5s
Test Summary:   | Pass  Total  Time
Vector shuffles |  222    222  1.5s
Test Summary:                           | Pass  Total  Time
Contiguous ReinterpretArrays load/store |    6      6  0.7s
Test Summary: | Pass  Total  Time
funnel shift  |    6      6  0.1s
Test Summary: | Pass  Total  Time
fastmath min  |    2      2  0.0s
Test Summary: | Pass  Total  Time
bitmask       |   10     10  0.7s
julia: /source/src/gf.c:2882: jl_method_table_activate: Assertion `(__extension__ ({ __auto_type __atomic_load_ptr = (&method->dispatch_status); __typeof__ (*__atomic_load_ptr) __atomic_load_tmp; __atomic_load (__atomic_load_ptr, &__atomic_load_tmp, (memory_order_relaxed)); __atomic_load_tmp; }) & 0b0001) == 0' failed.

[88] signal 6 (-6): Aborted
in expression starting at /home/pkgeval/.julia/packages/SIMD/UiGbs/test/opencl.jl:1
unknown function (ip: 0x7fdc958ebebc) at /lib/x86_64-linux-gnu/libc.so.6
gsignal at /lib/x86_64-linux-gnu/libc.so.6 (unknown line)
abort at /lib/x86_64-linux-gnu/libc.so.6 (unknown line)
unknown function (ip: 0x7fdc95887394) at /lib/x86_64-linux-gnu/libc.so.6
__assert_fail at /lib/x86_64-linux-gnu/libc.so.6 (unknown line)
jl_method_table_activate at /source/src/gf.c:2882
jl_activate_methods at /source/src/staticdata_utils.c:915 [inlined]
jl_restore_package_image_from_stream at /source/src/staticdata.c:4410
ijl_restore_incremental at /source/src/staticdata.c:4462
#_include_from_serialized#884 at ./loading.jl:1468
_include_from_serialized at ./loading.jl:1440 [inlined]
_tryrequire_from_serialized at ./loading.jl:2203
__require_prelocked at ./loading.jl:2947
_require_prelocked at ./loading.jl:2725
_require_prelocked at ./loading.jl:2719 [inlined]
run_extension_callbacks at ./loading.jl:1771
run_extension_callbacks at ./loading.jl:1808
run_package_callbacks at ./loading.jl:1624
_tryrequire_from_serialized at ./loading.jl:2158
_tryrequire_from_serialized at ./loading.jl:2196
__require_prelocked at ./loading.jl:2947
_require_prelocked at ./loading.jl:2725
macro expansion at ./loading.jl:2653 [inlined]
macro expansion at ./lock.jl:376 [inlined]
__require at ./loading.jl:2617
jfptr___require_75046.1 at /opt/julia/lib/julia/sys.so (unknown line)
_jl_invoke at /source/src/gf.c:4120 [inlined]
ijl_apply_generic at /source/src/gf.c:4317
jl_apply at /source/src/julia.h:2285 [inlined]
jl_f_invoke_in_world at /source/src/builtins.c:908
require at ./loading.jl:2593 [inlined]
eval_import_path at ./module.jl:36 [inlined]
eval_import_path_all at ./module.jl:60
_eval_using at ./module.jl:137 [inlined]
_eval_using at ./module.jl:137
jfptr__eval_using_47080.1 at /opt/julia/lib/julia/sys.so (unknown line)
_jl_invoke at /source/src/gf.c:4120 [inlined]
ijl_apply_generic at /source/src/gf.c:4317
jl_apply at /source/src/julia.h:2285 [inlined]
do_call at /source/src/interpreter.c:123
eval_value at /source/src/interpreter.c:243
eval_stmt_value at /source/src/interpreter.c:194 [inlined]
eval_body at /source/src/interpreter.c:693
jl_interpret_toplevel_thunk at /source/src/interpreter.c:884
ijl_eval_thunk at /source/src/toplevel.c:765
jl_toplevel_eval_flex at /source/src/toplevel.c:712
jl_eval_toplevel_stmts at /source/src/toplevel.c:602
jl_toplevel_eval_flex at /source/src/toplevel.c:684
ijl_toplevel_eval at /source/src/toplevel.c:779
ijl_toplevel_eval_in at /source/src/toplevel.c:824
eval at ./boot.jl:489
include_string at ./loading.jl:3150
_jl_invoke at /source/src/gf.c:4120 [inlined]
ijl_apply_generic at /source/src/gf.c:4317
_include at ./loading.jl:3210
include at ./Base.jl:310
IncludeInto at ./Base.jl:311
jfptr_IncludeInto_57323.1 at /opt/julia/lib/julia/sys.so (unknown line)
_jl_invoke at /source/src/gf.c:4120 [inlined]
ijl_apply_generic at /source/src/gf.c:4317
jl_apply
at /source/src/julia.h:2285 [inlined]
do_call at /source/src/interpreter.c:123
eval_value at /source/src/interpreter.c:243
eval_stmt_value at /source/src/interpreter.c:194 [inlined]
eval_body at /source/src/interpreter.c:693
jl_interpret_toplevel_thunk at /source/src/interpreter.c:884
ijl_eval_thunk at /source/src/toplevel.c:765
jl_toplevel_eval_flex at /source/src/toplevel.c:712
jl_eval_toplevel_stmts at /source/src/toplevel.c:602
jl_toplevel_eval_flex at /source/src/toplevel.c:684
ijl_toplevel_eval at /source/src/toplevel.c:779
ijl_toplevel_eval_in at /source/src/toplevel.c:824
eval at ./boot.jl:489
include_string at ./loading.jl:3150
_jl_invoke at /source/src/gf.c:4120 [inlined]
ijl_apply_generic at /source/src/gf.c:4317
_include at ./loading.jl:3210
include at ./Base.jl:310
IncludeInto at ./Base.jl:311
jfptr_IncludeInto_57323.1 at /opt/julia/lib/julia/sys.so (unknown line)
_jl_invoke at /source/src/gf.c:4120 [inlined]
ijl_apply_generic at /source/src/gf.c:4317
jl_apply at /source/src/julia.h:2285 [inlined]
do_call at /source/src/interpreter.c:123
eval_value at /source/src/interpreter.c:243
eval_stmt_value at /source/src/interpreter.c:194 [inlined]
eval_body at /source/src/interpreter.c:693
jl_interpret_toplevel_thunk at /source/src/interpreter.c:884
ijl_eval_thunk at /source/src/toplevel.c:765
jl_toplevel_eval_flex at /source/src/toplevel.c:712
jl_eval_toplevel_stmts at /source/src/toplevel.c:602
jl_toplevel_eval_flex at /source/src/toplevel.c:684
ijl_toplevel_eval at /source/src/toplevel.c:779
ijl_toplevel_eval_in at /source/src/toplevel.c:824
eval at ./boot.jl:489
exec_options at ./client.jl:310
_start at ./client.jl:585
jfptr__start_73298.1 at /opt/julia/lib/julia/sys.so (unknown line)
_jl_invoke at /source/src/gf.c:4120 [inlined]
ijl_apply_generic at /source/src/gf.c:4317
jl_apply at /source/src/julia.h:2285 [inlined]
true_main at /source/src/jlapi.c:971
jl_repl_entrypoint at /source/src/jlapi.c:1138
main at /source/cli/loader_exe.c:58
unknown function (ip:
0x7fdc95888249) at /lib/x86_64-linux-gnu/libc.so.6
__libc_start_main at /lib/x86_64-linux-gnu/libc.so.6 (unknown line)
unknown function (ip: 0x4010b8) at /workspace/srcdir/glibc-2.17/csu/../sysdeps/x86_64/start.S
Allocations: 49635292 (Pool: 49634461; Big: 831); GC: 26
Testing failed after 238.47s

ERROR: LoadError: Package SIMD errored during testing (received signal: 6)
Stacktrace:
  [1] pkgerror(msg::String)
    @ Pkg.Types /opt/julia/share/julia/stdlib/v1.14/Pkg/src/Types.jl:68
  [2] test(ctx::Pkg.Types.Context, pkgs::Vector{PackageSpec}; coverage::Bool, julia_args::Cmd, test_args::Cmd, test_fn::Nothing, force_latest_compatible_version::Bool, allow_earlier_backwards_compatible_versions::Bool, allow_reresolve::Bool)
    @ Pkg.Operations /opt/julia/share/julia/stdlib/v1.14/Pkg/src/Operations.jl:3122
  [3] test
    @ /opt/julia/share/julia/stdlib/v1.14/Pkg/src/Operations.jl:2987 [inlined]
  [4] test(ctx::Pkg.Types.Context, pkgs::Vector{PackageSpec}; coverage::Bool, test_fn::Nothing, julia_args::Cmd, test_args::Cmd, force_latest_compatible_version::Bool, allow_earlier_backwards_compatible_versions::Bool, allow_reresolve::Bool, kwargs::@Kwargs{io::IOContext{IO}})
    @ Pkg.API /opt/julia/share/julia/stdlib/v1.14/Pkg/src/API.jl:572
  [5] kwcall(::@NamedTuple{julia_args::Cmd, io::IOContext{IO}}, ::typeof(Pkg.API.test), ctx::Pkg.Types.Context, pkgs::Vector{PackageSpec})
    @ Pkg.API /opt/julia/share/julia/stdlib/v1.14/Pkg/src/API.jl:548
  [6] test(pkgs::Vector{PackageSpec}; io::IOContext{IO}, kwargs::@Kwargs{julia_args::Cmd})
    @ Pkg.API /opt/julia/share/julia/stdlib/v1.14/Pkg/src/API.jl:172
  [7] kwcall(::@NamedTuple{julia_args::Cmd}, ::typeof(Pkg.API.test), pkgs::Vector{PackageSpec})
    @ Pkg.API /opt/julia/share/julia/stdlib/v1.14/Pkg/src/API.jl:161
  [8] test(pkgs::Vector{String}; kwargs::@Kwargs{julia_args::Cmd})
    @ Pkg.API /opt/julia/share/julia/stdlib/v1.14/Pkg/src/API.jl:160
  [9] test
    @ /opt/julia/share/julia/stdlib/v1.14/Pkg/src/API.jl:160 [inlined]
 [10] kwcall(::@NamedTuple{julia_args::Cmd},
::typeof(Pkg.API.test), pkg::String)
    @ Pkg.API /opt/julia/share/julia/stdlib/v1.14/Pkg/src/API.jl:159
 [11] top-level scope
    @ /PkgEval.jl/scripts/evaluate.jl:237
 [12] include(mod::Module, _path::String)
    @ Base ./Base.jl:309
 [13] exec_options(opts::Base.JLOptions)
    @ Base ./client.jl:344
 [14] _start()
    @ Base ./client.jl:585
in expression starting at /PkgEval.jl/scripts/evaluate.jl:228
PkgEval crashed after 355.5s: the process was aborted