Package evaluation of TextSearch on Julia 1.10.8 (92f03a4775*) started at 2025-02-25T12:01:35.568 ################################################################################ # Set-up # Installing PkgEval dependencies (TestEnv)... Set-up completed after 5.02s ################################################################################ # Installation # Installing TextSearch... Resolving package versions... Updating `~/.julia/environments/v1.10/Project.toml` [7f6f6c8a] + TextSearch v0.18.0 Updating `~/.julia/environments/v1.10/Manifest.toml` [79e6a3ab] + Adapt v4.2.0 [4fba245c] + ArrayInterface v7.18.0 [62783981] + BitTwiddlingConvenienceFunctions v0.1.6 [2a0fbf3d] + CPUSummary v0.2.6 [324d7699] + CategoricalArrays v0.10.8 [fb6a15b2] + CloseOpenIntervals v0.1.13 [f70d9fcc] + CommonWorldInvalidations v1.0.0 [34da2185] + Compat v4.16.0 [adafc99b] + CpuId v0.3.1 [a8cc5b0e] + Crayons v4.1.1 [9a962f9c] + DataAPI v1.16.0 [a93c6f00] + DataFrames v1.7.0 [864edb3b] + DataStructures v0.18.20 [e2d170a0] + DataValueInterfaces v1.0.0 [8bb1440f] + DelimitedFiles v1.9.1 [b4f34e82] + Distances v0.10.12 [ffbed154] + DocStringExtensions v0.9.3 [5789e2e9] + FileIO v1.16.6 [3e5b6fbb] + HostCPUFeatures v0.1.17 [615f187c] + IfElse v0.1.1 [842dd82b] + InlineStrings v1.4.3 [6d0fbc77] + Intersections v0.4.0 [b20bd276] + InvertedFiles v0.7.1 [41ab1584] + InvertedIndices v1.3.1 [92d709cd] + IrrationalConstants v0.2.4 [82899510] + IteratorInterfaceExtensions v1.0.0 ⌅ [033835bb] + JLD2 v0.4.54 [5d8de97f] + KCenters v0.9.0 [b964fa9f] + LaTeXStrings v1.4.0 [10f19ff3] + LayoutPointers v0.1.17 ⌅ [7f8f8fb0] + LearnBase v0.3.0 [2ab3a3ac] + LogExpFunctions v0.3.29 [bdcacae8] + LoopVectorization v0.12.171 ⌃ [9920b226] + MLDataPattern v0.5.4 [cc2ba9b6] + MLDataUtils v0.5.4 [66a33bbf] + MLLabelUtils v0.5.7 [1914dd2f] + MacroTools v0.5.15 [d125e4d3] + ManualMemory v0.1.8 [dbb5928d] + MappedArrays v0.4.2 [e1d29d7a] + Missings v1.2.0 [6fe1bfb0] + OffsetArrays v1.15.0 [bac558e1] + OrderedCollections v1.8.0 [d96e819e] + Parameters v0.12.3 [f517fe37] + Polyester v0.7.16 [1d0040c9] + PolyesterWeave v0.2.2 [2dfb63ee] + PooledArrays v1.4.3 [aea7be01] + PrecompileTools v1.2.1 [21216c6a] + Preferences v1.4.3 [08abe8d2] + PrettyTables v2.4.0 [189a3867] + Reexport v1.2.2 [ae029012] + Requires v1.3.0 [94e857df] + SIMDTypes v0.1.0 [476501e8] + SLEEFPirates v0.6.43 ⌅ [0e966ebe] + SearchModels v0.3.3 [91c51154] + SentinelArrays v1.4.8 [053f045d] + SimilaritySearch v0.11.10 [a2af1166] + SortingAlgorithms v1.2.1 [aedffcd0] + Static v1.1.1 [0d7ed370] + StaticArrayInterface v1.8.0 [82ae8749] + StatsAPI v1.7.0 ⌅ [2913bbd2] + StatsBase v0.33.21 [7792a7ef] + StrideArraysCore v0.5.7 [892a3eda] + StringManipulation v0.4.1 [3783bdb8] + TableTraits v1.0.1 [bd369af6] + Tables v1.12.0 [7f6f6c8a] + TextSearch v0.18.0 [8290d209] + ThreadingUtilities v0.5.2 [3bb67fe8] + TranscodingStreams v0.11.3 [3a884ed6] + UnPack v1.0.2 [3d5dd08c] + VectorizationBase v0.21.71 [0dad84c5] + ArgTools v1.1.1 [56f22d72] + Artifacts [2a0f44e3] + Base64 [ade2ca70] + Dates [8ba89e20] + Distributed [f43a241f] + Downloads v1.6.0 [7b1f6079] + FileWatching [9fa8497b] + Future [b77e0a4c] + InteractiveUtils [b27032c2] + LibCURL v0.6.4 [76f85450] + LibGit2 [8f399da3] + Libdl [37e2e46d] + LinearAlgebra [56ddb016] + Logging [d6f4376e] + Markdown [a63ad114] + Mmap [ca575930] + NetworkOptions v1.2.0 [44cfe95a] + Pkg v1.10.0 [de0858da] + Printf [3fa0cd96] + REPL [9a3f8284] + Random [ea8e919c] + SHA v0.7.0 [9e88b42a] + Serialization [6462fe0b] + Sockets [2f01184e] + SparseArrays v1.10.0 [10745b16] + Statistics v1.10.0 [fa267f1f] + TOML v1.0.3 [a4e569a6] + Tar v1.10.0 [8dfed614] + Test [cf7118a7] + UUIDs [4ec0a83e] + Unicode [e66e0078] + CompilerSupportLibraries_jll v1.1.1+0 [deac9b47] + LibCURL_jll v8.4.0+0 [e37daf67] + LibGit2_jll v1.6.4+0 [29816b5a] + LibSSH2_jll v1.11.0+1 [c8ffd9c3] + MbedTLS_jll v2.28.2+1 [14a3606d] + MozillaCACerts_jll v2023.1.10 [4536629a] + OpenBLAS_jll v0.3.23+4 [bea87d4a] + SuiteSparse_jll v7.2.1+1 [83775a58] + Zlib_jll v1.2.13+1 [8e850b90] + libblastrampoline_jll v5.11.0+0 [8e850ede] + nghttp2_jll v1.52.0+1 [3f19e933] + p7zip_jll v17.4.0+2 Info Packages marked with ⌃ and ⌅ have new versions available. Those with ⌃ may be upgradable, but those with ⌅ are restricted by compatibility constraints from upgrading. To see why use `status --outdated -m` Installation completed after 7.75s ################################################################################ # Precompilation # Precompiling PkgEval dependencies... Precompiling package dependencies... Precompilation completed after 173.82s ################################################################################ # Testing # Testing TextSearch Status `/tmp/jl_S7RL1X/Project.toml` [4c88cf16] Aqua v0.8.11 [324d7699] CategoricalArrays v0.10.8 [6d0fbc77] Intersections v0.4.0 [b20bd276] InvertedFiles v0.7.1 ⌅ [033835bb] JLD2 v0.4.54 [f517fe37] Polyester v0.7.16 [053f045d] SimilaritySearch v0.11.10 ⌅ [2913bbd2] StatsBase v0.33.21 [7f6f6c8a] TextSearch v0.18.0 [37e2e46d] LinearAlgebra [9a3f8284] Random [2f01184e] SparseArrays v1.10.0 [8dfed614] Test [4ec0a83e] Unicode Status `/tmp/jl_S7RL1X/Manifest.toml` [79e6a3ab] Adapt v4.2.0 [4c88cf16] Aqua v0.8.11 [4fba245c] ArrayInterface v7.18.0 [62783981] BitTwiddlingConvenienceFunctions v0.1.6 [2a0fbf3d] CPUSummary v0.2.6 [324d7699] CategoricalArrays v0.10.8 [fb6a15b2] CloseOpenIntervals v0.1.13 [f70d9fcc] CommonWorldInvalidations v1.0.0 [34da2185] Compat v4.16.0 [adafc99b] CpuId v0.3.1 [a8cc5b0e] Crayons v4.1.1 [9a962f9c] DataAPI v1.16.0 [a93c6f00] DataFrames v1.7.0 [864edb3b] DataStructures v0.18.20 [e2d170a0] DataValueInterfaces v1.0.0 [8bb1440f] DelimitedFiles v1.9.1 [b4f34e82] Distances v0.10.12 [ffbed154] DocStringExtensions v0.9.3 [5789e2e9] FileIO v1.16.6 [3e5b6fbb] HostCPUFeatures v0.1.17 [615f187c] IfElse v0.1.1 [842dd82b] InlineStrings v1.4.3 [6d0fbc77] Intersections v0.4.0 [b20bd276] InvertedFiles v0.7.1 [41ab1584] InvertedIndices v1.3.1 [92d709cd] IrrationalConstants v0.2.4 [82899510] IteratorInterfaceExtensions v1.0.0 ⌅ [033835bb] JLD2 v0.4.54 [5d8de97f] KCenters v0.9.0 [b964fa9f] LaTeXStrings v1.4.0 [10f19ff3] LayoutPointers v0.1.17 ⌅ [7f8f8fb0] LearnBase v0.3.0 [2ab3a3ac] LogExpFunctions v0.3.29 [bdcacae8] LoopVectorization v0.12.171 ⌃ [9920b226] MLDataPattern v0.5.4 [cc2ba9b6] MLDataUtils v0.5.4 [66a33bbf] MLLabelUtils v0.5.7 [1914dd2f] MacroTools v0.5.15 [d125e4d3] ManualMemory v0.1.8 [dbb5928d] MappedArrays v0.4.2 [e1d29d7a] Missings v1.2.0 [6fe1bfb0] OffsetArrays v1.15.0 [bac558e1] OrderedCollections v1.8.0 [d96e819e] Parameters v0.12.3 [f517fe37] Polyester v0.7.16 [1d0040c9] PolyesterWeave v0.2.2 [2dfb63ee] PooledArrays v1.4.3 [aea7be01] PrecompileTools v1.2.1 [21216c6a] Preferences v1.4.3 [08abe8d2] PrettyTables v2.4.0 [189a3867] Reexport v1.2.2 [ae029012] Requires v1.3.0 [94e857df] SIMDTypes v0.1.0 [476501e8] SLEEFPirates v0.6.43 ⌅ [0e966ebe] SearchModels v0.3.3 [91c51154] SentinelArrays v1.4.8 [053f045d] SimilaritySearch v0.11.10 [a2af1166] SortingAlgorithms v1.2.1 [aedffcd0] Static v1.1.1 [0d7ed370] StaticArrayInterface v1.8.0 [82ae8749] StatsAPI v1.7.0 ⌅ [2913bbd2] StatsBase v0.33.21 [7792a7ef] StrideArraysCore v0.5.7 [892a3eda] StringManipulation v0.4.1 [3783bdb8] TableTraits v1.0.1 [bd369af6] Tables v1.12.0 [7f6f6c8a] TextSearch v0.18.0 [8290d209] ThreadingUtilities v0.5.2 [3bb67fe8] TranscodingStreams v0.11.3 [3a884ed6] UnPack v1.0.2 [3d5dd08c] VectorizationBase v0.21.71 [0dad84c5] ArgTools v1.1.1 [56f22d72] Artifacts [2a0f44e3] Base64 [ade2ca70] Dates [8ba89e20] Distributed [f43a241f] Downloads v1.6.0 [7b1f6079] FileWatching [9fa8497b] Future [b77e0a4c] InteractiveUtils [b27032c2] LibCURL v0.6.4 [76f85450] LibGit2 [8f399da3] Libdl [37e2e46d] LinearAlgebra [56ddb016] Logging [d6f4376e] Markdown [a63ad114] Mmap [ca575930] NetworkOptions v1.2.0 [44cfe95a] Pkg v1.10.0 [de0858da] Printf [3fa0cd96] REPL [9a3f8284] Random [ea8e919c] SHA v0.7.0 [9e88b42a] Serialization [6462fe0b] Sockets [2f01184e] SparseArrays v1.10.0 [10745b16] Statistics v1.10.0 [fa267f1f] TOML v1.0.3 [a4e569a6] Tar v1.10.0 [8dfed614] Test [cf7118a7] UUIDs [4ec0a83e] Unicode [e66e0078] CompilerSupportLibraries_jll v1.1.1+0 [deac9b47] LibCURL_jll v8.4.0+0 [e37daf67] LibGit2_jll v1.6.4+0 [29816b5a] LibSSH2_jll v1.11.0+1 [c8ffd9c3] MbedTLS_jll v2.28.2+1 [14a3606d] MozillaCACerts_jll v2023.1.10 [4536629a] OpenBLAS_jll v0.3.23+4 [bea87d4a] SuiteSparse_jll v7.2.1+1 [83775a58] Zlib_jll v1.2.13+1 [8e850b90] libblastrampoline_jll v5.11.0+0 [8e850ede] nghttp2_jll v1.52.0+1 [3f19e933] p7zip_jll v17.4.0+2 Info Packages marked with ⌃ and ⌅ have new versions available. Those with ⌃ may be upgradable, but those with ⌅ are restricted by compatibility constraints from upgrading. Testing Running tests... Test Summary: | Pass Total Time Unbound type parameters | 1 1 0.5s Test Summary: | Pass Total Time Undefined exports | 1 1 0.0s Test Summary: | Pass Total Time Compare Project.toml and test/Project.toml | 1 1 0.0s Test Summary: | Pass Total Time Stale dependencies | 1 1 10.9s Test Summary: | Pass Total Time Compat bounds | 4 4 0.5s Test Summary: | Pass Total Time Persistent tasks | 1 1 19.2s Test Summary: | Pass Total Time DVEC | 109 109 2.7s Test Summary: | Pass Total Time individual tokenizers | 6 6 2.9s Test Summary: | Pass Total Time message vectors | 1 1 0.2s [ Info: (2, 1) (A.corpuslen, B.corpuslen, C.corpuslen) = (2, 1, 3) Test Summary: | Pass Total Time vocabulary of different kinds of docs | 7 7 3.6s Test Summary: | Pass Total Time Normalize and tokenize | 1 1 0.0s Test Summary: | Pass Total Time Normalize and tokenize bigrams and trigrams | 1 1 0.0s Test Summary: | Pass Total Time Normalize and tokenize | 1 1 0.0s text1 = "hello world!! @user;) #jello.world :)" tokens = TokenizedText{Vector{String}}(["hello !! ;)\ts", "world @user #jello\ts", "!! ;) .\ts", "@user #jello world\ts", ";) . :)\ts"]) text1 = "hello world!! @user;) #jello.world :)" tokens = TokenizedText{Vector{String}}(["hello !!", "world @user", "!! ;)", "@user #jello", ";) .", "#jello world", ". :)", "hello !! ;)", "world @user #jello", "!! ;) .", "@user #jello world", ";) . :)"]) Test Summary: | Pass Total Time Tokenize skipgrams | 2 2 0.3s Test Summary: | Pass Total Time vocabulary | 4 4 0.9s [ Info: =================== Test Summary: | Pass Total Time Vocabulary and BOW | 1 1 1.0s [ Info: ["la casa roja", "la casa verde", "la casa azul", "la manzana roja", "la pera verde esta rica", "la manzana verde esta rica", "la hoja verde"] [ Info: =================== Test Summary: |Time Approximate vocabulary | None 2.9s text1 => x = "hello world!! @user;) #jello.world :)" => Dict{UInt32, Float32}(0x00000005 => 0.30151135, 0x00000004 => 0.30151135, 0x00000007 => 0.30151135, 0x00000002 => 0.6030227, 0x00000009 => 0.30151135, 0x00000008 => 0.30151135, 0x00000001 => 0.30151135, 0x00000003 => 0.30151135) corpus = ["hello world :)", "@user;) excellent!!", "#jello world."] text1 = "hello world!! @user;) #jello.world :)" text2 = "a b c d e f g h i j k l m n o p q" text2 => v = "a b c d e f g h i j k l m n o p q" => Dict{UInt32, Float32}(0x00000000 => 1.0) Test Summary: | Pass Total Time Tokenizer, DVEC, and vectorize | 1 1 0.7s Test Summary: | Pass Total Time tokenize list of strings as a single message | 1 1 0.2s (length(corpus), length(corpus_bows)) = (3, 5) Test Summary: | Pass Total Time Tokenizer, DVEC, and vectorize | 2 2 2.1s (gw, lw, dot_, dot(x, y), x, y) = (BinaryGlobalWeighting(), FreqWeighting(), 0.3162, 0.3162277568183818, Dict{UInt32, Float32}(0x00000005 => 0.4472136, 0x00000004 => 0.8944272), Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000006 => 0.70710677)) (gw, lw, dot_, dot(x, y), x, y) = (BinaryGlobalWeighting(), TfWeighting(), 0.3162, 0.3162277568183818, Dict{UInt32, Float32}(0x00000005 => 0.4472136, 0x00000004 => 0.8944272), Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000006 => 0.70710677)) (gw, lw, dot_, dot(x, y), x, y) = (BinaryGlobalWeighting(), TpWeighting(), 0.3162, 0.3162277568183818, Dict{UInt32, Float32}(0x00000005 => 0.4472136, 0x00000004 => 0.8944272), Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000006 => 0.70710677)) (gw, lw, dot_, dot(x, y), x, y) = (IdfWeighting(), BinaryLocalWeighting(), 0.3668, 0.3668392932243272, Dict{UInt32, Float32}(0x00000005 => 0.5187891, 0x00000004 => 0.85490215), Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000006 => 0.70710677)) (gw, lw, dot_, dot(x, y), x, y) = (IdfWeighting(), TfWeighting(), 0.2053, 0.20530779014815081, Dict{UInt32, Float32}(0x00000005 => 0.29034907, 0x00000004 => 0.9569208), Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000006 => 0.70710677)) (gw, lw, dot_, dot(x, y), x, y) = (EntropyWeighting(), FreqWeighting(), 0.44456, 0.4445559329173179, Dict{UInt32, Float32}(0x00000005 => 0.4472136, 0x00000004 => 0.8944272), Dict{UInt32, Float32}(0x00000005 => 0.9940573, 0x00000006 => 0.10885762)) (gw, lw, dot_, dot(x, y), x, y) = (EntropyWeighting(), TfWeighting(), 0.44456, 0.4445559329173179, Dict{UInt32, Float32}(0x00000005 => 0.4472136, 0x00000004 => 0.8944272), Dict{UInt32, Float32}(0x00000005 => 0.9940573, 0x00000006 => 0.10885762)) (gw, lw, dot_, dot(x, y), x, y) = (EntropyWeighting(), TpWeighting(), 0.44456, 0.4445559329173179, Dict{UInt32, Float32}(0x00000005 => 0.4472136, 0x00000004 => 0.8944272), Dict{UInt32, Float32}(0x00000005 => 0.9940573, 0x00000006 => 0.10885762)) (gw, lw, dot_, dot(x, y), x, y) = (EntropyWeighting(), BinaryLocalWeighting(), 0.7029, 0.7029046440666136, Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000004 => 0.70710677), Dict{UInt32, Float32}(0x00000005 => 0.9940573, 0x00000006 => 0.10885762)) [ Info: ====== weight: [ Info: Float32[1.0, 1.0, 1.0, 1.0, 1.0, 0.109508395, 1.0, 1.0] [ Info: Float32[1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0] [ Info: ("====== token:", ["me", "gusta", "encanta", "lo", "odio", "lol", "!"]) [ Info: ("lo lo odio", "odio esto") ("=========", x, y, norm(x), norm(y)) = ("=========", Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000004 => 0.70710677), Dict{UInt32, Float32}(0x00000005 => 1.0), 0.99999994f0, 1.0f0) (gw, lw, dot(x, y), dot_, x, y) = (EntropyWeighting(), BinaryLocalWeighting(), 0.7071067690849304, 0.7071067690849304, Dict{UInt32, Float32}(0x00000005 => 0.70710677, 0x00000004 => 0.70710677), Dict{UInt32, Float32}(0x00000005 => 1.0)) [ Info: ====== weight: [ Info: Float32[0.6520767, 1.8744692, 1.1375035, 1.8744692, 1.1375035, 1.1375035, 1.8744692, 1.8744692] [ Info: Float32[1.8744692, 1.8744692, 1.8744692, 1.8744692] [ Info: ("====== token:", ["gusta", "lo", "lol", "!"]) [ Info: ("lo lo odio", "odio esto") ("=========", x, y, norm(x), norm(y)) = ("=========", Dict{UInt32, Float32}(0x00000002 => 1.0), Dict{UInt32, Float32}(0x00000000 => 1.0), 1.0f0, 1.0f0) (gw, lw, dot(x, y), dot_, x, y) = (IdfWeighting(), TfWeighting(), 0.0, 0.0, Dict{UInt32, Float32}(0x00000002 => 1.0), Dict{UInt32, Float32}(0x00000000 => 1.0)) Test Summary: | Pass Total Time Weighting schemes | 15 15 6.5s Test Summary: | Pass Total Time distances | 3 3 0.5s Test Summary: | Pass Total Time operations | 7 7 0.4s Test Summary: | Pass Total Time invindex | 1 1 2.0s Test Summary: | Pass Total Time centroid computing | 1 1 0.8s [ Info: 1 => "la casa roja" [ Info: 2 => "la casa verde" [ Info: 3 => "la casa azul" [ Info: 4 => "la manzana roja" [ Info: 5 => "la pera verde esta rica" [ Info: 6 => "la manzana verde esta rica" [ Info: 7 => "la hoja verde" invfile.voc = Vocabulary{TokenLookup}(TokenLookup(), TextConfig(true, false, false, true, true, false, false, true, 0, true, Int8[], Int8[1], Skipgram[], IdentityTokenTransformation()), ["casa", "roja", "verde", "manzana", "esta", "rica"], Int32[3, 2, 4, 2, 2, 2], Int32[3, 2, 4, 2, 2, 2], Dict{String, UInt32}("rica" => 0x00000006, "esta" => 0x00000005, "verde" => 0x00000003, "roja" => 0x00000002, "casa" => 0x00000001, "manzana" => 0x00000004), 7) invfile.bm25 = BM25(2.2f0, 0.3f0, 0.252f0, 1.0f0, 0x00000007) Test Summary: | Pass Total Time bm25 invindex | 2 2 2.9s collect(DistView(R.res)) = Float32[-3.2325513, -3.03588, -2.4077673] invfile.voc = Vocabulary{TokenLookup}(TokenLookup(), TextConfig(true, false, false, true, true, false, false, true, 0, true, Int8[], Int8[1], Skipgram[], IdentityTokenTransformation()), ["la", "casa", "roja", "verde", "azul", "manzana", "pera", "esta", "rica", "hoja"], Int32[7, 3, 2, 4, 1, 2, 1, 2, 2, 1], Int32[7, 3, 2, 4, 1, 2, 1, 2, 2, 1], Dict{String, UInt32}("rica" => 0x00000009, "esta" => 0x00000008, "verde" => 0x00000004, "roja" => 0x00000003, "hoja" => 0x0000000a, "casa" => 0x00000002, "pera" => 0x00000007, "manzana" => 0x00000006, "azul" => 0x00000005, "la" => 0x00000001), 7) invfile.bm25 = BM25(2.2f0, 0.3f0, 0.252f0, 1.0f0, 0x00000007) [ Info: --- load and save!!! ┌ Warning: the database was not stored and was not passed to loadindex └ @ SimilaritySearch ~/.julia/packages/SimilaritySearch/fkUaT/src/io.jl:97 Test Summary: | Pass Total Time bm25 invindex | 4 4 1m34.2s [ Info: FINISH Testing TextSearch tests passed Testing completed after 191.64s PkgEval succeeded after 399.62s