Allow default-initializing Thrust vectors #4183

Open: bernhardmgruber wants to merge 2 commits into main

@bernhardmgruber bernhardmgruber commented Mar 19, 2025

I have had this changeset for a while now, but didn't push it, since the discussion in #1992 and among the team was somewhat controversial. However, I keep getting asked about this every now and then, so here it is: an overload for a Thrust vector's constructor and resize member function that turns value initialization into default initialization. I refer to #1992 for rationale and discussion.

Fixes: #1992

Compiling dv.cu:

#include <thrust/device_vector.h>
int main() {
  thrust::device_vector<int> v(10, thrust::default_init);
  v.resize(100, thrust::default_init);
}

with

nvcc -Ithrust -Icub -Ilibcudacxx/include dv.cu

and dumping the SASS shows that only code for cub::detail::EmptyKernel is generated.
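
For readers who want to see the value-initialization versus default-initialization distinction without a CUDA toolchain, here is a minimal host-only sketch of the tag-dispatch pattern. `toy_vector` and `toy_vector_demo` are hypothetical stand-ins written for illustration and are not how Thrust implements it; only the names `default_init_t` and `default_init` are taken from this PR. The sketch assumes plain `new T[n]()` and `new T[n]` as the two initialization forms.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <memory>
#include <utility>

// Tag type and tag object, mirroring the names proposed in the PR.
struct default_init_t {};
constexpr default_init_t default_init{};

// Hypothetical host-only container; NOT thrust::device_vector.
template <class T>
class toy_vector {
  std::unique_ptr<T[]> data_;
  std::size_t size_ = 0;

public:
  // Value initialization: trivial types such as int are zeroed.
  explicit toy_vector(std::size_t n) : data_(new T[n]()), size_(n) {}

  // Default initialization: trivial types are left indeterminate,
  // so nothing is written to the new storage.
  toy_vector(std::size_t n, default_init_t) : data_(new T[n]), size_(n) {}

  // Tagged resize: existing elements are copied over, any new elements are
  // default-initialized. The caller must have written the existing elements.
  void resize(std::size_t n, default_init_t) {
    std::unique_ptr<T[]> fresh(new T[n]);
    std::copy_n(data_.get(), std::min(n, size_), fresh.get());
    data_ = std::move(fresh);
    size_ = n;
  }

  std::size_t size() const { return size_; }
  T& operator[](std::size_t i) { return data_[i]; }
};

// Usage: the untagged constructor zeroes, the tagged one does not.
inline void toy_vector_demo() {
  toy_vector<int> zeroed(4);  // value-initialized
  for (std::size_t i = 0; i < zeroed.size(); ++i) assert(zeroed[i] == 0);

  toy_vector<int> raw(2, default_init);  // indeterminate until written
  raw[0] = 7;
  raw[1] = 9;
  raw.resize(5, default_init);  // elements 2..4 must be written before reading
  assert(raw.size() == 5 && raw[0] == 7 && raw[1] == 9);
}
```

Reading an `int` element of `raw` before writing it would be undefined behavior, which is the price of skipping the fill.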

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner March 19, 2025 03:26
Comment on lines 48 to 52
struct default_init_t
{};

constexpr default_init_t default_init;

Contributor Author

Given the range of opinions on this feature, I am also fine with wrapping those two entities in an experimental namespace.

Contributor

🟨 CI finished in 2h 55m: Pass: 94%/97 | Total: 2d 18h | Avg: 41m 06s | Max: 1h 53m | Hits: 68%/128171
  • 🟨 cub: Pass: 88%/45 | Total: 1d 15h | Avg: 52m 29s | Max: 1h 53m | Hits: 72%/47670

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  88%/43  | Total:  1d 13h | Avg: 52m 26s | Max:  1h 53m | Hits:  72%/45226 
      🟩 arm64              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 32s | Max:  1h 00m | Hits:  69%/2444  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 51m | Avg: 58m 12s | Max:  1h 10m | Hits:  70%/5940  
      🟩 12.6               Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 18m | Hits:  69%/2260  
      🔍 12.8               Pass:  86%/38  | Total:  1d 07h | Avg: 50m 28s | Max:  1h 53m | Hits:  72%/39470 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m | Hits:  75%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 51m | Avg: 58m 12s | Max:  1h 10m | Hits:  70%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 18m | Hits:  69%/2260  
      🔍 nvcc12.8           Pass:  86%/36  | Total:  1d 05h | Avg: 49m 41s | Max:  1h 53m | Hits:  72%/37362 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m | Hits:  75%/2108  
      🔍 nvcc               Pass:  88%/43  | Total:  1d 13h | Avg: 51m 56s | Max:  1h 53m | Hits:  72%/45562 
    🔍 sm: 90 🔍
      🔍 90                 Pass:  66%/3   | Total:  1h 10m | Avg: 23m 31s | Max: 24m 15s | Hits:  84%/2444  
      🟩 90;90a;100         Pass: 100%/1   | Total: 46m 50s | Avg: 46m 50s | Max: 46m 50s | Hits:  69%/1222  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 19h 13m | Avg: 57m 39s | Max:  1h 18m | Hits:  70%/23662 
      🔍 20                 Pass:  80%/25  | Total: 20h 09m | Avg: 48m 21s | Max:  1h 53m | Hits:  74%/24008 
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 18m | Avg:  1h 04m | Max:  1h 10m | Hits:  69%/4896  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 06m | Hits:  69%/2444  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  69%/2444  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 58m | Avg: 59m 20s | Max:  1h 00m | Hits:  69%/2444  
      🟨 Clang18            Pass:  85%/7   | Total:  6h 06m | Avg: 52m 25s | Max:  1h 10m | Hits:  76%/6996  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 38m | Avg: 49m 12s | Max: 51m 51s | Hits:  69%/2448  
      🟩 GCC8               Pass: 100%/1   | Total: 47m 23s | Avg: 47m 23s | Max: 47m 23s | Hits:  69%/1224  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 45m | Avg: 52m 51s | Max: 52m 56s | Hits:  69%/2448  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 33m | Avg: 46m 47s | Max: 47m 52s | Hits:  69%/2448  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 39m | Avg: 49m 31s | Max: 51m 37s | Hits:  69%/2444  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 43m | Avg: 51m 32s | Max: 52m 01s | Hits:  69%/2444  
      🟨 GCC13              Pass:  63%/11  | Total:  7h 23m | Avg: 40m 18s | Max:  1h 53m | Hits:  77%/8554  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 50m | Avg: 55m 09s | Max: 55m 38s | Hits:  75%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 14s | Max: 57m 27s | Hits:  75%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 18m | Hits:  69%/2260  
    🟨 cxx_family
      🟨 Clang              Pass:  94%/17  | Total: 16h 33m | Avg: 58m 27s | Max:  1h 10m | Hits:  72%/19224 
      🟨 GCC                Pass:  81%/22  | Total: 16h 30m | Avg: 45m 01s | Max:  1h 53m | Hits:  72%/22010 
      🟩 MSVC               Pass: 100%/4   | Total:  3h 44m | Avg: 56m 11s | Max: 57m 27s | Hits:  75%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 18m | Hits:  69%/2260  
    🟨 gpu
      🟨 h100               Pass:  66%/3   | Total:  1h 10m | Avg: 23m 31s | Max: 24m 15s | Hits:  84%/2444  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 08h | Avg: 57m 24s | Max:  1h 18m | Hits:  70%/40338 
      🟨 rtxa6000           Pass:  50%/8   | Total:  5h 39m | Avg: 42m 28s | Max:  1h 53m | Hits:  84%/4888  
    🟨 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 10h | Avg: 56m 25s | Max:  1h 18m | Hits:  70%/44004 
      🟥 DeviceLaunch       Pass:   0%/1   | Total:  1h 53m | Avg:  1h 53m | Max:  1h 53m
      🟥 GraphCapture       Pass:   0%/1   | Total: 17m 45s | Avg: 17m 45s | Max: 17m 45s
      🟥 HostLaunch         Pass:   0%/3   | Total:  1h 16m | Avg: 25m 20s | Max: 26m 16s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 07m | Avg: 22m 35s | Max: 24m 10s | Hits:  99%/3666  
    
  • 🟩 thrust: Pass: 100%/45 | Total: 1d 01h | Avg: 33m 50s | Max: 1h 09m | Hits: 65%/80181

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 40m 47s | Avg: 20m 23s | Max: 29m 13s | Hits:  78%/3566  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 00h | Avg: 34m 01s | Max:  1h 09m | Hits:  66%/76616 
      🟩 arm64              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 09s | Max: 31m 35s | Hits:  56%/3565  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 17m | Avg: 39m 30s | Max:  1h 00m | Hits:  67%/8906  
      🟩 12.6               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 09m | Hits:  52%/3564  
      🟩 12.8               Pass: 100%/38  | Total: 19h 56m | Avg: 31m 29s | Max:  1h 08m | Hits:  66%/67711 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 00m | Avg: 30m 10s | Max: 31m 43s | Hits:  56%/3564  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 17m | Avg: 39m 30s | Max:  1h 00m | Hits:  67%/8906  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 09m | Hits:  52%/3564  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 18h 56m | Avg: 31m 34s | Max:  1h 08m | Hits:  66%/64147 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 10s | Max: 31m 43s | Hits:  56%/3564  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 00h | Avg: 34m 01s | Max:  1h 09m | Hits:  66%/76617 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 10m | Avg: 32m 38s | Max: 33m 40s | Hits:  64%/7128  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 44s | Max: 33m 03s | Hits:  56%/3564  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 24s | Max: 34m 29s | Hits:  56%/3564  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 32s | Max: 33m 16s | Hits:  56%/3564  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 58m | Avg: 25m 27s | Max: 35m 26s | Hits:  69%/12474 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 16s | Max: 35m 19s | Hits:  63%/3566  
      🟩 GCC8               Pass: 100%/1   | Total: 32m 16s | Avg: 32m 16s | Max: 32m 16s | Hits:  56%/1783  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 26s | Max: 35m 29s | Hits:  65%/3566  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 54s | Max: 37m 21s | Hits:  56%/3566  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 49s | Max: 32m 50s | Hits:  56%/3566  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 18m | Avg: 39m 04s | Max: 40m 19s | Hits:  56%/3566  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 42m | Avg: 22m 12s | Max: 35m 13s | Hits:  78%/17830 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m | Hits:  61%/3552  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 33m | Avg: 51m 18s | Max:  1h 08m | Hits:  68%/5328  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 09m | Hits:  52%/3564  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 26m | Avg: 29m 46s | Max: 35m 26s | Hits:  63%/30294 
      🟩 GCC                Pass: 100%/21  | Total: 10h 11m | Avg: 29m 06s | Max: 40m 19s | Hits:  68%/37443 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 36m | Avg: 55m 20s | Max:  1h 08m | Hits:  65%/8880  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 09m | Hits:  52%/3564  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 33m 43s | Avg: 16m 51s | Max: 21m 54s | Hits:  78%/3566  
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 51m | Avg: 37m 54s | Max:  1h 09m | Hits:  59%/58802 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 58m | Avg: 23m 50s | Max:  1h 08m | Hits:  84%/17813 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 23h 56m | Avg: 37m 47s | Max:  1h 09m | Hits:  59%/67709 
      🟩 TestCPU            Pass: 100%/3   | Total: 41m 28s | Avg: 13m 49s | Max: 25m 02s | Hits:  99%/5341  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 24s | Avg: 11m 21s | Max: 11m 49s | Hits:  99%/7131  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 33m 43s | Avg: 16m 51s | Max: 21m 54s | Hits:  78%/3566  
      🟩 90;90a;100         Pass: 100%/1   | Total: 29m 58s | Avg: 29m 58s | Max: 29m 58s | Hits:  80%/1783  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 13h 19m | Avg: 39m 58s | Max:  1h 09m | Hits:  58%/35631 
      🟩 20                 Pass: 100%/23  | Total: 11h 22m | Avg: 29m 41s | Max:  1h 08m | Hits:  70%/40984 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 40s | Avg: 4m 10s | Max: 4m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  4m 59s
      🟩 arm64              Pass: 100%/2   | Total:  7m 01s | Avg:  3m 30s | Max:  3m 31s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 59s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 59s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 59s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 59s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 59s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 59s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 11s | Avg:  4m 05s | Max:  4m 40s
      🟩 20                 Pass: 100%/2   | Total:  8m 29s | Avg:  4m 14s | Max:  4m 59s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 15s | Avg: 8m 37s | Max: 14m 54s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 54s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 54s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 54s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 54s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 54s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 54s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 14m 54s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 08m | Avg: 1h 08m | Max: 1h 08m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 97)

# Runner
68 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

Contributor

🟨 CI finished in 1h 30m: Pass: 98%/97 | Total: 2d 15h | Avg: 39m 06s | Max: 1h 12m | Hits: 69%/134281
  • 🟥 python: Pass: 0%/1 | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    🟥 ctk
      🟥 12.8               Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    🟥 cudacxx
      🟥 nvcc12.8           Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    🟥 gpu
      🟥 rtx2080            Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s
    
  • 🟩 cub: Pass: 100%/45 | Total: 1d 13h | Avg: 50m 19s | Max: 1h 12m | Hits: 74%/53780

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 11h | Avg: 50m 08s | Max:  1h 12m | Hits:  74%/51336 
      🟩 arm64              Pass: 100%/2   | Total:  1h 48m | Avg: 54m 09s | Max:  1h 01m | Hits:  69%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 47m | Avg: 57m 27s | Max:  1h 02m | Hits:  70%/5940  
      🟩 12.6               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  1d 06h | Avg: 48m 14s | Max:  1h 05m | Hits:  75%/45580 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  75%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 47m | Avg: 57m 27s | Max:  1h 02m | Hits:  70%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 04h | Avg: 47m 24s | Max:  1h 05m | Hits:  75%/43472 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  75%/2108  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 11h | Avg: 49m 43s | Max:  1h 12m | Hits:  74%/51672 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 10m | Avg:  1h 02m | Max:  1h 05m | Hits:  69%/4896  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  69%/2444  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 05m | Hits:  69%/2444  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  69%/2444  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 52m | Avg: 50m 23s | Max:  1h 04m | Hits:  80%/8218  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 40m | Avg: 50m 18s | Max: 51m 32s | Hits:  69%/2448  
      🟩 GCC8               Pass: 100%/1   | Total: 47m 41s | Avg: 47m 41s | Max: 47m 41s | Hits:  69%/1224  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 41m | Avg: 50m 37s | Max: 53m 22s | Hits:  69%/2448  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 39m | Avg: 49m 39s | Max: 49m 54s | Hits:  69%/2448  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 39m | Avg: 49m 58s | Max: 51m 21s | Hits:  69%/2444  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 38m | Avg: 49m 18s | Max: 50m 12s | Hits:  69%/2444  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 14m | Avg: 34m 03s | Max: 47m 57s | Hits:  82%/13442 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 29s | Max: 58m 47s | Hits:  75%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 02s | Max: 58m 19s | Hits:  75%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 07m | Avg: 56m 55s | Max:  1h 05m | Hits:  73%/20446 
      🟩 GCC                Pass: 100%/22  | Total: 15h 21m | Avg: 41m 54s | Max: 53m 22s | Hits:  75%/26898 
      🟩 MSVC               Pass: 100%/4   | Total:  3h 51m | Avg: 57m 45s | Max: 58m 47s | Hits:  75%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 37m | Avg: 32m 39s | Max: 47m 57s | Hits:  78%/3666  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 08h | Avg: 56m 33s | Max:  1h 12m | Hits:  70%/40338 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 03m | Avg: 30m 25s | Max:  1h 00m | Hits:  92%/9776  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 10h | Avg: 55m 33s | Max:  1h 12m | Hits:  69%/44004 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 27s | Avg: 22m 27s | Max: 22m 27s | Hits:  99%/1222  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 41s | Avg: 17m 41s | Max: 17m 41s | Hits:  99%/1222  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 14m | Avg: 24m 43s | Max: 25m 35s | Hits:  99%/3666  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 34m | Avg: 31m 35s | Max: 47m 57s | Hits:  89%/3666  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 37m | Avg: 32m 39s | Max: 47m 57s | Hits:  78%/3666  
      🟩 90;90a;100         Pass: 100%/1   | Total: 46m 09s | Avg: 46m 09s | Max: 46m 09s | Hits:  67%/1222  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 18h 42m | Avg: 56m 08s | Max:  1h 11m | Hits:  70%/23662 
      🟩 20                 Pass: 100%/25  | Total: 19h 01m | Avg: 45m 40s | Max:  1h 12m | Hits:  78%/30118 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 1d 00h | Avg: 32m 57s | Max: 1h 02m | Hits: 65%/80181

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 41m 30s | Avg: 20m 45s | Max: 29m 57s | Hits:  78%/3566  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 23h 42m | Avg: 33m 04s | Max:  1h 02m | Hits:  65%/76616 
      🟩 arm64              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 31s | Max: 31m 52s | Hits:  56%/3565  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 11m | Avg: 38m 14s | Max: 58m 14s | Hits:  61%/8906  
      🟩 12.6               Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
      🟩 12.8               Pass: 100%/38  | Total: 19h 29m | Avg: 30m 47s | Max:  1h 00m | Hits:  66%/67711 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 56m 07s | Avg: 28m 03s | Max: 28m 24s | Hits:  56%/3564  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 11m | Avg: 38m 14s | Max: 58m 14s | Hits:  61%/8906  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 18h 33m | Avg: 30m 56s | Max:  1h 00m | Hits:  66%/64147 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 07s | Avg: 28m 03s | Max: 28m 24s | Hits:  56%/3564  
      🟩 nvcc               Pass: 100%/43  | Total: 23h 47m | Avg: 33m 11s | Max:  1h 02m | Hits:  65%/76617 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 03s | Max: 33m 19s | Hits:  60%/7128  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 15s | Max: 32m 17s | Hits:  56%/3564  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 07s | Max: 33m 22s | Hits:  56%/3564  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 15s | Max: 34m 05s | Hits:  56%/3564  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 47m | Avg: 23m 53s | Max: 33m 09s | Hits:  69%/12474 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 21s | Max: 33m 26s | Hits:  60%/3566  
      🟩 GCC8               Pass: 100%/1   | Total: 33m 24s | Avg: 33m 24s | Max: 33m 24s | Hits:  56%/1783  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 51s | Max: 35m 18s | Hits:  66%/3566  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 09s | Max: 33m 15s | Hits:  56%/3566  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 15s | Max: 35m 50s | Hits:  56%/3566  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 45s | Max: 38m 21s | Hits:  56%/3566  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 44m | Avg: 22m 29s | Max: 34m 51s | Hits:  78%/17830 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 03s | Max: 58m 14s | Hits:  57%/3552  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 25m | Avg: 48m 25s | Max:  1h 00m | Hits:  68%/5328  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 12m | Avg: 28m 59s | Max: 34m 05s | Hits:  62%/30294 
      🟩 GCC                Pass: 100%/21  | Total: 10h 07m | Avg: 28m 54s | Max: 38m 21s | Hits:  68%/37443 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 21m | Avg: 52m 16s | Max:  1h 00m | Hits:  63%/8880  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 34m 00s | Avg: 17m 00s | Max: 22m 24s | Hits:  78%/3566  
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 20m | Avg: 36m 59s | Max:  1h 02m | Hits:  58%/58802 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 48m | Avg: 22m 49s | Max:  1h 00m | Hits:  84%/17813 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 23h 15m | Avg: 36m 44s | Max:  1h 02m | Hits:  58%/67709 
      🟩 TestCPU            Pass: 100%/3   | Total: 41m 42s | Avg: 13m 54s | Max: 25m 39s | Hits:  99%/5341  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 32s | Avg: 11m 23s | Max: 11m 45s | Hits:  99%/7131  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 34m 00s | Avg: 17m 00s | Max: 22m 24s | Hits:  78%/3566  
      🟩 90;90a;100         Pass: 100%/1   | Total: 31m 08s | Avg: 31m 08s | Max: 31m 08s | Hits:  80%/1783  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 52m | Avg: 38m 37s | Max:  1h 02m | Hits:  58%/35631 
      🟩 20                 Pass: 100%/23  | Total: 11h 09m | Avg: 29m 06s | Max:  1h 00m | Hits:  70%/40984 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 14s | Avg: 4m 18s | Max: 5m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 47s | Avg:  4m 53s | Max:  5m 04s
      🟩 arm64              Pass: 100%/2   | Total:  7m 27s | Avg:  3m 43s | Max:  3m 53s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 38s | Avg:  4m 19s | Max:  5m 04s
      🟩 20                 Pass: 100%/2   | Total:  8m 36s | Avg:  4m 18s | Max:  4m 43s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 27s | Avg: 8m 13s | Max: 14m 12s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s | Hits:  98%/160   
    

@@ -45,6 +45,11 @@

THRUST_NAMESPACE_BEGIN

struct default_init_t
Contributor

With cudax we went with uninit_t, which I find easier for non-expert users.

Contributor Author

I thought default initialization was the safer way, since you cannot skip constructors. I am fine with also providing uninit_t. Would you be fine if I opened a separate PR for that?
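
The difference between the two options can be sketched on the host in standard C++. This is an illustration under assumptions, not Thrust or cudax code: `counted` and both helper functions are hypothetical names, `new counted[n]` stands in for default initialization, and a bare `::operator new` call stands in for handing out uninitialized storage.

```cpp
#include <cassert>
#include <cstddef>
#include <new>

// Hypothetical element type with a non-trivial default constructor.
struct counted {
  static int constructed;
  int value;
  counted() : value(42) { ++constructed; }
};
int counted::constructed = 0;

// Default initialization: every element's constructor still runs.
inline int constructors_run_by_default_init(std::size_t n) {
  counted::constructed = 0;
  counted* p = new counted[n];
  int runs = counted::constructed;
  delete[] p;
  return runs;
}

// Uninitialized storage: memory is obtained, but no constructor runs.
inline int constructors_run_by_raw_storage(std::size_t n) {
  counted::constructed = 0;
  void* raw = ::operator new(n * sizeof(counted));
  int runs = counted::constructed;
  ::operator delete(raw);
  return runs;
}

// Usage: default initialization cannot skip constructors, raw storage does.
inline void constructor_count_demo() {
  assert(constructors_run_by_default_init(8) == 8);
  assert(constructors_run_by_raw_storage(8) == 0);
}
```

With raw storage the elements do not exist as objects until something constructs them, which is why it is the sharper of the two tools.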

Contributor

Honestly, I am a bit hesitant here.

The main reason people want to use this is to avoid calling a kernel for instantiation. That this breaks for non-trivial types will be a source of frustration and bugs.

We should rather give them the right tool that "works" everywhere, even if it is a sharp edge.
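
To illustrate the concern about non-trivial types, here is a small sketch in standard C++ showing how easily a type stops being trivially default constructible. `plain`, `with_initializer`, and `default_init_is_noop` are hypothetical names, and treating `std::is_trivially_default_constructible` as the test for "default initialization does no work" is an assumption of this sketch, not something the PR states.

```cpp
#include <type_traits>

// Trivial default constructor: default initialization does no work.
struct plain { int x; };

// A default member initializer makes the default constructor non-trivial,
// so default initialization has to write x for every element.
struct with_initializer { int x = 0; };

// Hypothetical helper: true when default-initializing T needs no code to run.
template <class T>
constexpr bool default_init_is_noop() {
  return std::is_trivially_default_constructible<T>::value;
}

static_assert(default_init_is_noop<int>(), "int is left indeterminate");
static_assert(default_init_is_noop<plain>(), "plain is left indeterminate");
static_assert(!default_init_is_noop<with_initializer>(), "constructor must run");
```

Adding `= 0` to a member is enough to flip the result, so a default-initialized container of such a type would presumably still have to run constructors for every element.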

Contributor

🟩 CI finished in 2d 01h: Pass: 100%/97 | Total: 2d 16h | Avg: 39m 42s | Max: 1h 12m | Hits: 69%/134281
  • 🟩 cub: Pass: 100%/45 | Total: 1d 13h | Avg: 50m 19s | Max: 1h 12m | Hits: 74%/53780

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 11h | Avg: 50m 08s | Max:  1h 12m | Hits:  74%/51336 
      🟩 arm64              Pass: 100%/2   | Total:  1h 48m | Avg: 54m 09s | Max:  1h 01m | Hits:  69%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 47m | Avg: 57m 27s | Max:  1h 02m | Hits:  70%/5940  
      🟩 12.6               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  1d 06h | Avg: 48m 14s | Max:  1h 05m | Hits:  75%/45580 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  75%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 47m | Avg: 57m 27s | Max:  1h 02m | Hits:  70%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 04h | Avg: 47m 24s | Max:  1h 05m | Hits:  75%/43472 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  75%/2108  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 11h | Avg: 49m 43s | Max:  1h 12m | Hits:  74%/51672 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 10m | Avg:  1h 02m | Max:  1h 05m | Hits:  69%/4896  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  69%/2444  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 05m | Hits:  69%/2444  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  69%/2444  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 52m | Avg: 50m 23s | Max:  1h 04m | Hits:  80%/8218  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 40m | Avg: 50m 18s | Max: 51m 32s | Hits:  69%/2448  
      🟩 GCC8               Pass: 100%/1   | Total: 47m 41s | Avg: 47m 41s | Max: 47m 41s | Hits:  69%/1224  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 41m | Avg: 50m 37s | Max: 53m 22s | Hits:  69%/2448  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 39m | Avg: 49m 39s | Max: 49m 54s | Hits:  69%/2448  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 39m | Avg: 49m 58s | Max: 51m 21s | Hits:  69%/2444  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 38m | Avg: 49m 18s | Max: 50m 12s | Hits:  69%/2444  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 14m | Avg: 34m 03s | Max: 47m 57s | Hits:  82%/13442 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 29s | Max: 58m 47s | Hits:  75%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 02s | Max: 58m 19s | Hits:  75%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 07m | Avg: 56m 55s | Max:  1h 05m | Hits:  73%/20446 
      🟩 GCC                Pass: 100%/22  | Total: 15h 21m | Avg: 41m 54s | Max: 53m 22s | Hits:  75%/26898 
      🟩 MSVC               Pass: 100%/4   | Total:  3h 51m | Avg: 57m 45s | Max: 58m 47s | Hits:  75%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 12m | Hits:  69%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 37m | Avg: 32m 39s | Max: 47m 57s | Hits:  78%/3666  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 08h | Avg: 56m 33s | Max:  1h 12m | Hits:  70%/40338 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 03m | Avg: 30m 25s | Max:  1h 00m | Hits:  92%/9776  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 10h | Avg: 55m 33s | Max:  1h 12m | Hits:  69%/44004 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 27s | Avg: 22m 27s | Max: 22m 27s | Hits:  99%/1222  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 41s | Avg: 17m 41s | Max: 17m 41s | Hits:  99%/1222  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 14m | Avg: 24m 43s | Max: 25m 35s | Hits:  99%/3666  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 34m | Avg: 31m 35s | Max: 47m 57s | Hits:  89%/3666  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 37m | Avg: 32m 39s | Max: 47m 57s | Hits:  78%/3666  
      🟩 90;90a;100         Pass: 100%/1   | Total: 46m 09s | Avg: 46m 09s | Max: 46m 09s | Hits:  67%/1222  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 18h 42m | Avg: 56m 08s | Max:  1h 11m | Hits:  70%/23662 
      🟩 20                 Pass: 100%/25  | Total: 19h 01m | Avg: 45m 40s | Max:  1h 12m | Hits:  78%/30118 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 1d 00h | Avg: 32m 57s | Max: 1h 02m | Hits: 65%/80181

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 41m 30s | Avg: 20m 45s | Max: 29m 57s | Hits:  78%/3566  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 23h 42m | Avg: 33m 04s | Max:  1h 02m | Hits:  65%/76616 
      🟩 arm64              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 31s | Max: 31m 52s | Hits:  56%/3565  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 11m | Avg: 38m 14s | Max: 58m 14s | Hits:  61%/8906  
      🟩 12.6               Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
      🟩 12.8               Pass: 100%/38  | Total: 19h 29m | Avg: 30m 47s | Max:  1h 00m | Hits:  66%/67711 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 56m 07s | Avg: 28m 03s | Max: 28m 24s | Hits:  56%/3564  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 11m | Avg: 38m 14s | Max: 58m 14s | Hits:  61%/8906  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 18h 33m | Avg: 30m 56s | Max:  1h 00m | Hits:  66%/64147 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 07s | Avg: 28m 03s | Max: 28m 24s | Hits:  56%/3564  
      🟩 nvcc               Pass: 100%/43  | Total: 23h 47m | Avg: 33m 11s | Max:  1h 02m | Hits:  65%/76617 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 03s | Max: 33m 19s | Hits:  60%/7128  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 15s | Max: 32m 17s | Hits:  56%/3564  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 07s | Max: 33m 22s | Hits:  56%/3564  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 15s | Max: 34m 05s | Hits:  56%/3564  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 47m | Avg: 23m 53s | Max: 33m 09s | Hits:  69%/12474 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 21s | Max: 33m 26s | Hits:  60%/3566  
      🟩 GCC8               Pass: 100%/1   | Total: 33m 24s | Avg: 33m 24s | Max: 33m 24s | Hits:  56%/1783  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 09m | Avg: 34m 51s | Max: 35m 18s | Hits:  66%/3566  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 09s | Max: 33m 15s | Hits:  56%/3566  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 15s | Max: 35m 50s | Hits:  56%/3566  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 45s | Max: 38m 21s | Hits:  56%/3566  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 44m | Avg: 22m 29s | Max: 34m 51s | Hits:  78%/17830 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 56m | Avg: 58m 03s | Max: 58m 14s | Hits:  57%/3552  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 25m | Avg: 48m 25s | Max:  1h 00m | Hits:  68%/5328  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 12m | Avg: 28m 59s | Max: 34m 05s | Hits:  62%/30294 
      🟩 GCC                Pass: 100%/21  | Total: 10h 07m | Avg: 28m 54s | Max: 38m 21s | Hits:  68%/37443 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 21m | Avg: 52m 16s | Max:  1h 00m | Hits:  63%/8880  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  52%/3564  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 34m 00s | Avg: 17m 00s | Max: 22m 24s | Hits:  78%/3566  
      🟩 rtx2080            Pass: 100%/33  | Total: 20h 20m | Avg: 36m 59s | Max:  1h 02m | Hits:  58%/58802 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 48m | Avg: 22m 49s | Max:  1h 00m | Hits:  84%/17813 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 23h 15m | Avg: 36m 44s | Max:  1h 02m | Hits:  58%/67709 
      🟩 TestCPU            Pass: 100%/3   | Total: 41m 42s | Avg: 13m 54s | Max: 25m 39s | Hits:  99%/5341  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 32s | Avg: 11m 23s | Max: 11m 45s | Hits:  99%/7131  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 34m 00s | Avg: 17m 00s | Max: 22m 24s | Hits:  78%/3566  
      🟩 90;90a;100         Pass: 100%/1   | Total: 31m 08s | Avg: 31m 08s | Max: 31m 08s | Hits:  80%/1783  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 52m | Avg: 38m 37s | Max:  1h 02m | Hits:  58%/35631 
      🟩 20                 Pass: 100%/23  | Total: 11h 09m | Avg: 29m 06s | Max:  1h 00m | Hits:  70%/40984 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 14s | Avg: 4m 18s | Max: 5m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 47s | Avg:  4m 53s | Max:  5m 04s
      🟩 arm64              Pass: 100%/2   | Total:  7m 27s | Avg:  3m 43s | Max:  3m 53s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 14s | Avg:  4m 18s | Max:  5m 04s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 38s | Avg:  4m 19s | Max:  5m 04s
      🟩 20                 Pass: 100%/2   | Total:  8m 36s | Avg:  4m 18s | Max:  4m 43s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 27s | Avg: 8m 13s | Max: 14m 12s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 27s | Avg:  8m 13s | Max: 14m 12s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 12s | Avg: 14m 12s | Max: 14m 12s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 10m | Avg: 1h 10m | Max: 1h 10m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 97)

# Runner
68 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

Status: In Review
Development

Successfully merging this pull request may close these issues.

[FEA]: Allow default initialization for thrust vectors