[GPU] Widen GPU offsets (part2) #4991

Open
h-sadia wants to merge 7 commits into main from hsadia/offset_widen_pt2

@h-sadia h-sadia commented Apr 9, 2026

Description

This is part 2 of the offset widening for OpenCL kernels. It covers the kernels left over from part 1 (find it here).
Primitives touched in this part:

sum
shuffle
ref_deconv
RNN
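
The kernel changes themselves are not shown in this description. As a rough illustration of why offsets are widened, the sketch below computes the same linear tensor offset in 64-bit and in 32-bit arithmetic. The helper names `offset_wide` and `offset_narrow`, the dense N x C x H x W layout, and the tensor shape are assumptions made for this example; none of them come from the PR or from oneDNN.

```cpp
#include <cstdint>

// Hypothetical helper (not from the PR): linear offset of element
// (n, c, h, w) in a dense N x C x H x W tensor, computed in 64-bit.
constexpr int64_t offset_wide(int64_t n, int64_t c, int64_t h, int64_t w,
        int64_t C, int64_t H, int64_t W) {
    return ((n * C + c) * H + h) * W + w;
}

// The same formula in 32-bit arithmetic. Unsigned types are used here so
// that the wrap-around is well defined (modulo 2^32); with signed 32-bit
// integers the overflow would be undefined behavior in C++.
constexpr uint32_t offset_narrow(uint32_t n, uint32_t c, uint32_t h,
        uint32_t w, uint32_t C, uint32_t H, uint32_t W) {
    return ((n * C + c) * H + h) * W + w;
}

// Assumed shape: C = 1024, H = 256, W = 256. The element at n = 64 sits
// exactly 2^32 elements from the start of the buffer.
static_assert(offset_wide(64, 0, 0, 0, 1024, 256, 256) == 4294967296LL,
        "the 64-bit offset keeps the full value");
static_assert(offset_narrow(64, 0, 0, 0, 1024, 256, 256) == 0u,
        "the 32-bit offset wraps around to 0");
```

With the narrow version, the element at `n = 64` would alias the first element of the buffer. This is the kind of silent error that a wider offset type avoids for large tensors.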

This PR also patches the fix for MFDNN-14896 (https://jira.devtools.intel.com/browse/MFDNN-14896), restricting the regression testing to GPU only.

This PR also fixes the lossy implicit conversion in math_utils.h (found in src/gpu/intel/include), which was caught by the check introduced in response to @rjoursler's comments in part 1.
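
The affected code in math_utils.h is not quoted here, so the following is only a generic sketch of what a lossy implicit conversion looks like once intermediate values become 64-bit. Both `div_up` and `fits_in_int32` are hypothetical helpers written for this example and are not taken from the header.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper: ceiling division kept entirely in 64-bit.
constexpr int64_t div_up(int64_t a, int64_t b) {
    return (a + b - 1) / b;
}

// Hypothetical helper: true when a 64-bit value can be stored in a
// 32-bit signed integer without losing information.
constexpr bool fits_in_int32(int64_t v) {
    return v >= std::numeric_limits<int32_t>::min()
            && v <= std::numeric_limits<int32_t>::max();
}

// A small result is safe to narrow: div_up(10, 3) is 4.
static_assert(fits_in_int32(div_up(10, 3)), "small result fits in 32 bits");

// div_up(5000000000, 2) is 2500000000, above the int32 maximum of
// 2147483647. Writing `int x = div_up(...)` here would silently change
// the value, which is what a conversion warning is meant to flag.
static_assert(!fits_in_int32(div_up(5000000000LL, 2)),
        "large result does not fit in 32 bits");
```

Keeping the result in `int64_t`, or narrowing only after an explicit range check, avoids the silent change in value.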

Fixes MFDNN-14771, MFDNN-14896

Checklist

General

  • Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
  • Have you formatted the code using clang-format?

@h-sadia h-sadia requested review from a team as code owners April 9, 2026 19:44
@github-actions github-actions bot added platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel component:tests Codeowner: @oneapi-src/onednn-arch labels Apr 9, 2026
h-sadia commented Apr 9, 2026

make test
set test_scope=NIGHTLY
disable test_device_cpu
enable arch_gpu_xe-hpc
enable arch_gpu_xe-hpg-atsm
enable arch_gpu_xe-hpg-dg2
enable arch_gpu_xe-lp
enable arch_gpu_xe-lpg
enable arch_gpu_xe-lpg+
enable arch_gpu_xe2-hpg-bmg
enable arch_gpu_xe2-lpg
enable arch_gpu_xe3-lpg

@h-sadia h-sadia force-pushed the hsadia/offset_widen_pt2 branch from 47bf4aa to be0f323 Compare April 9, 2026 23:56
@h-sadia h-sadia force-pushed the hsadia/offset_widen_pt2 branch from be0f323 to 4127b9e Compare April 10, 2026 00:00
h-sadia commented Apr 10, 2026

make test
set test_scope=NIGHTLY
disable test_device_cpu
enable arch_gpu_xe-hpc
enable arch_gpu_xe-hpg-atsm
enable arch_gpu_xe-hpg-dg2
enable arch_gpu_xe-lp
enable arch_gpu_xe-lpg
enable arch_gpu_xe-lpg+
enable arch_gpu_xe2-hpg-bmg
enable arch_gpu_xe2-lpg
enable arch_gpu_xe3-lpg

3 participants