BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Events - ECPv6.15.20//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://events.ucsc.edu
X-WR-CALDESC:Events for Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Los_Angeles
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20250309T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20251102T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20260308T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20261101T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20270314T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20271107T090000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20260605T080000
DTEND;TZID=America/Los_Angeles:20260605T100000
DTSTAMP:20260601T113043Z
CREATED:20260527T160819Z
LAST-MODIFIED:20260527T160819Z
UID:10014878-1780646400-1780653600@events.ucsc.edu
SUMMARY:Chen\, Z. (CSE) - GPU Subgroup Semantics for Portable High-Performance Kernels
DESCRIPTION:Modern high-performance GPU kernels increasingly rely on subgroup-level execution\, including subgroup-level communication\, subgroup operations\, and matrix operations. These features are essential for workloads such as matrix multiplication and FlashAttention\, but their language-level guarantees remain difficult to reason about. Existing programming models often leave unclear which threads participate in subgroup operations\, when subgroup threads are required to execute together\, and what synchronization is implied by subgroup-level operations. This ambiguity becomes especially important in portable GPU programming\, where the same kernel may run across devices with different subgroup sizes\, compiler stacks\, browser backends\, and hardware execution behavior. \nMy research studies how precise subgroup semantics can support portable and correct high-performance GPU kernels. SIMT-Step\, my main completed work\, develops a formal and flexible operational semantics for GPU subgroup execution. It introduces dynamic blocks to specify converged subgroup execution and subgroup-operation participation\, classifies instructions as independent\, synchronous\, or collective to express a spectrum of candidate subgroup semantics\, and validates these models through a TLA+ implementation and an empirical fuzzing study across real GPUs. My systems work studies how subgroup-dependent kernels behave in practice\, including WebGPU FlashAttention kernels for LLM inference\, tunable WebGPU kernels for performance portability\, and Vulkan-based execution for heterogeneous SoCs. Building on these foundations\, my proposed verification work develops data-race-free checking techniques for ML kernels that rely on subgroup operations and matrix operations. Together\, these projects aim to clarify the execution guarantees that optimized GPU kernels can rely on and to support portable GPU programming systems whose performance and correctness can be reasoned about across diverse hardware. \nEvent Host: Zheyuan Chen\, Ph.D. Student\, Computer Science & Engineering \nAdvisor: Tyler Sorensen \nZoom: https://ucsc.zoom.us/j/92175288480?pwd=jGajtqerVbKuW1FPNr3awqOYoxATsp.1&jst=3 \nPasscode: 693354
URL:https://events.ucsc.edu/event/chen-z-cse-gpu-subgroup-semantics-for-portable-high-performance-kernels/
LOCATION:Engineering 2\, Engineering 2 1156 High Street\, Santa Cruz\, CA\, 95064
CATEGORIES:Ph.D. Presentations
ATTACH;FMTTYPE=image/jpeg:https://events.ucsc.edu/wp-content/uploads/2026/04/ph.d.-presentation-graphic-option-1.jpg
GEO:37.0009723;-122.0632371
X-APPLE-STRUCTURED-LOCATION;VALUE=URI;X-ADDRESS=Engineering 2 Engineering 2 1156 High Street Santa Cruz CA 95064;X-APPLE-RADIUS=500;X-TITLE=Engineering 2 1156 High Street:geo:37.0009723,-122.0632371
END:VEVENT
END:VCALENDAR