pub struct TrackedCudaSlice<T: DeviceRepr> { /* private fields */ }Expand description
A CudaSlice that automatically updates GpuMemoryManager
allocation tracking on drop. Inner slice is wrapped in
ManuallyDrop so the [Backing] enum can choose between
cudarc-side free (legacy) and runtime-side deallocate (migrated)
without producing a double-free.
Implementations§
Source§impl<T: DeviceRepr> TrackedCudaSlice<T>
impl<T: DeviceRepr> TrackedCudaSlice<T>
pub fn device_ptr(&self) -> &CUdeviceptr
pub fn device_ptr_value(&self) -> CUdeviceptr
Sourcepub fn memory_manager_ptr_value(&self) -> usize
pub fn memory_manager_ptr_value(&self) -> usize
Stable address of the memory manager that owns this allocation.
Sourcepub fn runtime_block(&self) -> Option<&DeviceBlock>
pub fn runtime_block(&self) -> Option<&DeviceBlock>
Borrow the underlying DeviceBlock for runtime-backed
allocations. Returns None for legacy cudarc-backed
slices ([Backing::Cudarc]) — those are not tracked by
the v0.6 device runtime and therefore have no
runtime-side block to record uses against.
Callers (notably crate::launch::LaunchRecorder) use
this to attach cross-stream uses via
crate::device_runtime::XlogDeviceRuntime::record_block_use.
A None return signals that the slice is on the legacy
path and the recorder cannot track its lifetime — callers
must either route the allocation through
GpuMemoryManager::with_runtime or accept that no
cross-stream safety applies to this buffer.
Sourcepub fn into_bytes(self) -> TrackedCudaSlice<u8>
pub fn into_bytes(self) -> TrackedCudaSlice<u8>
Reinterpret this typed allocation as a raw byte allocation.
This is a zero-copy conversion used by XLOG’s columnar
CudaBuffer representation, which stores device memory as
untyped bytes + a schema. The conversion preserves the
underlying [Backing] — runtime-routed slices remain
runtime-routed, legacy cudarc slices remain cudarc-routed —
so deallocation continues to match the original allocator.
Methods from Deref<Target = CudaSlice<T>>§
pub fn stream(&self) -> &Arc<CudaStream>
pub fn stream(&self) -> &Arc<CudaStream>
The stream this object was allocated on and later will be dropped on.
pub fn try_clone(&self) -> Result<CudaSlice<T>, DriverError>
pub fn try_clone(&self) -> Result<CudaSlice<T>, DriverError>
Allocates copy of self and schedules a device to device copy of memory.
pub fn as_view(&self) -> CudaView<'_, T>
pub fn as_view_mut(&mut self) -> CudaViewMut<'_, T>
pub fn slice(&self, bounds: impl RangeBounds<usize>) -> CudaView<'_, T>
pub fn slice(&self, bounds: impl RangeBounds<usize>) -> CudaView<'_, T>
Creates a CudaView at the specified offset from the start of self.
Panics if range.start >= self.len.
§Example
let mut slice = stream.alloc_zeros::<u8>(100).unwrap();
let mut view = slice.slice(0..50);
do_something(&view);Like a normal slice, borrow checking prevents the underlying CudaSlice from being dropped.
let view = {
let mut slice = stream.alloc_zeros::<u8>(100).unwrap();
// cannot return view, since it borrows from slice
slice.slice(0..50)
};
do_something(&view);pub fn try_slice(
&self,
bounds: impl RangeBounds<usize>,
) -> Option<CudaView<'_, T>>
pub fn try_slice( &self, bounds: impl RangeBounds<usize>, ) -> Option<CudaView<'_, T>>
Fallible version of CudaSlice::slice().
pub fn slice_mut(
&mut self,
bounds: impl RangeBounds<usize>,
) -> CudaViewMut<'_, T>
pub fn slice_mut( &mut self, bounds: impl RangeBounds<usize>, ) -> CudaViewMut<'_, T>
Creates a CudaViewMut at the specified offset from the start of self.
Panics if range and 0...self.len() are not overlapping.
§Example
let mut slice = stream.alloc_zeros::<u8>(100).unwrap();
let mut view = slice.slice_mut(0..50);
do_something(&mut view);Like a normal mutable slice, borrow checking prevents the underlying CudaSlice from being dropped.
let mut view = {
let mut slice = stream.alloc_zeros::<u8>(100).unwrap();
// cannot return view, since it borrows from slice
slice.slice_mut(0..50)
};
do_something(&mut view);Like with normal mutable slices, one cannot mutably slice twice into the same CudaSlice:
let mut slice = stream.alloc_zeros::<u8>(100).unwrap();
let mut view1 = slice.slice_mut(0..50);
// cannot borrow twice from slice
let mut view2 = slice.slice_mut(50..100);
do_something(view1, view2);If you need non-overlapping mutable views into a CudaSlice, you can use CudaSlice::split_at_mut().
pub fn try_slice_mut(
&mut self,
bounds: impl RangeBounds<usize>,
) -> Option<CudaViewMut<'_, T>>
pub fn try_slice_mut( &mut self, bounds: impl RangeBounds<usize>, ) -> Option<CudaViewMut<'_, T>>
Fallible version of CudaSlice::slice_mut
pub unsafe fn transmute<S>(&self, len: usize) -> Option<CudaView<'_, S>>
pub unsafe fn transmute<S>(&self, len: usize) -> Option<CudaView<'_, S>>
Reinterprets the slice of memory into a different type. len is the number
of elements of the new type S that are expected. If not enough bytes
are allocated in self for the view, then this returns None.
§Safety
This is unsafe because not the memory for the view may not be a valid interpretation
for the type S.
pub unsafe fn transmute_mut<S>(
&mut self,
len: usize,
) -> Option<CudaViewMut<'_, S>>
pub unsafe fn transmute_mut<S>( &mut self, len: usize, ) -> Option<CudaViewMut<'_, S>>
Reinterprets the slice of memory into a different type. len is the number
of elements of the new type S that are expected. If not enough bytes
are allocated in self for the view, then this returns None.
§Safety
This is unsafe because not the memory for the view may not be a valid interpretation
for the type S.
pub fn split_at(&self, mid: usize) -> (CudaView<'_, T>, CudaView<'_, T>)
pub fn try_split_at(
&self,
mid: usize,
) -> Option<(CudaView<'_, T>, CudaView<'_, T>)>
pub fn try_split_at( &self, mid: usize, ) -> Option<(CudaView<'_, T>, CudaView<'_, T>)>
Fallible version of CudaSlice::split_at. Returns None if mid > self.len.
pub fn split_at_mut(
&mut self,
mid: usize,
) -> (CudaViewMut<'_, T>, CudaViewMut<'_, T>)
pub fn split_at_mut( &mut self, mid: usize, ) -> (CudaViewMut<'_, T>, CudaViewMut<'_, T>)
Splits the CudaSlice into two at the given index, returning two CudaViewMut for the two halves.
Panics if mid > self.len.
This method can be used to create non-overlapping mutable views into a CudaSlice.
let mut slice = stream.alloc_zeros::<u8>(100).unwrap();
// split the slice into two non-overlapping, mutable views
let (mut view1, mut view2) = slice.split_at_mut(50);
do_something(view1, view2);pub fn try_split_at_mut(
&mut self,
mid: usize,
) -> Option<(CudaViewMut<'_, T>, CudaViewMut<'_, T>)>
pub fn try_split_at_mut( &mut self, mid: usize, ) -> Option<(CudaViewMut<'_, T>, CudaViewMut<'_, T>)>
Fallible version of CudaSlice::split_at_mut.
Returns None if mid > self.len.