Directory | Changes | Lines of Code | Lines per Change |
---|---|---|---|
Totals | 218 (100.0%) | 6306 (100.0%) | 28.9 |
/ | 18 (8.3%) | 1502 (23.8%) | 83.4 |
CCA/Components/Schedulers/ | 49 (22.5%) | 1229 (19.5%) | 25.0 |
Core/Grid/ | 7 (3.2%) | 1125 (17.8%) | 160.7 |
Core/Grid/Variables/ | 46 (21.1%) | 622 (9.9%) | 13.5 |
CCA/Components/Arches/ | 18 (8.3%) | 595 (9.4%) | 33.0 |
CCA/Components/Arches/ChemMix/ | 3 (1.4%) | 518 (8.2%) | 172.6 |
Core/Math/ | 6 (2.8%) | 223 (3.5%) | 37.1 |
CCA/Components/Models/Radiation/RMCRT/ | 14 (6.4%) | 203 (3.2%) | 14.5 |
include/sci_defs/ | 8 (3.7%) | 129 (2.0%) | 16.1 |
CCA/Components/Examples/ | 25 (11.5%) | 74 (1.2%) | 2.9 |
CCA/Components/Arches/TransportEqns/ | 2 (0.9%) | 21 (0.3%) | 10.5 |
Core/Parallel/ | 5 (2.3%) | 18 (0.3%) | 3.6 |
CCA/Components/Arches/Radiation/ | 2 (0.9%) | 14 (0.2%) | 7.0 |
CCA/Components/Arches/PropertyModels/ | 2 (0.9%) | 13 (0.2%) | 6.5 |
Core/Geometry/ | 1 (0.5%) | 9 (0.1%) | 9.0 |
CCA/Components/DataArchiver/ | 1 (0.5%) | 3 (0.0%) | 3.0 |
R_Tester/helpers/ | 1 (0.5%) | 2 (0.0%) | 2.0 |
CCA/Ports/ | 1 (0.5%) | 2 (0.0%) | 2.0 |
StandAlone/ | 1 (0.5%) | 1 (0.0%) | 1.0 |
Core/Util/ | 1 (0.5%) | 1 (0.0%) | 1.0 |
CCA/Components/Models/Radiation/ | 2 (0.9%) | 1 (0.0%) | 0.5 |
CCA/Components/MPM/ | 2 (0.9%) | 1 (0.0%) | 0.5 |
StandAlone/tools/puda/ | 2 (0.9%) | 0 (0.0%) | 0.0 |
CCA/Components/Arches/SourceTerms/ | 1 (0.5%) | 0 (0.0%) | 0.0 |
Make sure to only init and clear GPU-DW if actualy using devices. This code was executing when HAVE_CUDA was defined without SUS explicitly getting the -gpu option.
0 lines of code changed in 2 files:
Cleanup preallocateDeviceMemory() using subtype information from GridVariables. Much cleaner.
Fix incorrect position of mutex unlock in device/host copy methods.
0 lines of code changed in 2 files:
Complete support for GPUStencil7.
A cleaner solution would be quite easy with type information support for GPU Variables. This will be the next step.
Also some code cleanup.
224 lines of code changed in 1 file:
Copy the correct number of bytes back from the device in postD2HCopies
Cleanup some comments in Task.h
4 lines of code changed in 2 files:
Fix typo in puda-varsummary code.
0 lines of code changed in 2 files:
Begin GPU ParticleVariable support.
129 lines of code changed in 6 files:
* Fix GPUReductionVariable support; adhere to default material index being -1 for ReductionVariables in general.
* Some code cleanup in Unified scheduler.
96 lines of code changed in 9 files:
Finish support for GPUReductionVariable.
109 lines of code changed in 12 files:
Cleanup some debug output.
5 lines of code changed in 2 files:
* Working support for GPU ReductionVariables.
* Also made OnDemandDataWarehouse aware of "SingleDevice" DebugStream when creating GPU Datawarehouses.
398 lines of code changed in 8 files:
The beginnings of an inheritance hierarchy for GPU Variables.
123 lines of code changed in 3 files:
Add support for GPU Stencil7.
Add additional cuda_testdefs macro for range checking on the GPU when SCI_CUDA_ASSERTION_LEVEL >= 3
79 lines of code changed in 4 files:
Put GPUVector code within SCIRun namespace and add header guard.
Add using namespace directive accordingly to RayGPU kernel and header.
13 lines of code changed in 3 files:
Some cosmetic and debug output changes.
19 lines of code changed in 4 files:
Add Unified Scheduler support for automatic copying of generic GridVariables to the GPU. This support can be easily extended to Stencil7, ParticleVariable, etc with an expanded inheritance heirarchy of "GPUVariables":
GPUVariable
|------GPUGridVariableBase
| |------GPUGridVariable<T>
|------GPUReductionVariableBase
| |------GPUReductionVariable<T>
|------GPUStencil7Base
| |------GPUStencil7<T>
|------GPUParticleVariableBase
| |------GPUParicleVariable<T>
* Need to implement a cleaner way to "getRegion" for RMCRT. This works for now.
* Also some small changes to the UnifedSchedulerTest example.
* Fix typo in RayGPU.cc
402 lines of code changed in 6 files:
Fix linking order of NVCC_LIBS.
Generate new configure script.
2 lines of code changed in 3 files:
Add support for arbitrary Uintah constVariables to return their data sizes (templated) as well as copy their underlying data to a destination buffer for automatic host<-->device copies.
Also make UnifiedScheduler a friend of OnDemandDataWarehouse.
29 lines of code changed in 7 files:
Add support for single GPU usage on multi-GPU nodes. This should ease the pain of debugging in multi-GPU environments.
8 lines of code changed in 1 file:
Add support for arbitrary Uintah Variables to return their data sizes (templated) as well as copy underlying data to a destination buffer.
This functionality will be essential for the Unified Scheduler to genericaly handle automatic copying of any Uintah Variable (of type<T>). Note: most of this functionality analogous to the preparing MPI buffers for Uintah variables.
144 lines of code changed in 21 files:
Fix typo in previously committed CUDA error handling wrapper.
1 lines of code changed in 1 file:
(41 more)