Updated method names. Should have been in previous commit.
3 lines of code changed in 1 file:
added
existsLevelDB() // checks for variable in levelDB
removeLevelDB() // flags variable for removal
clear()
- added loop over levelDB items and remove them.
89 lines of code changed in 2 files:
Throw an exception if the user tries to use the -gpu option without the Unified scheduler.
7 lines of code changed in 1 file:
postH2DCopies(), preallocateDeviceMemory(), postD2HCopies()
- light formatting,
- cleaned up the gpu_stats output so it's easier to follow.
89 lines of code changed in 1 file:
added print() method, which dumps both varDB and levelDB.
36 lines of code changed in 2 files:
Added level index and __LINE__ to DebugStream output.
12 lines of code changed in 2 files:
Added unifiedMemory class and GPUString class which are used to pass the varLabel struct into
the kernel. The varLabel struct isn't plugged in yet.
133 lines of code changed in 3 files:
DebugStream GPU related names are consistent with what's specified in environmentalFlags.txt
2 lines of code changed in 1 file:
Output details on the GPU being used. On computers with multiple
GPUs this is useful. For example:
GPU Device 0: "GeForce GTX TITAN Black" with compute capability 3.5
8 lines of code changed in 1 file:
Need to initialize parallel infrastructure (even if not running in parallel).
0 lines of code changed in 2 files:
Fixed GPU bulletproofing to allow the 2Level RMCRT algorithm.
This would have been caught with GPU RT coverage.....
17 lines of code changed in 2 files:
Ray.cc
- mimic the debugging output in the GPU kernel code for direct comparison
- simplified the jumping levels conditional statement.
- Don't jump levels at the edge of the computational domain.
RayGPU*
- mimic the debugging output in the CPU code for direct comparison
- simplified the jumping levels conditional statement.
- throw exception if number of levels > d_MAXLEVELS
141 lines of code changed in 4 files:
Cleaned up some commented out trial balloons. Added some logic to keep
fine patches from disappearing, even if they don't have refined particles
on them.
37 lines of code changed in 1 file:
Some minor refactoring and debug stream improvements while hunting down the multi-threaded Hypre bug.
13 lines of code changed in 6 files:
Increased max variableDB and levelDB items. Needed for RMCRT DO 3Levels
2 lines of code changed in 1 file:
Don't include extra cells when looking to see if a cell has a finer cell above
it.
1 lines of code changed in 1 file:
Refactor mispelled method name.
4 lines of code changed in 2 files:
Swith order of execution. Final property models should have occurred
second.
4 lines of code changed in 1 file:
Avoid divide by zero.
5 lines of code changed in 1 file:
Added ability to monitor a vector of dbgCells when debugging.
45 lines of code changed in 4 files:
More modification to the heat flux model. Now integrating total heat
into the surface then renomalizing by area to get a total flux into the
wall.
18 lines of code changed in 1 file:
Update to newest BOOST flags. Add Mustang
20 lines of code changed in 2 files:
Compute the exposed surface area for the heat flux and store it.
32 lines of code changed in 2 files:
added bulletproofing for GPU code:
- throw exception if halo > 0
- throw exception if ROI != patch_based
clean out some sanity checks debugging code.
21 lines of code changed in 3 files:
Remove unused variable.
0 lines of code changed in 1 file:
Abstract out GPUDataOnion functionality into device function (updateSumI_MLDevice<T>).
221 lines of code changed in 1 file:
Report the level that detailed task is running on.
1 lines of code changed in 1 file:
output run time in sec.milliseconds to the stats file
0 lines of code changed in 2 files:
Added initialization model. This model makes radiation comparisons on the first time-step.
16 lines of code changed in 5 files:
added -f option to /bin/csh command
This ignores the user's .cshrc file.
0 lines of code changed in 4 files:
Delay deletion of Ray object.
There seems to be a race condition surrounding the deletion of the SimulationInterface object (UintahParallelComponent) in sus.cc. When this is done, it appears that for the RMCRT_Test destructor may evaluate d_RMCRT as non-null before it is deleted. This allows a second attempt to delete d_RMCRT, causing a double delete.
NOTE: credit goes to John Holmen for finding this one. Thanks John.
4 lines of code changed in 1 file:
Previous logic was broken for multi-patch
19 lines of code changed in 1 file:
Adding a total heat flux variable that is the sum over all components.
Placing this value in the wall cell adjacent to the flow.
25 lines of code changed in 1 file:
Consolidated sci_debug stream code for "put" calls into printDebuggingPutInfo() function.
Added several calls to printDebuggingPutInfo()
This eases the swirling head symptoms.
39 lines of code changed in 2 files:
Stop kernel execution once printError calls are made.
At this point we know it's fatal and the reason has been reported.
6 lines of code changed in 1 file:
Fix bug in GPUDW error reporting. Was reporting mixture of varDB and LevelDB info in levelDB error reporting.
Modify/enhance other GPUDW reporting.
23 lines of code changed in 3 files:
Added new DOSweep example component for testing/development purposes.
334 lines of code changed in 4 files:
* Refactoring on var and level databases.
* Better reporting on capacity, etc.
55 lines of code changed in 2 files:
Eliminate duplicate levelDB items in GPUDW.
8 lines of code changed in 1 file:
printGetError() & printGetLevelError()
output matlIndx, patchID and levelIndex about all available labels
4 lines of code changed in 1 file:
For clarity global variable name changes
levelID -> levelIndx
matlID -> matlIndx
94 lines of code changed in 2 files:
added printGetLevelError() a method for printing out errors related to the levelDB .
27 lines of code changed in 2 files:
* Further GPUDW thread safety.
* Check return values of CUDA calls.
* Change private methods to return bool - status of CUDA calls made within fucntion body.
214 lines of code changed in 2 files:
Get the A in AMR working. Fixed a small bug in the calculation of the particle
size as particles move between levels.
Contains a fair bit of debugging output, but I'm the only one running it, so...
265 lines of code changed in 2 files:
Adding a CC variable to mark cells for refinement.
4 lines of code changed in 2 files:
Added matlIndx to the GPUDW level database. Calling methods need to include a matlIndx.
24 lines of code changed in 4 files:
SCI_DEBUG output: more descriptive and improved formatting.
26 lines of code changed in 2 files:
previous commit had a compile bug. Fixed.
18 lines of code changed in 2 files:
1 patch with a ray in 1 cell now works GPU:DataOnion results == CPU:DataOnion
Mirroring debugging printf statements in GPU & CPU code.
31 lines of code changed in 2 files:
changed DEBUG_MT -> FIXED_RANDOM_NUM to be consistent with GPU RMCRT code.
3 lines of code changed in 1 file:
computeExtents()
- using >getExtraCellLowIndex() & getExtraCellHighIndex() instead of getCellLowIndex & getCellHighIndex()
This fixes the halo[0,0,0] bug.
Cleaned up the debugging output so it's easier to follow. No using d_dbgCell for
a tracking cell instead of hardwired IntVector.
37 lines of code changed in 3 files:
Report when using GPUs
9 lines of code changed in 1 file:
Fix race conditions in the GPU DataWarehouse (varDB and levelDB)
This was unpleasent to track down.
GPU DataOnion now runs fully distributed with arbitrary numbers of patches.
30 lines of code changed in 3 files:
Comment out the Ray Tracing chunk of code for Alan.
Wrapped conditionals with multiple tests in (). The behavior of CUDA differs from C++
Added more debugging output.
77 lines of code changed in 3 files:
Incremental progress. One z slice will run through all
the pieces of code. There are "GPU OUT_OF_BOUND" errors.
66 lines of code changed in 2 files:
Moved the gut of updateSumI_ML into the main kernel. Hack to get it up and running.
updated ToDo list.
It compiles and doesn't crash.
238 lines of code changed in 2 files:
Changed name of global variable levels -> d_levels,
updateSumI_MLDevice()
Now Using d_levels
Added debugging code
There are problems when accessing the grid vars.
85 lines of code changed in 2 files:
Revert hypre configure changes.
18 lines of code changed in 2 files:
More pieces filled in.
getCellPosition() and mapCellToCoarser() are now working.
Moved operators to the top of the .cuh file. Now they can be used downstream in that file.
219 lines of code changed in 3 files:
#if an additional line until updateSumI_MLDevice< T > is working.
#if some debugging code.
8 lines of code changed in 1 file:
Revert "Add a simple memory allocation check for hypre for library determination."
This reverts commit e1bbba9fb08718314244a92876afce8eed923060.
1 lines of code changed in 2 files:
Correct access to fine level patches in GPUDW. Add enhanced debug output.
43 lines of code changed in 1 file:
GPUVector.h
- removed my previous commit. It's compilicates debugging in DDT.
Passing fineLevel_ROI into the kernel
Retreiving fine level data from GPU_dw is not working as it should....
74 lines of code changed in 5 files:
Add a simple memory allocation check for hypre for library determination.
7 lines of code changed in 2 files:
Reverse the order in which the hypre libraries are discovered.
17 lines of code changed in 2 files:
Strip out 2-level hard-wiring, now using levelParams.
Clean up GPUDataWarehouse debugging info.
8 lines of code changed in 4 files:
Copy levelParams array to device constant cache. levelParams for GPU:DataOnion is now working.
-- Todd is even happier.
This commit is a point of reference before yanking the 2-level hard-wiring.
68 lines of code changed in 2 files:
added more pieces to the puzzle.
This compiles.
38 lines of code changed in 2 files:
Added convinence methods for GPUVector. They should be moved upstream.
updateSumI_MLDevice()
- fill in the easy pieces.
minor formatting.
283 lines of code changed in 3 files:
Added comment and convience methods for gpuVector & gpuIntVector
5 lines of code changed in 2 files:
Incremental commit. Removed some of the #if 0s.
Most of the pieces are working. UpdateSumI_MLDevice still needs to be filled in.
22 lines of code changed in 1 file:
Increase the limit on MAX_LVITEM.
1 lines of code changed in 1 file:
Fix issues with GPUDataWarehouse levelDB.
GPU DataOnion coarse levels are now handled correctly.
- Todd is happy
14 lines of code changed in 2 files:
Use levelDB in GPUDW for per level vars. Provide get, put, allocateAndPut interface for this via GPUDW.
Also remove code related to cudaHostRegister and pinned pointers in UnifiedScheduler. This won't be used and ultimately will be replaced with contiguous buffer copies.
174 lines of code changed in 4 files:
Add correct multi-level GPUDW support.
This also fixes faulty level->findCellIndexRange() and subsequent dw->getRegion() calls on coarse levels.
17 lines of code changed in 1 file:
Revert commit r53338
1 lines of code changed in 1 file:
Removed clutter in code by Changing T*T*T*T to pow(T,4). Will break all radiation tests.
2 lines of code changed in 1 file:
added updateSumI_MLDevice() which is just a copy ofupdateSumI_Device() at this point.
Heavy construction zone with little working.
275 lines of code changed in 2 files:
Put GPUGridVariables into GPUDW under the correct levelID.
1 lines of code changed in 1 file:
Fix CQMOM inversion for N-D internal coordinates, update to input files for these cases and decrease total quadrature nodes
3 lines of code changed in 1 file:
Added the utilities
isThread0
printThread()
printBlock()
printGetError()
- added level to the print statement. Only output with thread0, this helps but
there are still issues.
48 lines of code changed in 4 files:
The most shameful commit I've ever made.
- Pass each of the level variables, level by level into
the kernel. I need to move foreward and this grossness works. I spent 10X too much time
trying to figure out how to pass an array of struct into a kernel.
It crashes trying to get abskg from the dw.
264 lines of code changed in 3 files:
Pair the density label in the scheduler.
8 lines of code changed in 3 files:
* Explicitly identify the number of processors
1 lines of code changed in 1 file:
* Cleanup
9 lines of code changed in 2 files:
Added parametric study to quantify divQ error vs fine patch layout
127 lines of code changed in 3 files:
fix bug when gas-particle coupling is turned on. The bug affected the pressure projection and resulted in mass addition to the system. This will change RT answers.
3 lines of code changed in 1 file: