/trunk/src Developers: qymeng

Login name:
qymeng
Total Commits:
161 (2.3%)
Lines of Code:
1,943 (2.1%)
Most Recent Commit:
2013-10-24 15:31
Tweet this:
Tweet this

Activity by Clock Time

Activity by Hour of Day for qymeng

Activity by Day of Week for qymeng

Activity in Directories

Directory Changes Lines of Code Lines per Change
Totals 161 (100.0%) 1943 (100.0%) 12.0
CCA/Components/Schedulers/ 73 (45.3%) 914 (47.0%) 12.5
Core/Grid/ 10 (6.2%) 232 (11.9%) 23.2
CCA/Components/Examples/ 12 (7.5%) 170 (8.7%) 14.1
Core/Geometry/ 3 (1.9%) 142 (7.3%) 47.3
Core/Grid/Variables/ 5 (3.1%) 139 (7.2%) 27.8
CCA/Components/SimulationController/ 12 (7.5%) 132 (6.8%) 11.0
CCA/Components/Models/Radiation/RMCRT/ 13 (8.1%) 46 (2.4%) 3.5
CCA/Components/DataArchiver/ 6 (3.7%) 37 (1.9%) 6.1
CCA/Ports/ 5 (3.1%) 35 (1.8%) 7.0
include/sci_defs/ 3 (1.9%) 33 (1.7%) 11.0
/ 6 (3.7%) 33 (1.7%) 5.5
Core/Parallel/ 7 (4.3%) 19 (1.0%) 2.7
include/ 2 (1.2%) 11 (0.6%) 5.5
StandAlone/ 2 (1.2%) 0 (0.0%) 0.0
CCA/Components/Parent/ 2 (1.2%) 0 (0.0%) 0.0

Activity of qymeng

Most Recent Commits

qymeng 2013-10-24 15:31 Rev.: 51147

schedule ray CPU task if -gpu argument not present, instead of depends on HAVE_CUDA macro.

17 lines of code changed in 5 files:

  • CCA/Components/Models/Radiation/RMCRT: Ray.cc (+7 -6), Ray.h (-2), RayGPU.cc (+7 -2), sub.mk (new)
qymeng 2013-10-24 13:18 Rev.: 51143

Add support for post GPU kernel processing callback.
A GPU task will be called multipule times and each time an event argument will be passed.
The task function can then use the event to decide what to run when
1) Task::CPU task is scheduled to run on CPU
2) Task::GPU task is scheduled to run on GPU
3) Task::postGPU task is finished execution on GPU, DW has been copied back.

CPU only task interface is not unchanged.

282 lines of code changed in 11 files:

  • CCA/Components/Examples: UnifiedSchedulerTest.cc (+90 -94), UnifiedSchedulerTest.h (+5 -9)
  • CCA/Components/Models/Radiation/RMCRT: Ray.cc (+2 -2), Ray.h (+2 -1), RayGPU.cc (+4 -1)
  • CCA/Components/Schedulers: DetailedTasks.cc (+5 -4), DetailedTasks.h (+1 -1), UnifiedScheduler.cc (+24 -24), UnifiedScheduler.h (+1 -1)
  • Core/Grid: Task.cc (+5 -22), Task.h (+143 -190)
qymeng 2013-10-22 14:11 Rev.: 51130

fix build on blue waters due to new mpi libary changes.

0 lines of code changed in 2 files:

  • StandAlone: sus.cc (changed)
qymeng 2013-10-17 11:20 Rev.: 51110

fix a bug in find_sendset from last commit.

0 lines of code changed in 2 files:

  • CCA/Components/Schedulers: SendState.cc (new)
qymeng 2013-10-15 08:55 Rev.: 51087

fix a particle sending state thread safety issue.

16 lines of code changed in 3 files:

  • CCA/Components/Schedulers: SendState.cc (+15 -2), SendState.h (+1)
qymeng 2013-10-14 10:01 Rev.: 51071

default constructor for GPU vector

8 lines of code changed in 1 file:

  • Core/Geometry: GPUVector.h (+8 -4)
qymeng 2013-10-11 16:11 Rev.: 51069

Use derived class instead of union to support dimensional index.
This has been verfied to produce the same assumbly code as the orignal struct when using dimensional index.

40 lines of code changed in 1 file:

  • Core/Geometry: GPUVector.h (+40 -64)
qymeng 2013-10-04 15:45 Rev.: 51029

add new union type Int3 uInt3 Double3 Float3, so that it can be accessed by dimension index.

95 lines of code changed in 2 files:

  • CCA/Components/Examples: UnifiedSchedulerTestKernel.cu (+1)
  • Core/Geometry: GPUVector.h (+94)
qymeng 2013-10-04 11:11 Rev.: 51023

add const GPU Grid Variable support

70 lines of code changed in 8 files:

  • CCA/Components/Examples: UnifiedSchedulerTestKernel.cu (+2 -2)
  • CCA/Components/Models/Radiation/RMCRT: RayGPU.cuh (+3 -3), RayGPUKernel.cu (+10 -10)
  • CCA/Components/Schedulers: GPUDataWarehouse.cu (+28 -24), GPUDataWarehouse.h (+3 -2), UnifiedScheduler.cc (+10 -3), UnifiedScheduler.h (+1 -1)
  • Core/Grid/Variables: GPUGridVariable.h (+13 -13)
qymeng 2013-09-13 13:49 Rev.: 50937

use d_allmpi to check if running under MPI

0 lines of code changed in 2 files:

  • Core/Parallel: ProcessorGroup.cc (new)
qymeng 2013-09-11 23:25 Rev.: 50931

Do not call MPI_Comm_dup if not running under MPI/threads.

1 lines of code changed in 1 file:

  • Core/Parallel: ProcessorGroup.cc (+1)
qymeng 2013-09-11 19:42 Rev.: 50930

Allocate MPI communicators based on taskgraph.
This fixed a hung when there are large number of reduction tasks in the taskgraph.

18 lines of code changed in 4 files:

  • CCA/Components/Schedulers: TaskGraph.cc (+1 -1)
  • Core/Parallel: ProcessorGroup.cc (+13 -14), ProcessorGroup.h (new)
qymeng 2013-09-05 09:16 Rev.: 50910

add new operator to GPUGridVariable.
GPUGridVaraible can be accessed as var(x,y,z)

28 lines of code changed in 3 files:

  • CCA/Components/Examples: UnifiedSchedulerTestKernel.cu (+7 -7)
  • Core/Grid/Variables: GPUGridVariable.h (+8)
  • include/sci_defs: cuda_testdefs.h.in (+13)
qymeng 2013-09-04 20:34 Rev.: 50909

Added array boundary checking for GPU varaible.
Fixed multi-GPU issue.

82 lines of code changed in 6 files:

  • CCA/Components/Examples: UnifiedSchedulerTestKernel.cu (+3 -12)
  • CCA/Components/Schedulers: GPUDataWarehouse.cu (+41 -15), GPUDataWarehouse.h (+1), UnifiedScheduler.cc (+16 -6)
  • Core/Grid/Variables: GPUGridVariable.h (+6 -4)
  • include/sci_defs: cuda_testdefs.h.in (+15 -1)
qymeng 2013-09-04 16:09 Rev.: 50908

detect and report GPU task kernel error

56 lines of code changed in 5 files:

  • CCA/Components/Schedulers: DetailedTasks.cc (+14 -3), GPUDataWarehouse.cu (+5 -2), OnDemandDataWarehouse.cc (+1 -3), UnifiedScheduler.cc (+36 -16), UnifiedScheduler.h (-2)
qymeng 2013-09-04 11:46 Rev.: 50906

better multi-GPU debug output

19 lines of code changed in 3 files:

  • CCA/Components/Schedulers: GPUDataWarehouse.cu (+16 -10), GPUDataWarehouse.h (+2 -1), OnDemandDataWarehouse.cc (+1 -2)
qymeng 2013-09-03 20:23 Rev.: 50903

Implemented a lightweight version of GPUDataWarehouse, GPUGridVariable and GPUArray3D inside GPU.
This commit included a new set of interfaces for better supporting GPU task.
See example in Examples/UnifiedSchedulerTest

691 lines of code changed in 27 files:

  • CCA/Components/Examples: RMCRT_Test.cc (+1 -1), UnifiedSchedulerTest.cc (+8 -24), UnifiedSchedulerTest.h (+13 -17), UnifiedSchedulerTestKernel.cu (+38 -32), sub.mk (+2 -2)
  • CCA/Components/Models/Radiation/RMCRT: Ray.cc (+2 -2), Ray.h (+3 -3), sub.mk (+3 -3)
  • CCA/Components/Parent: ComponentFactory.cc (new)
  • CCA/Components/Schedulers: DetailedTasks.cc (+8 -85), DetailedTasks.h (+5 -23), GPUDataWarehouse.cu (+204), GPUDataWarehouse.h (+71), OnDemandDataWarehouse.cc (+22), SchedulerFactory.cc (+1 -16), UnifiedScheduler.cc (+148 -413), UnifiedScheduler.h (+7 -46), sub.mk (new)
  • CCA/Ports: DataWarehouse.h (+17 -1)
  • Core/Grid: Task.cc (+1 -1), Task.h (+27 -27)
  • Core/Grid/Variables: GPUGridVariable.h (+104)
  • include/sci_defs: cuda_testdefs.h.in (+5)
qymeng 2013-08-25 21:31 Rev.: 50861

support C++11 style hash map

42 lines of code changed in 5 files:

  • configure (+19), configure.ac (+6)
  • CCA/Components/Schedulers: DWDatabase.h (+6 -2)
  • include: sci_hash_map.h (new)
qymeng 2013-04-16 09:38 Rev.: 50130

skip first timestep for checking output/checkpoint interval changes.

4 lines of code changed in 1 file:

  • CCA/Components/SimulationController: AMRSimulationController.cc (+4 -3)
qymeng 2013-04-15 13:05 Rev.: 50127

For debug stream sendtiming recvtiming taskorder, only output on where rank id is half of the world size.

47 lines of code changed in 9 files:

  • CCA/Components/Schedulers: CommRecMPI.cc (new), DynamicMPIScheduler.cc (new), MPIScheduler.cc (+1), ThreadedMPIScheduler2.cc (new), UnifiedScheduler.cc (+20 -4)

(25 more)

Generated by StatSVN 0.7.0