Skip to content

Commit 21fdb68

Browse files
committed
Merge branch 'dev' into release
2 parents 2fa163a + 17b1899 commit 21fdb68

File tree

2 files changed

+14
-2
lines changed

2 files changed

+14
-2
lines changed

README.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -94,7 +94,7 @@ the apps in from this collection as well as just on your own, with your own apps
9494
source ./gpu-simulator/setup_environment.sh
9595
make -j -C ./gpu-simulator/
9696
```
97-
This will prodice an executable in:
97+
This will produce an executable in:
9898
```bash
9999
./gpu-simulator/bin/release/accel-sim.out
100100
```
@@ -114,7 +114,8 @@ the apps in from this collection as well as just on your own, with your own apps
114114
115115
To understand what is going on and how to just run the simulator in isolation without the framework, read [this](https://github.com/accel-sim/accel-sim-framework/tree/dev/util/job_launching/README.md).
116116
117-
3. **Accel-Sim Correlator**: A tool that matches, plots and correlates statistics from the performance model with real hardware statistics generated by profiling tools. To use the correlator, you must first generate hardware output and simulation statistics. To generate output from the GPU, use the scripts in ./util/correlation. For example, to generate the profiler numbers for the short-running apps in our running example, do the following:
117+
3. **Accel-Sim Correlator**: A tool that matches, plots and correlates statistics from the performance model with real hardware statistics generated by profiling tools. To use the correlator, you must first generate hardware output and simulation statistics. To generate output from the GPU, use the scripts in [./util/hw_stats](./util/hw_stats).
118+
For example, to generate the profiler numbers for the short-running apps in our running example, do the following:
118119
Note that this step assumes you have already built the apps using the instructions from (1).
119120
```bash
120121
./util/hw_stats/run_hw.py -B rodinia_2.0-ft

gpu-simulator/README.md

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
# Accel-Sim Trace-Driven Front-end
2+
3+
![Accel-Sim Class Overview](https://accel-sim.github.io/assets/img/accel-sim-class.png)
4+
5+
The Accel-Sim's performance model relies on GPGPU-Sim 4.0 Performance mopdel. We created new classes with "exec_" and "trace_" prefix that are inherited from gpgpu-sim performance model classes, then we moved some function implementations from the performance model to the new inherited classes using virtual functions. These functions are the ones that differ from exec-driven mode versus trace-driven mode. For example, when gpgpu-sim call the function "get_next_inst()", the exec_shader_core_ctx implementation will get the next instruction from the functional model, whereas the trace_shader_core_ctx will get the next inst from the traces.
6+
The blue blocks in the image are maintained in GPGPU-Sim 4.0 repo [here](https://github.com/gpgpu-sim/gpgpu-sim_distribution), whereas the green blocks are maintained by accel-sim in this repo.
7+
8+
For each new GPU generation, we have to crease ISA_def file that specifies the SASS instructions. For now, we have created the ISA_def files for NVIDIA's Kepler, Pascal, Turing and Volta generations. Please see the directory [./trace-driven/ISA_Def](./trace-driven/ISA_Def).
9+
We were able to generate these files using the NVIDIA's CUDA Binary Utilities documentation from [here](https://docs.nvidia.com/cuda/cuda-binary-utilities/index.html#instruction-set-ref).
10+
11+
For more info on the Accel-Sim front-end and how to compile, please see "Accel-Sim SASS Frontend" entry in the main read-me page [here](https://github.com/accel-sim/accel-sim-framework/blob/dev/README.md).

0 commit comments

Comments
 (0)