Update README.md
Browse files
README.md
CHANGED
|
@@ -169,6 +169,8 @@ Benchmarking is one of the most important procedures during model acceleration.
|
|
| 169 |
|
| 170 |
The tables below show performance (tokens per second) for different input context sizes across different GPU models and batch sizes:
|
| 171 |
|
|
|
|
|
|
|
| 172 |
**RTX 4090:**
|
| 173 |
|
| 174 |
*Batch Size 1:*
|
|
|
|
| 169 |
|
| 170 |
The tables below show performance (tokens per second) for different input context sizes across different GPU models and batch sizes:
|
| 171 |
|
| 172 |
+
> **Note:** Dash marks (`-`) in the table indicate that the data did not fit on the device.
|
| 173 |
+
|
| 174 |
**RTX 4090:**
|
| 175 |
|
| 176 |
*Batch Size 1:*
|