
Lua and Torch issues with GPU

I am trying to run the Lua-based program from OpenNMT. I followed the procedure described here: http://forum.opennmt.net/t/text-summarization-on-gigaword-and-rouge-scoring/85

I have used the command:

 th train.lua -data textsum-train.t7 -save_model textsum1 -gpuid 0 1 2 3 4 5 6 7

I am using 8 GPUs, but the process is still very slow, as if it were running on the CPU. Please let me know how I can optimize the GPU usage.
Here are the GPU usage stats:
[Image: GPUSTATS]

How can I make the process run faster and use the GPUs fully? I have 11 GB available, but the process consumes only 2 GB or less, so training is very slow.

Jaffer Wilson asked Nov 24 '25 11:11



1 Answer

According to the OpenNMT documentation, you need to remove the 0 right after the -gpuid option: 0 stands for the CPU and GPU identifiers are 1-indexed, so you are effectively reducing the training speed to that of a CPU-powered run. The documentation says:

To use data parallelism, assign a list of GPU identifiers to the -gpuid option. For example:

th train.lua -data data/demo-train.t7 -save_model demo -gpuid 1 2 4

will use the first, the second and the fourth GPU of the machine as returned by the CUDA API.
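
Applied to the command from the question, this might look like the sketch below. It assumes that all eight GPUs are visible to CUDA and that identifiers 1 through 8 map to them; `NUM_GPUS`, `GPU_IDS` and `CMD` are illustrative variable names, not OpenNMT options. The snippet only builds and prints the command, so nothing is trained until you run it yourself.

```shell
#!/bin/sh
# Assumption: eight GPUs are visible to CUDA on this machine.
NUM_GPUS=8

# Build the 1-indexed identifier list "1 2 ... 8" (0 would select the CPU).
GPU_IDS=$(seq 1 "$NUM_GPUS" | tr '\n' ' ' | sed 's/ $//')

# Same data and model names as in the question, only -gpuid changes.
CMD="th train.lua -data textsum-train.t7 -save_model textsum1 -gpuid $GPU_IDS"

# Print the command for inspection instead of executing it.
echo "$CMD"
```

Once training is started with the printed command, watching `nvidia-smi` in a second terminal should show whether all devices are actually busy.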

Wiktor Stribiżew answered Nov 27 '25 03:11



