help > RE: HPC - SLURM - Parralellisation with conn
Jan 12, 2021  10:01 AM | sophieb
RE: HPC - SLURM - Parralellisation with conn
Dear Alfonso,
I did the two things you suggested and tested several scenarios, please see below.
Could you let me know what I should do next to get where the bug is coming from?
Thanks a lot,
Sophie

-------------------------

Everything was run with config -t 10:00:00 -m 8Gb


####I changed the preferences as suggested on the png#####
1) Running my full .m supposed to create the .mat file and run it in conn, with nsubjects=1015/ncore=100;

matlab -nodesktop -nojvm -batch 'run LesionNetworkMappingSoso_SCITAS_17122020v5.m'
after waiting 40min, nothing was happening, no jobs were submitted. (I imagine that if I had waited more, I would have had the bus error below)


2) Creating the .mat manually while debugging and then, run conn_batch(BATCH), with nsubjects=1015/ncore=100;

matlab -nodesktop -nojvm -batch 'run LesionNetworkMappingSoso_SCITAS_17122020v6.m'
parallelization settings loaded from /home/betka/Script_SCITAS/conn/conn_jobmanager.mat
and, after 40min, nothing is submitted. (I imagine that if I had waited more, I would have had the bus error below)





####Then I changed the conn.m (while keeping the preferences as on the png)####
1) Running my full .m supposed to create the .mat file and run it in conn, with nsubjects=1015/ncore=100;

matlab -nodesktop -nojvm -batch 'run LesionNetworkMappingSoso_SCITAS_17122020v5.m'
Warning: Directory already exists.
> In LesionNetworkMappingSoso_SCITAS_17122020v5 (line 11)
In run (line 91)

and after sometimes:
Bus error (core dumped)


2) Creating the .mat manually while debugging and then, run conn_batch(BATCH), with nsubjects=1015/ncore=100;

matlab -nodesktop -nojvm -batch 'run LesionNetworkMappingSoso_SCITAS_17122020v6.m'
parallelization settings loaded from /home/betka/Script_SCITAS/conn/conn_jobmanager.mat

then after a long time:
Bus error (core dumped)


3) As a sanity check I tried to use the same .m as 1) but with nsubjects=10/ncore=100; this works perfectly.

Threaded View

TitleAuthorDate
sophieb Dec 18, 2020
sophieb Jan 15, 2021
sophieb Jan 14, 2021
sophieb Jan 13, 2021
sophieb Jan 11, 2021
Alfonso Nieto-Castanon Jan 11, 2021
Alfonso Nieto-Castanon Jan 11, 2021
RE: HPC - SLURM - Parralellisation with conn
sophieb Jan 12, 2021
Alfonso Nieto-Castanon Jan 12, 2021
sophieb Jan 12, 2021
Alfonso Nieto-Castanon Jan 26, 2021
sophieb Jan 12, 2021
sophieb Jan 11, 2021
sat2020 Dec 18, 2020
Alfonso Nieto-Castanon Dec 18, 2020
sophieb Dec 18, 2020
Alfonso Nieto-Castanon Dec 18, 2020
sophieb Dec 18, 2020
Alfonso Nieto-Castanon Dec 18, 2020
sophieb Dec 19, 2020
Alfonso Nieto-Castanon Dec 19, 2020
sophieb Dec 21, 2020
Alfonso Nieto-Castanon Dec 21, 2020
sophieb Jan 2, 2021
sophieb Jan 8, 2021
sophieb Dec 22, 2020