Apologies for my ignorance, but I’d like to run the couette flow example you have provided on multiple processors but I’m not sure how to go about running the code in parallel rather than in serial. Could you please give me some advice on how to go about doing this? I’ve tried looking through the documentation and forum but I wasn’t able to find anything that I could understand. I doubt it’s as simple as compiling the code and then saying mpirun -np 2 ./couette rather than ./couette. Thanks in advance for your help and patience.