Here is what Cadence had to say:
Spectre already has an option called "APS", which gives enhanced performance without sacrificing accuracy and can take advantage of multi-core machines, which can give major performance increases overall. It also has a capability (early access in MMSIM10.1, enhanced in MMSIM11.1) that allows you to spread the parallel work across multiple separate machines, which can help in certain situations (contact Cadence to find out more about that).
I doubt going on the course you describe would help, because it will focus on how to write programs that take advantage of multi-core architectures. There is not much you can do with existing executables: you would need the source code and also an understanding of how to make the algorithms parallelizable (which is by far the hardest part compared with the mechanics of using any API). That is the part that Cadence has done for you.
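To make the point about "mechanics of the API" versus "making the algorithm parallelizable" concrete, here is a toy Python sketch. It is only an illustration and has nothing to do with how Spectre or APS is implemented; the function names and the arithmetic are made up. The first function runs independent jobs, where the parallel API is a single call. The second is a time-stepping style loop where each step needs the previous result, so it cannot simply be handed to the same API.

```python
from concurrent.futures import ThreadPoolExecutor


def evaluate_point(gain: float) -> float:
    """Hypothetical independent job, e.g. one point of a parameter sweep."""
    return gain * gain + 1.0


def run_independent(gains):
    # Independent jobs: the API mechanics are one call to pool.map().
    # (Threads are used here only to show the structure, not to measure speed.)
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(evaluate_point, gains))


def run_time_steps(x0: float, steps: int) -> float:
    # Each step depends on the previous value of x, so the iterations
    # cannot just be mapped over a pool. Parallelizing this kind of loop
    # means restructuring the algorithm itself, which is the hard part.
    x = x0
    for _ in range(steps):
        x = 0.5 * x + 1.0
    return x


if __name__ == "__main__":
    print(run_independent([1.0, 2.0, 3.0]))
    print(run_time_steps(0.0, 3))
```

The first case is what a course on multi-core programming teaches well. The second case needs access to the source and a redesign of the algorithm, which is not possible with an existing executable.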
Anybody got anything to contribute?