Hi,
I'm a senior GPGPU developer for many years. I've work for many CUDA/OpenCL projects and achieve speedup from 10-43x faster than sequential application. I have many experiences in optimization and multiple GPU programming.
I'd like to work this project for you.
Regards,
Tam Nguyen.