Mpi openmp cuda jobs

Filter

My recent searches
Filter by:
Budget
to
to
to
Type
Skills
Languages
    Job State
    2,043 mpi openmp cuda jobs found, pricing in USD
    Experienced C Programmer 6 days left
    VERIFIED

    I want to ask you if you can help me with this? for the first task(part), I have all the code, i am just not knowing how to execute the code for part b and part d for different number of workers i sent you code for part(a), it includes the sequential version. I have atteched you task(part) 2 also. they are for same I am using clion on MAC OS and runnin GCC/G++ lang and i am running the mpi on t...

    $10 - $30
    $10 - $30
    0 bids

    We run a containerized implementation of our solution. Our servers are Ubuntu 18/cli, frontend is vuejs with typescript and vuetify and our backend is built on python with fastapi. You will be working with the backend for this project. Our application is using advanced market data for digital assets/finance pulled from our own API and stored in local Redis. We run a series of user specified range...

    $200 - $300
    $200 - $300
    0 bids

    C++ developer require with experience in openmp, Mpi and cuda. More detail on chat.

    $17 (Avg Bid)
    $17 Avg Bid
    2 bids

    The [login to view URL] uses g++ compiler (from mingw-w64) and runs 38+ Million iterations per second on my CPU. The [login to view URL] code in [login to view URL], compiled with Visual Studio 2019 and CUDA Toolkit 11.1 returns only about 1600 iterations per second on the GPU. Can someone help me update so I can repeat the string as fast as possible on the GPU using the CUDA Toolkit 11.1? I wa...

    $205 (Avg Bid)
    $205 Avg Bid
    6 bids

    at least 2- 3 months experience. CUDA as architecture requires C, C++, so we are talking about C, C++ development in general. Basically this is request for Junior type of developers. Trial cooperation period: 3 months. If everything goes well, it will be long-term cooperation, not only in this techological area. So, 3 months, are trial for our companies. CUDA (Compute Unified Device Architec...

    $8 / hr (Avg Bid)
    $8 / hr Avg Bid
    2 bids

    Use OpenMP to calculate the area of Mandelbrot set in C

    $81 (Avg Bid)
    $81 Avg Bid
    4 bids

    Hello, I have a program which creates hashes. And from this hashes I need to create secp256k1 publickeys. The publickeys needs to be created on the cuda device, from the created hashes. I found a repo for cuda-ECC operations, it`s in the ECC folder , with an example how to use it, but due to lack of my coding knowledge I can`t do it by [login to view URL] now it prints the hashes, but I need it to...

    $30 (Avg Bid)
    $30 Avg Bid
    1 bids

    Hi, I am new to Cloud GPU infrastructures like Google Cloud or Amazon AWS. I am deploying my custom deep learning model on my image dataset for my project. But, I am facing problems in how to save the data and train my model and dataset in Google Cloud platform, because I am very new to Google Cloud platform. I am looking for someone who is proficient in training Neural Networks using Pytorch and ...

    $199 (Avg Bid)
    $199 Avg Bid
    12 bids

    Design and implementation of improved parallel lexical analyzer phase of computer design using OpenMP on Multicore Machines Tools to be used for implementation 1. OpenMP tool for Parallelizing programs 2. Multi2sim for simulating multicore architecture 3. Intel Vtune Amplifier for analyzing the performance of the software in 32 and 64-bit machines.

    $175 (Avg Bid)
    $175 Avg Bid
    2 bids

    Add 2 matrices together using OpenMP

    $43 (Avg Bid)
    $43 Avg Bid
    6 bids

    help me build a cuda kernel to find local minima of an array

    $99 (Avg Bid)
    $99 Avg Bid
    9 bids

    I have a project written by Java. This project is very simple. Hashing with Sha256, and curve25519. ex) password = name123 name123 --->sha256--->curve25519--->hash I need more than million calculating power. I don't care about language if program works. ( python, c++, c, java etc) Maybe openCL or cuda is required ( in my opinion )

    $240 (Avg Bid)
    $240 Avg Bid
    8 bids

    I need a threshold segmentation based on local minima in a histogram implemented in CUDA The CUDA kernel will take the histogram (65536) as an argument (and other necessary args) and will return local minima in the histogram. It will need to follow parallel scanning techniques (aka array reduction

    $249 (Avg Bid)
    $249 Avg Bid
    4 bids

    I need a threshold segmentation based on local minima in a histogram implemented in CUDA The CUDA kernel will take the histogram (65536) as an argument (and other necessary args) and will return local minima in the histogram. It will need to follow parallel scanning techniques (aka array reduction

    $150 (Avg Bid)
    $150 Avg Bid
    1 bids

    I need ‘bucket sort’ algorithm’s serial C language code to be implemented as parallelized in both OpenMP, MPI and Cuda. Implementations are needed in 3 separate versions needed. The implementations are should be in C language.

    $10 (Avg Bid)
    $10 Avg Bid
    1 bids

    I need ‘bucket sort’ algorithm’s serial C language code to be implemented as parallelized in both OpenMP, MPI and Cuda. Implementations are needed in 3 separate versions needed. The implementations are should be in C language.

    $10 (Avg Bid)
    $10 Avg Bid
    1 bids

    I need ‘bucket sort’ algorithm’s serial C language code to be implemented as parallelized in both OpenMP, MPI and Cuda. Implementations are needed in 3 separate versions needed. The implementations are should be in C language.

    $45 (Avg Bid)
    $45 Avg Bid
    2 bids

    Hello , I have a project written and running under C++ , and need to convert some function to Cuda.

    $20 (Avg Bid)
    $20 Avg Bid
    2 bids
    $30 Avg Bid
    7 bids
    $30 Avg Bid
    3 bids

    I need sysadmin setup help to configure a machine learning project. The setup uses Ubuntu, NVIDIA CUDA, Anaconda (Python 3), TensorFlow-GPU, Jupyter Notebook. The project is published and running on Google Colab, however I need to run it locally on a dedicated machine. I've tried to install and configure it, however there must be some step I missed or some versioning issue. I need external he...

    $159 (Avg Bid)
    $159 Avg Bid
    16 bids

    We have a C++ application that does data acquisition from an array of electrodes, digitizes the signals, processes them and displays the results to the user. Part of the processing is a signal processing to filter the signals and do spikes detection. This application is running on a desktop under Windows, using the CPU processing power. In order to improve the performance we want to move the signa...

    $2162 (Avg Bid)
    $2162 Avg Bid
    12 bids

    Opencv has a library to do exposure compensation, I need someone that can rewrite it on cuda to make it faster on a Jetson Nano.

    $205 (Avg Bid)
    $205 Avg Bid
    4 bids

    Calculation of communication cost of parallel algorithm of prefix sum on a d-dimensional hypercube and parallel program for it and parllel program's to calculate prefix sum

    $23 (Avg Bid)
    $23 Avg Bid
    3 bids

    Developer preferably should speak Russian/Ukrainian but I am open to candidates who can speak English too. --- ABOUT THE PROJECT: It's a simulation application with support of VR and Desktop PCs focused on rendering, physics and interactions with realistic game characters. Project has been in active development for more than a year. This is a startup. YOUR ROLE: Work directly with a Lead De...

    $21 / hr (Avg Bid)
    $21 / hr Avg Bid
    3 bids

    GPU programming with CUDA & implementation of adaptive filtering algorithm with GPUs

    $432 (Avg Bid)
    $432 Avg Bid
    7 bids

    Looking for someone who has good knowledge in deep learning. Must have experience in Tensor Flow . Knowing google colab or cuda would be helpful since the model takes a long time to run.

    $23 (Avg Bid)
    $23 Avg Bid
    8 bids

    Aiman project Introduction: The current project is to with Neural Machine Translation (NMT). The proposed model is based on the basic transformer architecture. The research direction inspired by previous work in NMT capsule networks which is to apply a dynamic aggregation layer. The other research direction is to apply joint training strategy with bidirectional (RTL and LTR), which is better to ca...

    $173 (Avg Bid)
    $173 Avg Bid
    6 bids

    In this project, you'll enhance two sequential programs, pi_sequential.c and dftw_sequential.c with OpenMP directives. The resulting source codes are to be named pi_openmp.c and dftw_openmp.c accordingly. Each should compile successfully when compiled with -Wall and -Werror flags. Task 1: pi_openmp.c: 5 points Using only the #pragma parallel directive to create a OpenMP implementation of pi...

    $22 (Avg Bid)
    $22 Avg Bid
    5 bids

    I want to convolute/filter a 2D-matrix. A linear convolution kernel in Nvidia CUDA is needed. It has to be optimized for a row-convolution with a 1D-filter (length 11 float elements). The input and output matrix consists of float numbers. The outer padding will be just zeros. The 1D-filter is provided as a __constant__ float*. Optimization should be done by preloading the tile data to the shared ...

    $96 (Avg Bid)
    $96 Avg Bid
    3 bids

    You must be able to install all these packages successfully : librosa==0.7.0 numpy==1.17.1 opencv-contrib-python==4.2.0.34 opencv-python==4.1.0.25 tensorflow==1.12.0 torch==1.1.0 torchvision==0.3.0 tqdm==4.45.0 numba==0.48 to be able to run the script I am after, I am currently running python 3.6. You will need to check wether cuda tools and cuda is installed

    $100 (Avg Bid)
    $100 Avg Bid
    24 bids

    Hi, I have sequential code in the C language, I need you to turn that same code in parallel using openMP, passing the amount of threads as input by the command line and if this amount of threads is different from the specified, return an error message and the program terminates execution. Dou you have interest? Reply to message and we can combine a price after you analyze the code.

    $10 (Avg Bid)
    $10 Avg Bid
    1 bids

    Hi, I have sequential code in the C language, I need you to turn that same code in parallel using openMP, passing the amount of threads as input by the command line and if this amount of threads is different from the specified, return an error message and the program terminates execution. Dou you have interest? Reply to message and we can combine a price after you analyze the code.

    $10 (Avg Bid)
    $10 Avg Bid
    1 bids

    Hi, I have sequential code in the C language, I need you to turn that same code in parallel using openMP, passing the amount of threads as input by the command line and if this amount of threads is different from the specified, return an error message and the program terminates execution. Dou you have interest? Reply to message and we can combine a price after you analyze the code.

    $20 (Avg Bid)
    $20 Avg Bid
    1 bids

    Project details in message... contact

    $25 (Avg Bid)
    $25 Avg Bid
    5 bids

    I am building an application which needs to capture an image from 4 different cameras, process image using OpenCV build with OpenCL and stitches 4 Images at the end. I am new to this stream, I was able to build an application using v4l2 to capture Images. I am creating cv::UMat using OpenCL Buffer Created with clCreateBuffer. But transfer captured frame form v4l2 Buffer to OpenCL Buffer, I am us...

    $8 - $20
    $8 - $20
    0 bids

    Using the project which I will provide (K-Means clusterization), written in Visual C++, you should modify it using OpenMP and CUDA, so that, the clusterization will run faster when executed by CPU and GPU together. The objectives are: - split the task in an array of subtasks - distribute each subtask to CPU or GPU following some criterias which makes the distribution be the optimal one - posibil...

    $4 / hr (Avg Bid)
    $4 / hr Avg Bid
    4 bids

    Parallel solution of a work (I have done the sequential version) using C++ and OpenMP or MPI. Using my sequential solution in [login to view URL] to parallelize it. And screenshots of running program and trials of it.

    $23 (Avg Bid)
    $23 Avg Bid
    5 bids

    Parallel solution of a work (I have done the sequential version) using C++ and OpenMP or MPI. Using my sequential solution in [login to view URL] to parallelize it. And screenshots of running program and trials of it.

    $4 / hr (Avg Bid)
    $4 / hr Avg Bid
    3 bids

    I have a computer running Ubuntu 18.04 with 2 RTX 2080 Ti graphic cards. I went through hell getting the nvidia drivers installed but I got it running Nvidia driver 450.66 and when I run Nvidia-SMI in terminal it says its running cuda version 11.0 But after setting up a anaconda enviorment and source activate RCNN into the enviroment and pip install tensorflow and tensorflow-gpu (It installed te...

    $67 (Avg Bid)
    $67 Avg Bid
    5 bids

    Hello, We are working on G4 instances which support and GPU However while opening the jupyter notebook by connecting through SSH, we are unable to enable CUDA/CuDNN Drivers in the notebook to work on tensorflow

    $14 (Avg Bid)
    $14 Avg Bid
    1 bids

    Deep learning CNN optimization using CUDA

    $3 / hr (Avg Bid)
    $3 / hr Avg Bid
    4 bids

    Hello, hope you are doing well I have an urgent work to be done. Maybe you can help? Deadline is 48h. I have problem with Colab Notebook. To be more precise all works good till Automatic Post Processing section. In this section I must process [login to view URL] file but it throws errors /pytorch/aten/src/THC/[login to view URL]: indexSelectLargeIndex: block: [201,0,0], thread: [12,0,0] Assertio...

    $187 (Avg Bid)
    $187 Avg Bid
    7 bids
    $34 / hr Avg Bid
    10 bids

    I would like to translate a project i have t

    $19 (Avg Bid)
    $19 Avg Bid
    6 bids

    Cuda to read multiple csv and perform sql select functionalities using lag and lead and aggregation functions and basic algebraic functions

    $357 (Avg Bid)
    $357 Avg Bid
    4 bids

    Requirement 1: Moving Object detection and Tracking using OpenCV and Image processing methods Programming: C/C++/Python Libraries : OpenCV , All Image processing libraries Outcome : Application is able to detect moving objects in moving camera platform and its tracking path is also to be displayed . Requirement 2: Object Detection and classification Programming : Python/ CUDA /C/C++ Librarie...

    $244 (Avg Bid)
    $244 Avg Bid
    12 bids

    (Please contact no experience). We have developed AI algorithm on x86. We are thinking of making it work with Jetson Xavier. The basic application is using Intel MKL. I have a file called RoiDFT.h/[login to view URL] in it, and I use MKL in the part of the application. MKL is used to calculate IDFT. In RoiDFT, IDFT calculation is executed by multiple threads, and OpenMP performs parallel pro...

    $483 (Avg Bid)
    $483 Avg Bid
    3 bids

    I am interested to develop some examples/assignment using cuDNN with the help of CUDA and tensorflow. i need end to end explanation from installation to execution . Tasks are like: Face detection, Image classification, object detection etc using cuDNN library along with CUDA. So I am looking for expert in CUDA with good understanding of tensorflow and depp learning.

    $160 (Avg Bid)
    $160 Avg Bid
    2 bids

    Looking for someone to help me complete a programming side project using dynamic and static threads to implement a Prewitt filter edge detection algorithm. I've already written a fair majority of the code and am able to provide it upon job acceptance. Will also provide clear requirements. This should be a fairly straight forward task for anybody proficient in C++ and OpenMP. Please don�...

    $132 (Avg Bid)
    $132 Avg Bid
    4 bids