26 to 50 of 230 Permanent CUDA Jobs

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Idaho, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Hialeah, Florida, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Austin, Texas, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Tallahassee, Florida, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Riverside, California, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Naperville, Illinois, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Lubbock, Texas, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Wichita, Kansas, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Renton, Washington, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Canton, Ohio, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Owensboro, Kentucky, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Bowie, Maryland, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Kokomo, Indiana, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Yuma, Arizona, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Lawton, Oklahoma, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Honolulu, Hawaii, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Minnetonka, Minnesota, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Murfreesboro, Tennessee, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Reno, Nevada, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Springdale, Arkansas, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Puyallup, Washington, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Gary, Indiana, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Plymouth, Minnesota, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Seattle, Washington, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Roanoke, Virginia, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility, 2) Key Responsibilities - Develop, tune, and benchmark CUDA kernels for tensor and operator workloads. - Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling. - Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools. - Report performance metrics, analyze speedups, and propose ...