26 to 50 of 230 Permanent CUDA Jobs

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Idaho, United States
Employment Type
Permanent
Salary
USD 250 Hourly
Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.

Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Hialeah, Florida, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Austin, Texas, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Tallahassee, Florida, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Riverside, California, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Naperville, Illinois, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Lubbock, Texas, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Wichita, Kansas, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Renton, Washington, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Canton, Ohio, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Owensboro, Kentucky, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Bowie, Maryland, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Kokomo, Indiana, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Yuma, Arizona, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Lawton, Oklahoma, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Honolulu, Hawaii, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Minnetonka, Minnesota, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Murfreesboro, Tennessee, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Reno, Nevada, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Springdale, Arkansas, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Puyallup, Washington, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Gary, Indiana, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Plymouth, Minnesota, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Seattle, Washington, United States
Employment Type
Permanent
Salary
USD 250 Hourly

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation
Mercor
Location
Roanoke, Virginia, United States
Employment Type
Permanent
Salary
USD 250 Hourly