101 to 125 of 230 Permanent CUDA Jobs

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Jersey City, New Jersey, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Johnson City, Tennessee, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Paterson, New Jersey, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Jefferson City, Missouri, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Newark, New Jersey, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Hempstead, New York, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Farmington Hills, Michigan, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Bowling Green, Kentucky, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Apache Junction, Arizona, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Huntington, West Virginia, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Saint Paul, Minnesota, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Albuquerque, New Mexico, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: New Brunswick, New Jersey, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Greenville, North Carolina, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Huntersville, North Carolina, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Sterling Heights, Michigan, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Cleveland Heights, Ohio, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Pawtucket, Rhode Island, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Charlotte, North Carolina, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Buffalo, New York, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Battle Creek, Michigan, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: New Rochelle, New York, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Spanish Fork, Utah, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Great Falls, Montana, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...

Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

Hiring Organisation: Mercor
Location: Passaic, New Jersey, United States
Employment Type: Permanent
Salary: USD 250 Hourly

Role Overview: Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while … maintaining correctness and reproducibility.
Key Responsibilities:
- Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.
- Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.
- Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.
- Report performance metrics, analyze speedups, and propose ...