L40S NVL

NVIDIA

NVIDIA · NVIDIA Ada Lovelace Architecture · Released Q3 2023 · 18,176 CUDA Cores · 48GB GDDR6 with ECC VRAM · 350W TDP

L40S NVL

OVERVIEW

The NVIDIA L40S NVL is a high-performance datacenter GPU designed for AI, machine learning, and high-performance computing workloads. It is part of the Ada Lovelace architecture, offering enhanced performance and efficiency over previous generations. Targeted at enterprise and cloud environments, it features advanced capabilities for large-scale AI model training and inference, making it a key component in modern AI infrastructure.

SPECIFICATIONS

RT Cores142
CUDA Cores18176
GPU Memory48GB GDDR6 with ECC
NEBS ReadyLevel 3
FP32 TFLOPS91.6
Form Factor4.4" (H) x 10.5" (L), dual slot
Tensor Cores568
Display Ports4x DisplayPort 1.4a
NVENC | NVDEC3x | 3x (includes AV1 encode and decode)
FP8 Tensor Core733
Power Connector16-pin
GPU ArchitectureNVIDIA Ada Lovelace Architecture
Memory Bandwidth864GB/s
Max Power Consumption350W
Peak INT4 Tensor TOPS733
Peak INT8 Tensor TOPS733
Interconnect InterfacePCIe Gen4 x16: 64GB/s bidirectional
FP16 Tensor Core TFLOPS362.05
TF32 Tensor Core TFLOPS183
RT Core Performance TFLOPS209
BFLOAT16 Tensor Core TFLOPS362.05
Secure Boot With Root of TrustYes
Virtual GPU (vGPU) Software SupportYes

WHAT THIS GPU IS GOOD AT

The L40S NVL excels in AI and machine learning workloads, particularly in training large neural networks and performing complex inference tasks. Its architecture provides significant performance improvements in FP16 and INT8 operations, making it ideal for deep learning applications. The GPU's high memory bandwidth and capacity also support data-intensive tasks, setting it apart from alternatives.

SERVER OPTIONS

The NVIDIA L40S NVL is available in various server platforms, including NVIDIA's own DGX systems and OEM servers from major vendors like Dell PowerEdge, HPE ProLiant, and Supermicro. It is also offered in cloud instances such as AWS p5, Azure ND, and GCP A3, providing flexible deployment options for enterprises and researchers.

POWER, THERMALS & NOISE

The L40S NVL has a high TDP, necessitating robust cooling solutions, typically liquid cooling in datacenter environments. It is designed to maintain optimal thermal performance under heavy workloads, though noise levels are generally not a primary concern in server deployments. Efficient cooling is crucial to prevent thermal throttling and ensure consistent performance.

COMPATIBILITY & SYSTEM FIT

This GPU is available in the PCIe form factor, supporting PCIe Gen 4 for high data throughput. It is compatible with NVLink, allowing for high-speed interconnects between multiple GPUs. Adequate power supply is essential, with specific PSU requirements depending on the server configuration. It fits into standard server chassis designed for high-performance computing.

LIMITATIONS & KNOWN TRADE-OFFS

While the L40S NVL offers exceptional performance, it comes with a high power consumption and cost, which may not be suitable for all budgets. Availability can be constrained due to high demand in AI and HPC sectors. Users should also consider the need for advanced cooling solutions to manage its thermal output effectively.

PRICING

Vendor
Price
Unit
Currency
Date Added
Atlantic.Net
View listing →
$1.58
hour
USD
Feb 2, 2026

NOTES

Unparalleled AI and graphics performance for the data center. Ideal for generative AI, large language model training and inference, rendering, 3D graphics, and video content AI training.

"For more information visit: www.pny.eu"