NVIDIA 900-2G183-0000-001 Tesla T4 Graphic Card - 75W - 16 GB - PCIe - Full Height
SabrePC B2B Account Services
Save instantly and shop with assurance knowing that you have a dedicated account team a phone call or email away to help answer any of your questions with a B2B account.
- Business-Only Pricing
- Personalized Quotes
- Fast Delivery
- Products and Support

$2,299.00
NVIDIA 900-2G183-0000-001 Tesla T4 Graphic Card - 75W - 16 GB - PCIe - Full Height
Next-Level Acceleration Has Arrived
We're racing toward the future where every customer interaction, every product, and every service offering will be touched and improved by AI. Realizing that the future requires a computing platform that can accelerate the full diversity of modern AI, enabling businesses to create new customer experiences, reimagine how they meet-and exceed-customer demands, and cost-effectively scale their AI-based products and services.
The NVIDIA® T4 GPU accelerates diverse cloud workloads, including high-performance computing, deep learning training and inference, machine learning, data analytics, and graphics. Based on the new NVIDIA Turing™ architecture and packaged in an energy-efficient 70-watt, small PCIe form factor, T4 is optimized for scale-out computing environments and features multi-precision Turing Tensor Cores and new RT Cores. Combined with accelerated containerized software stacks from NGC, T4 delivers revolutionary performance at scale.
Breakthrough Performance
T4 introduces the revolutionary Turing Tensor Core technology with multi-precision computing to handle diverse workloads. Powering breakthrough performance from FP32 to FP16 to INT8, as well as INT4 precisions, T4 delivers up to 40X higher performance than CPUs.
STATE-OF-THE-ART INFERENCE IN REAL-TIME
Responsiveness is key to user engagement for services such as conversational AI, recommender systems, and visual search. As models increase in accuracy and complexity, delivering the right answer right now requires exponentially larger compute capability. T4 delivers up to 40X times better low-latency throughput, so more requests can be served in real time.