Spark, a lightweight real-time coding model powered by Cerebras hardware and optimized for ultra-low-latency performance.
Nvidia noted that the cost per token fell from 20 cents on the older Hopper platform to 10 cents on Blackwell. Moving to Blackwell’s native low-precision NVFP4 format reduced the cost further, to just 5 ...