inference Archives - insideAI News https://insideainews.com/tag/inference/ Illuminating AI's Frontiers: Your Go-To News Destination. Wed, 22 Mar 2023 21:26:58 +0000 en-US hourly 1 https://wordpress.org/?v=6.6.1 https://insideainews.com/wp-content/uploads/2024/06/iain-favicon.png inference Archives - insideAI News https://insideainews.com/tag/inference/ 32 32 136462205 NVIDIA Launches Inference Platforms for Large Language Models and Generative AI Workloads https://insideainews.com/2023/03/22/nvidia-launches-inference-platforms-for-large-language-models-and-generative-ai-workloads/ https://insideainews.com/2023/03/22/nvidia-launches-inference-platforms-for-large-language-models-and-generative-ai-workloads/#respond Wed, 22 Mar 2023 13:00:00 +0000 https://insidebigdata.com/?p=31899 NVIDIA launched four inference platforms optimized for a diverse set of rapidly emerging generative AI applications — helping developers quickly build specialized, AI-powered applications that can deliver new services and insights. The platforms combine NVIDIA’s full stack of inference software with the latest NVIDIA Ada, NVIDIA Hopper™ and NVIDIA Grace Hopper™ processors — including the NVIDIA L4 Tensor Core GPU and the NVIDIA H100 NVL GPU, both launched at GTC.]]> https://insideainews.com/2023/03/22/nvidia-launches-inference-platforms-for-large-language-models-and-generative-ai-workloads/feed/ 0 31899 Intel’s Habana Labs Launches Second-Generation AI Processors for Training and Inferencing https://insideainews.com/2022/05/10/intels-habana-labs-launches-second-generation-ai-processors-for-training-and-inferencing/ https://insideainews.com/2022/05/10/intels-habana-labs-launches-second-generation-ai-processors-for-training-and-inferencing/#respond Tue, 10 May 2022 15:00:00 +0000 https://insidebigdata.com/?p=29306 Intel announced that Habana Labs, its data center team focused on AI deep learning processor technologies, launched its second-generation deep learning processors for training and inference: Habana® Gaudi®2 and Habana® Greco™. These new processors address an industry gap by providing customers with high-performance, high-efficiency deep learning compute choices for both training workloads and inference deployments in the data center while lowering the AI barrier to entry for companies of all sizes.]]> https://insideainews.com/2022/05/10/intels-habana-labs-launches-second-generation-ai-processors-for-training-and-inferencing/feed/ 0 29306 TensorRT 8 Provides Leading Enterprises Fast AI Inference Performance https://insideainews.com/2021/07/20/tensorrt-8-provides-leading-enterprises-fast-ai-inference-performance/ https://insideainews.com/2021/07/20/tensorrt-8-provides-leading-enterprises-fast-ai-inference-performance/#respond Tue, 20 Jul 2021 13:30:00 +0000 https://insidebigdata.com/?p=26724 NVIDIA today launched TensorRT™ 8, the eighth generation of the company’s AI software, which slashes inference time in half for language queries -- enabling developers to build the world’s best-performing search engines, ad recommendations and chatbots and offer them from the cloud to the edge.]]> https://insideainews.com/2021/07/20/tensorrt-8-provides-leading-enterprises-fast-ai-inference-performance/feed/ 0 26724