Groups Similar Look up By Text Browse About



Similar articles
Article Id Title Prob Score Similar Compare
212447 ZDNET 2021-7-20:
Nvidia announces launch of TensorRT 8 designed for chatbots, recommendations, and search
1.000 Find similar Compare side-by-side
212209 VENTUREBEAT 2021-7-20:
Nvidia releases TensorRT 8 for faster AI inference
0.983 0.733 Find similar Compare side-by-side
211943 VENTUREBEAT 2021-7-16:
OpenAI disbands its robotics research team
0.415 Find similar Compare side-by-side
212224 VENTUREBEAT 2021-7-20:
Untether AI nabs $125M for AI acceleration chips
0.387 Find similar Compare side-by-side
212346 VENTUREBEAT 2021-7-19:
Scaling AI and data science – 10 smart ways to move from pilot to production
0.385 Find similar Compare side-by-side
212312 VENTUREBEAT 2021-7-16:
AI and financial processes: Balancing risk and reward
0.378 Find similar Compare side-by-side
212293 VENTUREBEAT 2021-7-16:
Announcing the AI Innovation Awards winners at Transform 2021
0.377 Find similar Compare side-by-side
211947 VENTUREBEAT 2021-7-16:
VentureBeat presents AI Innovation Awards nominees at Transform 2021
0.368 Find similar Compare side-by-side
212497 TECHREPUBLIC 2021-7-22:
How low-code development could boost AI adoption
0.368 Find similar Compare side-by-side
212304 VENTUREBEAT 2021-7-16:
What to do when AI brings more questions than answers
0.359 Find similar Compare side-by-side
212593 ZDNET 2021-7-23:
Contentsquare acquires Upstride to speed up AI innovation for digital business
0.357 Find similar Compare side-by-side
212242 VENTUREBEAT 2021-7-20:
Lucata raises $11.9M to accelerate graph analytics with specialized hardware
0.356 Find similar Compare side-by-side
212480 VENTUREBEAT 2021-7-22:
Airbnb CTO says graph neural networks will be big in 2021
0.351 Find similar Compare side-by-side
212296 VENTUREBEAT 2021-7-16:
Announcing the winners of the Women in AI Awards at Transform 2021
0.344 Find similar Compare side-by-side
212096 VENTUREBEAT 2021-7-16:
Facebook’s BlenderBot 2.0 bot surfs the web for knowledge
0.344 Find similar Compare side-by-side
212566 VENTUREBEAT 2021-7-22:
DeepMind open-sources protein structure dataset generated by AlphaFold 2
0.339 Find similar Compare side-by-side
212307 VENTUREBEAT 2021-7-16:
AI Weekly: Can AI predict labor market trends?
0.332 Find similar Compare side-by-side
212472 VENTUREBEAT 2021-7-23:
How an AI entrepreneur deals with dirty real-world data
0.332 Find similar Compare side-by-side
212265 VENTUREBEAT 2021-7-20:
Employees want more AI to boost productivity, study finds
0.328 Find similar Compare side-by-side
212340 VENTUREBEAT 2021-7-18:
Duke Energy used computer vision and robots to cut costs by $74M
0.328 Find similar Compare side-by-side
212475 VENTUREBEAT 2021-7-22:
Equipping AI with emotional intelligence can improve outcomes
0.322 Find similar Compare side-by-side
212375 VENTUREBEAT 2021-7-21:
BlueOcean raises $15M to measure brand sentiment with AI
0.312 Find similar Compare side-by-side
212383 TECHREPUBLIC 2021-7-20:
These organizations are using AI to reshape operations in surprising ways
0.305 Find similar Compare side-by-side
212520 VENTUREBEAT 2021-7-21:
Algorithmia founder on MLOps’ promise and pitfalls
0.303 Find similar Compare side-by-side
212271 VENTUREBEAT 2021-7-20:
Freshworks: 93% of IT managers have deployed AI, or plan to soon
0.302 Find similar Compare side-by-side

1

ID: 212447

URL: https://www.zdnet.com/article/nvidia-announces-launch-of-tensorrt-8-designed-for-chatbots-recommendations-and-search/

Date: 2021-07-20

Nvidia announces launch of TensorRT 8 designed for chatbots, recommendations, and search

The eighth generation of Nvidia's AI software is able to cut inference time in half for language queries. Nvidia unveiled the eighth generation of its widely used TensorRT on Tuesday, announcing that the AI software is twice as powerful and accurate as its predecessor while cutting inference time in half for language queries. Tensor RT is used by hundreds of companies for things like search engines, ad recommendations, and chatbots. Siddharth Sharma, head of the product marketing team for Nvidia's AI software, told reporters on Monday that it has been downloaded more than 2.5 million times and is in use by companies like American Express, Verizon, LG, Ford, SK Telecom, KLA, Naver, GE Healthcare and USPS.  "TensorRT 8 is twice as powerful as 7, twice as accurate as TensorRT 7, and it supports sparsity which can dramatically reduce the amount of compute and memory needed for running applications," Sharma said. "With this achievement, you can now deploy the entire Bert-Large within a millisecond. That is huge and I believe that is going to lead to a completely new generation of conversational AI applications. A level of smartness, a level of latency that was unheard of before." Sharma explained that TensorRT 8's optimizations also allow for "record-setting speed for language applications, running BERT-Large, one of the world's most widely used transformer-based models, in 1.2 milliseconds." "In the past, companies had to reduce their model size which resulted in significantly less accurate results. Now, with TensorRT 8, companies can double or triple their model size to achieve dramatic improvements in accuracy," Sharma added.  TensorRT 8 is now available and free of charge to Nvidia Developer program members. The TensorRT GitHub repository also has the latest versions of plug-ins, parsers, and samples. Greg Estes, vice president of developer programs at Nvidia, said AI models are growing exponentially more complex, and worldwide demand is surging for real-time applications that use AI.  The latest version of TensorRT, Estes said, introduces new capabilities that enable companies to deliver conversational AI applications to their customers "with a level of quality and responsiveness that was never before possible. " Over the last five years, Nvidia said that more than 350,000 developers across 27,500 companies have used TensorRT, and Estes noted that TensorRT applications "can be deployed in hyperscale data centers, embedded or automotive product platforms." Sharma told reporters that TensorRT 8's unique AI inference was made possible through Sparsity and Quantization, two key features that increase efficiency and allow developers to use "trained models to run inference in INT8 precision without losing accuracy." GE Healthcare uses TensorRT in computer vision applications for ultrasounds, and Erik Steen, chief engineer of Cardiovascular Ultrasound at GE Healthcare, said the tool was vital in helping clinicians move faster.  "When it comes to ultrasound, clinicians spend valuable time selecting and measuring images. During the R&D project leading up to the Vivid Patient Care Elevated Release, we wanted to make the process more efficient by implementing automated cardiac view detection on our Vivid E95 scanner," Steen said. " The cardiac view recognition algorithm selects appropriate images for analysis of cardiac wall motion. TensorRT, with its real-time inference capabilities, improves the performance of the view detection algorithm and it also shortened our time to market during the R&D project."