* . *
  • Tech News
    Nikon’s Z5 II is the cheapest full-frame camera yet with internal RAW video

    Nikon’s Z5 II is the cheapest full-frame camera yet with internal RAW video

    The Morning After: Let’s talk Switch 2 pricing

    The Morning After: Let’s talk Switch 2 pricing

    Amazon’s ‘Buy for Me’ AI will purchase stuff from third-party websites

    Amazon’s ‘Buy for Me’ AI will purchase stuff from third-party websites

    Vibe coding at enterprise scale: AI tools now tackle the full development lifecycle

    Vibe coding at enterprise scale: AI tools now tackle the full development lifecycle

  • Reviews
  • Noteworthy
  • Science
  • Opinions
  • Applications
  • Blockchain
    Gain an edge with DTX’s groundbreaking Hybrid Blockchain: Presale now open for LINK and XRP Traders

    Gain an edge with DTX’s groundbreaking Hybrid Blockchain: Presale now open for LINK and XRP Traders

    Unraveling the Mystery: What Exactly is Blockchain Technology?

    Unraveling the Mystery: What Exactly is Blockchain Technology?

    Revolutionary Gasless Blockchain Gaming Partnership Between Atari Founder’s New Firm and Skale Labs

    Discover the Exciting Outcome of a Blockchain Experiment: Decentralized Learning Robots Swarm to Success

    Unleashing a Swarm of Decentralized Learning Robots: The Surprising Results of Blockchain Experiment

    Vishvasya: Revolutionizing Citizen-Centric Apps with National Blockchain Framework for Enhanced Security and Transparency

    Vishvasya: Revolutionizing Citizen-Centric Apps with National Blockchain Framework for Enhanced Security and Transparency

  • Applications
  • Culture
  • Deals
  • Events
  • How-to
  • Roundups
  • Startups
Wednesday, May 14, 2025
No Result
View All Result
Tech News, Magazine & Review WordPress Theme 2017
  • Contact Us
  • Legal
    • Privacy Policy
    • Terms of Use
    • DMCA
    • Cookie Privacy Policy
    • California Consumer Privacy Act (CCPA)
  • Tech News
    Nikon’s Z5 II is the cheapest full-frame camera yet with internal RAW video

    Nikon’s Z5 II is the cheapest full-frame camera yet with internal RAW video

    The Morning After: Let’s talk Switch 2 pricing

    The Morning After: Let’s talk Switch 2 pricing

    Amazon’s ‘Buy for Me’ AI will purchase stuff from third-party websites

    Amazon’s ‘Buy for Me’ AI will purchase stuff from third-party websites

    Vibe coding at enterprise scale: AI tools now tackle the full development lifecycle

    Vibe coding at enterprise scale: AI tools now tackle the full development lifecycle

  • Reviews
  • Noteworthy
  • Science
  • Opinions
  • Applications
  • Blockchain
    Gain an edge with DTX’s groundbreaking Hybrid Blockchain: Presale now open for LINK and XRP Traders

    Gain an edge with DTX’s groundbreaking Hybrid Blockchain: Presale now open for LINK and XRP Traders

    Unraveling the Mystery: What Exactly is Blockchain Technology?

    Unraveling the Mystery: What Exactly is Blockchain Technology?

    Revolutionary Gasless Blockchain Gaming Partnership Between Atari Founder’s New Firm and Skale Labs

    Discover the Exciting Outcome of a Blockchain Experiment: Decentralized Learning Robots Swarm to Success

    Unleashing a Swarm of Decentralized Learning Robots: The Surprising Results of Blockchain Experiment

    Vishvasya: Revolutionizing Citizen-Centric Apps with National Blockchain Framework for Enhanced Security and Transparency

    Vishvasya: Revolutionizing Citizen-Centric Apps with National Blockchain Framework for Enhanced Security and Transparency

  • Applications
  • Culture
  • Deals
  • Events
  • How-to
  • Roundups
  • Startups
No Result
View All Result
Tech News
No Result
View All Result

Turbocharging AI: How the Apple-Nvidia Collaboration is Revolutionizing Model Production!

December 22, 2024
in Apple
Home Apple

Our mission is to provide unbiased product reviews and timely reporting of technological advancements. Covering all latest reviews and advances in the technology industry, our editorial team strives to make every click count. We aim to provide fair and unbiased information about the latest technological advances.
Share on FacebookShare on Twitter

Developing⁢ models for machine learning requires extensive computational resources

Recent advancements in machine learning by Apple are set to significantly enhance‍ the efficiency of generating models for Apple Intelligence. A newly introduced method has been found⁤ to nearly triple the‍ speed of token generation using Nvidia GPUs.

ADVERTISEMENT

Generating large language ‌models (LLMs) presents various challenges, particularly inefficiencies during the initial stages of ⁣their creation. The entire process of training machine learning ⁤models is both resource-heavy and ⁢time-consuming, often leading developers to invest heavily in additional hardware and face rising energy expenses.

Earlier‍ this year, Apple announced and made available its innovative Recurrent‍ Drafter technology—abbreviated as ReDrafter.​ This ‌technique utilizes speculative decoding to accelerate performance during training phases by employing a recurrent neural network that hybridizes beam‍ search with dynamic tree attention‌ for‍ optimizing draft tokens from numerous pathways.

As a result, this approach⁣ can improve LLM token generation speeds by ‌up to 3.5 times compared to conventional auto-regressive methods typically used in ⁣the field.

In a recent update on Apple’s Machine‍ Learning Research platform, it was⁣ reported that efforts continued beyond just integrating with Apple Silicon. The latest findings shared on Wednesday focused on adapting ReDrafter⁣ so it could be effectively utilized alongside ⁢Nvidia GPUs for production environments.

Nvidia’s high-performance GPUs are frequently deployed within servers dedicated⁤ to LLM generation; however, procuring such advanced ​hardware‍ can ​be prohibitively expensive. It is common for multi-GPU setups to exceed ⁢costs of $250,000 excluding ancillary⁤ infrastructure expenditures.

Apple collaborated closely with Nvidia engineers to seamlessly incorporate ⁢ReDrafter into the Nvidia TensorRT-Language Model (LLM) inference acceleration⁤ framework, necessitating new⁢ elements due to distinctive operational⁢ features used by ReDrafter not present ⁢in‌ many existing speculative decoding techniques.

Following this integration, machine learning developers leveraging ‌Nvidia GPUs now have access to ReDrafter’s enhanced token generation ​capabilities‌ through TensorRT-LMM without ‍limitations solely benefiting those utilizing Apple hardware.

Benchmark tests conducted ​on expansive LLMs ⁣containing tens⁢ of billions of parameters using Nvidia systems demonstrated ⁢an increase ⁢in⁢ token output rates per second by ⁤approximately ⁤2.7 ⁣times when employing greedy encoding tactics.

The​ practical impact is substantial—this advancement stands poised not only to reduce latency faced‍ by users but also lower ‍the overall hardware requirements necessary for operation. Ultimately, clients should receive swifter ‌responses from cloud queries while organizations can operate more efficiently at lower costs.

Nvidias technical blog highlighted that through collaborative efforts enhancing TensorRT-LMM’s functionality and adaptability would‌ empower developers within the LLM ecosystem fostering⁤ innovation around sophisticated model development along with easier deployment processes.

>

The publication outlining these developments comes‌ parallelly after Apples acknowledgment‍ regarding their exploration into Amazon’s Trainium2‍ chip application intended toward augmenting training efficiencies linking back towards expected gains ‍up deductive half over current methodologies employed.

Tags: AIAppleAppleNvidiaArtificial intelligencecollaborationInnovationMachine learningModelmodel productionNVIDIAproductionspeedstechnologyturbocharging AI

Denial of responsibility! tech-news.info is an automatic aggregator around the global media. All the content are available free on Internet. We have just arranged it in one platform for educational purpose only. In each content, the hyperlink to the primary source is specified. All trademarks belong to their rightful owners, all materials to their authors. If you are the owner of the content and do not want us to publish your materials on our website, please contact us by email – abuse@tech-news.info. The content will be deleted within 24 hours.
Previous Post

Unleash Your Inner Gamer: Dive into Steam Replay 2024 and Showcase Your Balatro Playtime Against Friends!

Next Post

Qualcomm Emerges Victorious: Licensing Dispute with ARM Settled!

RelatedPosts

Apple users are ditching the AirTag for this  alternative… but why?
Apple

Apple users are ditching the AirTag for this $30 alternative… but why?

April 5, 2025
How to use the new, easier Guest Mode on Vision Pro
Apple

How to use the new, easier Guest Mode on Vision Pro

April 5, 2025
iPhones Could Cost Up to ,300 in the U.S. Due to Tariffs, Analyst Says
Apple

iPhones Could Cost Up to $2,300 in the U.S. Due to Tariffs, Analyst Says

April 5, 2025
Apple will take a  billion hit to its bottom line because of Trump tariffs
Apple

Apple will take a $33 billion hit to its bottom line because of Trump tariffs

April 5, 2025
ADVERTISEMENT
Galaxy Ring wireless charging upgrade could ditch the case – Phandroid

Galaxy Ring wireless charging upgrade could ditch the case – Phandroid

April 5, 2025

Nikon’s Z5 II is the cheapest full-frame camera yet with internal RAW video

April 5, 2025

Mechanistic understanding could enable better fast-charging batteries

April 5, 2025

Apple users are ditching the AirTag for this $30 alternative… but why?

April 5, 2025

Grab the 2nd Gen Google Nest for Less than 100 Bucks! – Phandroid

April 5, 2025

How to use the new, easier Guest Mode on Vision Pro

April 5, 2025

The Morning After: Let’s talk Switch 2 pricing

April 5, 2025

Charging electric vehicles 5x faster in subfreezing temps

April 5, 2025

Deals: Moto Edge 60 Fusion and Pixel 9a arrive, iPhone 16  and 15 series are £100 off

April 5, 2025

iPhones Could Cost Up to $2,300 in the U.S. Due to Tariffs, Analyst Says

April 5, 2025

Categories

Select Category

    Archives

    Select Month
      May 2025
      MTWTFSS
       1234
      567891011
      12131415161718
      19202122232425
      262728293031 
      « Apr    
      • California Consumer Privacy Act (CCPA)
      • Contact Us
      • Cookie Privacy Policy
      • DMCA
      • Privacy Policy
      • Tech News
      • Terms of Use

      © 2015-2024 Tech-News.info
      DMCA.com Protection Status

      No Result
      View All Result
      • California Consumer Privacy Act (CCPA)
      • Contact Us
      • Cookie Privacy Policy
      • DMCA
      • Privacy Policy
      • Tech News
      • Terms of Use

      © 2015-2024 Tech-News.info
      DMCA.com Protection Status

      This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
      Go to mobile version