NVIDIA Unveils Nemotron 3 Nano Omni, a Multimodal Model with Up to 92x Higher Throughput

AI 04.29.26

on april 28, local time, nvidia officially unveiled nemotron 3 nano omni, an open-source, multimodal inference model designed to provide an integrated foundation model for enterprise-grade ai agents. built on a 30-billion-parameter, a3b mixture-of-experts (moe) architecture, the model can dynamically activate based on tasks and modalities, delivering high throughput and scalable multimodal performance.

unlike traditional solutions that rely on fragmented chains of vision–speech–language models, nemotron 3 nano omni integrates unified multimodal inference across video, audio, images, and text into a single, efficient, open model, thereby reducing inference hops and orchestration complexity, significantly lowering inference costs, and enhancing cross-modal contextual consistency. under a fixed interaction latency threshold, the model’s effective system capacity in video inference tasks is up to approximately 9.2 times higher than that of other open-source multimodal models, and up to about 7.4 times higher in multi-document inference tasks.

this model can serve as a multimodal perception and context sub-agent within agent systems, enabling agents to process visual, audio, and textual inputs within a single, shared “perception–action” loop. on the document intelligence benchmarks mmlongbench-doc and ocrbenchv2, it achieves state-of-the-art accuracy in its class, and also delivers outstanding performance on video and audio understanding benchmarks such as worldsense, dailyomni, and voicebench. in terms of architectural design, nemotron 3 nano omni combines mamba layers—designed to enhance sequence and memory efficiency—with transformer layers—optimized for precise inference—resulting in up to fourfold improvements in memory and computational efficiency. visual processing employs 3d convolutions to capture inter-frame motion, the audio component is based on nvidia’s parakeet encoder, and the text component uses a powerful language model as its central decoder.

the model’s weights are currently available on hugging face and will soon be deployed as an nvidia nim microservice, allowing developers to freely customize, deploy, and integrate multimodal sub-agents.

The rumored M6‑based MacBook Pro may, for the first time, feature 5G cellular connectivity

according to multiple supply-chain sources, apple is accelerating preparations for mass production of the m6‑series macbook pro, which is expected to launch o

06.15.26 0

Samsung may equip its widescreen foldable phone with an innovative hinge technology similar to the rumored iPhone Fold, designed to significantly reduce screen creases and enhance overall device reliability

according to the latest report from korean media outlet zdnet korea, samsung is exploring a more robust display approach for its next-generation vertically fold

06.15.26 1

MIT has developed an innovative dual-mode propulsion system, specifically designed for deep-space missions with microsatellites, achieving, for the first time, a breakthrough in simultaneously enhancing both propulsion performance and energy efficiency at

the massachusetts institute of technology (mit) aerospace team is breaking through bottlenecks in micro‑ and nano‑satellite propulsion technology, developing

06.15.26 1

Ukraine has, for the first time, deployed “Terminator” AI drones to carry out autonomous target identification and strike missions, successfully killing Russian frontline personnel

according to an exclusive report by new scientist, ukraine’s military application of artificial intelligence has reached a historic turning point: in 2024, du

06.15.26 1

Apple has abandoned the Face ID solution for the iPhone Ultra, opting instead for an in‑display side-mounted fingerprint sensor—a groundbreaking design that tech bloggers have hailed as “like a dream”

a tech blogger exclaimed on social media: “apple has truly turned the $2,000 iphone ultra into reality—touch id is back on the power button, and face id is c

06.15.26 0

Attorneys general from multiple US states have launched a joint investigation into OpenAI, focusing on its advertising practices and measures to safeguard content for minors

a multi-state joint investigative team, composed of attorneys general from several states, has officially launched a formal review of the artificial intelligen

06.15.26 1

The Samsung Galaxy S27 and Xiaomi’s Mi 18 series unexpectedly appeared in the same global certification database, sparking speculation within the industry about potential adjustments to Xiaomi’s product roadmap

although several months remain before their official unveiling, the next‑generation flagship models from samsung and xiaomi have quietly appeared in global ce

06.15.26 1

Apple is secretly developing a brand-new, in-house camera app, which could make its debut alongside the iPhone 18 Pro lineup

at wwdc 2026, while apple unveiled major system updates like ios 27, it deliberately held back a key imaging feature—a completely redesigned camera app that o

06.15.26 0

Lenovo has officially launched the new Yoga Pro 7 15-inch laptop, which globally debuts support for dynamic video memory allocation technology, allowing up to 96 GB of system memory to be flexibly allocated as video memory

lenovo has officially unveiled its new flagship yoga pro 7 15ash11 laptop, built around the amd strix halo platform to redefine the high-end mobile creative ex

06.15.26 1

Huawei’s FreeClip 2 Collector’s Edition has been officially launched, debuting with the HarmonyOS 6 operating system

at 10:08 on june 15, huawei terminal officially launched the freeclip 2 collector’s edition earclip-style headphones, with a launch price of 1,499 yuan. this

06.15.26 0

The OnePlus Turbo 6X series has officially launched, pre-installed with the all-new ColorOS 16, delivering a smooth user experience that lasts for six years

on june 15, oneplus officially launched its brand-new turbo 6x series and simultaneously kicked off its first-ever omnichannel sales. the lineup includes two f

06.15.26 0

Key specs of the OPPO Find X10 Pro have surfaced: it will debut MediaTek’s flagship chip built on TSMC’s 2nm process, and its imaging system features a dual 200-megapixel ultra‑clear main camera

recently, the specifications of a flagship dimensity engineering prototype built on the cutting-edge 2nm process were unexpectedly leaked, drawing widespread a

06.15.26 0

At present, Siri’s intelligent interaction capabilities are roughly on par with the technological level of mainstream AI chatbots from six months ago

apple’s ai strategy has reached a pivotal turning point: the all-new siri is redefining the boundaries of smart assistants with its contextual understanding a

06.15.26 1

Goldman Sachs’ latest research report indicates that the market’s current assessment of the true demand for artificial intelligence is markedly conservative, while corporate‑level AI investment continues to gain momentum The report forecasts that global t

goldman sachs’ latest research report points out that the market has significantly misjudged the pace of ai infrastructure expansion—far from peaking, the wa

06.15.26 1

Oracle has announced adjustments to the resource quotas of its permanently free cloud service tiers: once the limits are exceeded, the services will be automatically suspended, and any usage beyond the quota will be billed on a pay-as-you-go basis

oracle cloud recently announced an official update to its free-tier policy, stating that starting june 15, 2026, the permanently free arm‑based plan for globa

06.15.26 0