• About Us
  • Privacy Policy
  • Disclaimer
  • Contact Us
TechTrendFeed
  • Home
  • Tech News
  • Cybersecurity
  • Software
  • Gaming
  • Machine Learning
  • Smart Home & IoT
No Result
View All Result
  • Home
  • Tech News
  • Cybersecurity
  • Software
  • Gaming
  • Machine Learning
  • Smart Home & IoT
No Result
View All Result
TechTrendFeed
No Result
View All Result

Bringing AI to UK Languages With NVIDIA Nemotron

Admin by Admin
September 15, 2025
Home Machine Learning
Share on FacebookShare on Twitter


Celtic languages — together with Cornish, Irish, Scottish Gaelic and Welsh — are the U.Ok.’s oldest residing languages. To empower their audio system, the UK-LLM sovereign AI initiative is constructing an AI mannequin based mostly on NVIDIA Nemotron that may cause in each English and Welsh, a language spoken by about 850,000 individuals in Wales at the moment.

Enabling high-quality AI reasoning in Welsh will help the supply of public providers together with healthcare, training and authorized assets within the language.

“I would like each nook of the U.Ok. to have the ability to harness the advantages of synthetic intelligence. By enabling AI to cause in Welsh, we’re ensuring that public providers — from healthcare to training — are accessible to everybody, within the language they dwell by,” mentioned U.Ok. Prime Minister Keir Starmer. “This can be a highly effective instance of how the newest AI expertise, educated on the U.Ok.’s most superior AI supercomputer in Bristol, can serve the general public good, shield cultural heritage and unlock alternative throughout the nation.”

The UK-LLM venture, established in 2023 as BritLLM and led by College Faculty London, has beforehand launched two fashions for U.Ok. languages. Its new mannequin for Welsh, developed in collaboration with Wales’ Bangor College and NVIDIA, aligns with Welsh authorities efforts to spice up the lively use of the language, with the purpose of reaching one million audio system by 2050 — an initiative generally known as Cymraeg 2050.

U.Ok.-based AI cloud supplier Nscale will make the brand new mannequin obtainable to builders by its software programming interface.

“The intention is to make sure that Welsh stays a residing, respiration language that continues to develop with the occasions,” mentioned Gruffudd Prys, senior terminologist and head of the Language Applied sciences Unit at Canolfan Bedwyr, the college’s middle for Welsh language providers, analysis and expertise. “AI exhibits huge potential to assist with second-language acquisition of Welsh in addition to for enabling native audio system to enhance their language expertise.”

This new mannequin might additionally increase the accessibility of Welsh assets by enabling public establishments and companies working in Wales to translate content material or present bilingual chatbot providers. This may help teams together with healthcare suppliers, educators, broadcasters, retailers and restaurant house owners guarantee their written content material is as available in Welsh as they’re in English.

Past Welsh, the UK-LLM staff goals to use the identical methodology used for its new mannequin to develop AI fashions for different languages spoken throughout the U.Ok. comparable to Cornish, Irish, Scots and Scottish Gaelic — in addition to work with worldwide collaborators to construct fashions for languages from Africa and Southeast Asia.

“This collaboration with NVIDIA and Bangor College enabled us to create new coaching knowledge and prepare a brand new mannequin in file time, accelerating our purpose to construct the all-time language mannequin for Welsh,” mentioned Pontus Stenetorp, professor of pure language processing and deputy director for the Centre of Synthetic Intelligence at College Faculty London. “Our intention is to take the insights gained from the Welsh mannequin and apply them to different minority languages, within the U.Ok. and throughout the globe.”

Tapping Sovereign AI Infrastructure for Mannequin Improvement 

The brand new mannequin for Welsh relies on NVIDIA Nemotron, a household of open-source fashions that options open weights, datasets and recipes. The UK-LLM growth staff has tapped the 49-billion-parameter Llama Nemotron Tremendous mannequin and 9-billion-parameter Nemotron Nano mannequin, post-training them on Welsh-language knowledge.

In contrast with languages like English or Spanish, there’s much less obtainable supply knowledge in Welsh for AI coaching. So to create a sufficiently giant Welsh coaching dataset, the staff used NVIDIA NIM microservices for gpt-oss-120b and DeepSeek-R1 to translate NVIDIA Nemotron open datasets with over 30 million entries from English to Welsh.

They used a GPU cluster by the NVIDIA DGX Cloud Lepton platform and are harnessing lots of of NVIDIA GH200 Grace Hopper Superchips on Isambard-AI — the U.Ok.’s strongest supercomputer, backed by £225 million in authorities funding and based mostly at College of Bristol — to speed up their translation and coaching workloads.

This new dataset dietary supplements present Welsh knowledge from the staff’s earlier efforts.

Capturing Linguistic Nuances With Cautious Analysis

Bangor College, situated in Gwynedd — the county with the highest share of Welsh audio system — is supporting the brand new mannequin’s growth with linguistic and cultural experience.

Welsh translation of: “The intention is to make sure that Welsh stays a residing, respiration language that continues to develop with the occasions.” — Gruffudd Prys, Bangor College

Prys, from the college’s Welsh-language middle, brings to the collaboration about twenty years of expertise with language expertise for Welsh. He and his staff are serving to to confirm the accuracy of machine-translated coaching knowledge and manually translated analysis knowledge, in addition to assess how the mannequin handles nuances of Welsh that AI sometimes struggles with — comparable to the way in which consonants at first of Welsh phrases change based mostly on neighboring phrases.

The mannequin, in addition to the Welsh coaching and analysis datasets, are anticipated to be made obtainable for enterprise and public sector use, supporting extra analysis, mannequin coaching and software growth.

“It’s one factor to have this AI functionality exist in Welsh, nevertheless it’s one other to make it open and accessible for everybody,” Prys mentioned. “That refined distinction may be the distinction between this expertise getting used or not getting used.”

Deploy Sovereign AI Fashions With NVIDIA Nemotron, NIM Microservices

The framework used to develop UK-LLM’s mannequin for Welsh can function a basis for multilingual AI growth around the globe.

Benchmark-topping Nemotron fashions, knowledge and recipes are publicly obtainable for builders to construct reasoning fashions tailor-made to nearly any language, area and workflow. Packaged as NVIDIA NIM microservices, Nemotron fashions are optimized for cost-effective compute and run wherever, from laptop computer to cloud.

Europe’s enterprises will be capable to run open, sovereign fashions on the Perplexity AI-powered search engine.

Get began with NVIDIA Nemotron.


Welsh translation: 

Ymestyn Ar Attracts yr Ynysoedd: Mae DU-LLM yn Dod â Deallusrwydd Artiffisial i Ieithoedd y DU Gyda NVIDIA Nemotron

Wedi’i hyfforddi ar yr uwch gyfrifiadur Isambard-AI, mae mannequin newydd a ddatblygwyd gan College Faculty London, NVIDIA a Phrifysgol Bangor yn manteisio ar dechnegau a setiau knowledge ffynhonnell agored NVIDIA Nemotron i alluogi rhesymu Deallusrwydd Artiffisial ar gyfer y Gymraeg ac ieithoedd eraill y DU ar gyfer gwasanaethau cyhoeddus gan gynnwys gofal iechyd, addysg ac adnoddau cyfreithiol.

Ieithoedd Celtaidd — gan gynnwys Cernyweg, Gwyddeleg, Gaeleg yr Alban a Chymraeg — yw ieithoedd byw hynaf y DU. Er mwyn grymuso eu siaradwyr, mae menter Deallusrwydd Artiffisial sofran y DU-LLM yn adeiladu mannequin Deallusrwydd Artiffisial yn seiliedig ar NVIDIA Nemotron a all resymu yn Saesneg a Chymraeg hefyd, iaith a siaredir gan tua 850,000 o bobl yng Nghymru heddiw.

Bydd galluogi rhesymu Deallusrwydd Artiffisial o ansawdd uchel yn y Gymraeg yn cefnogi’r ddarpariaeth o wasanaethau cyhoeddus gan gynnwys gofal iechyd, addysg ac adnoddau cyfreithiol yn yr iaith.

“Rwyf am i bob cwr o’r DU allu harneisio manteision deallusrwydd artiffisial. Drwy alluogi deallusrwydd artiffisial i resymu yn y Gymraeg, rydym yn sicrhau bod gwasanaethau cyhoeddus — o ofal iechyd i addysg — yn hygyrch i bawb, yn yr iaith maen nhw’n byw ynddi,” meddai Prif Weinidog y DU, Keir Starmer. “Mae hon yn enghraifft bwerus o sut y gall y dechnoleg dddiweddaraf, wedi’i hyfforddi ar uwch gyfrifiadur deallusrwydd artiffisial mwyaf datblygedig y DU ym Mryste, wasanaethu lles y cyhoedd, amddiffyn treftadaeth ddiwylliannol a datgloi cyfleoedd ledled y wlad.”

Mae prosiect DU-LLM, a sefydlwyd yn 2023 fel BritLLM ac a arweinir gan College Faculty London, wedi rhyddhau dau fodel ar gyfer ieithoedd y DU yn flaenorol. Mae ei fodel newydd ar gyfer y Gymraeg, a ddatblygwyd mewn cydweithrediad â Phrifysgol Bangor Cymru ac NVIDIA, yn cyd-fynd ag ymdrechion llywodraeth Cymru i hybu defnydd gweithredol o’r iaith, gyda’r nod o gyflawni miliwn o siaradwyr erbyn 2050 — menter o’r enw Cymraeg 2050.

Bydd darparwr cwmwl Deallusrwydd Artiffisial yn y DU, Nscale, yn sicrhau bod y mannequin newydd ar gael i ddatblygwyr trwy ei ryngwyneb rhaglennu rhaglenni (API).

“Y nod yw sicrhau bod y Gymraeg yn parhau i fod yn iaith fyw, sy’n anadlu ac sy’n parhau i ddatblygu gyda’r oes,” meddai Gruffudd Prys, uwch derminolegydd a phennaeth yr Uned Technolegau Iaith yng Nghanolfan Bedwyr, canolfan y brifysgol ar gyfer gwasanaethau, ymchwil a thechnoleg y Gymraeg. “Mae deallusrwydd artiffisial yn dangos potensial aruthrol i helpu gyda chaffael y Gymraeg fel ail iaith yn ogystal â galluogi siaradwyr brodorol i wella eu sgiliau iaith.”

Gallai’r mannequin newydd hwn hefyd roi hwb i hygyrchedd adnoddau Cymraeg drwy alluogi sefydliadau cyhoeddus a busnesau sy’n gweithredu yng Nghymru i gyfieithu cynnwys neu ddarparu gwasanaethau sgwrsfot dwyieithog. Gall hyn helpu grwpiau gan gynnwys darparwyr gofal iechyd, addysgwyr, darlledwyr, manwerthwyr a pherchnogion bwytai i sicrhau bod eu cynnwys ysgrifenedig yr un mor hawdd ar gael yn y Gymraeg ag y mae yn Saesneg.

Y tu hwnt i’r Gymraeg, mae tîm y DU-LLM yn anelu at gymhwyso’r un fethodoleg a ddefnyddiwyd ar gyfer ei fodel newydd i ddatblygu modelau Deallusrwydd Artiffisial ar gyfer ieithoedd eraill a siaredir ledled y DU fel Cernyweg, Gwyddeleg, Sgoteg a Gaeleg yr Alban — yn ogystal â gweithio gyda chydweithwyr rhyngwladol i adeiladu modelau ar gyfer ieithoedd o Affrica a De-ddwyrain Asia.

“Mae’r cydweithrediad hwn gydag NVIDIA a Phrifysgol Bangor wedi ein galluogi i greu knowledge hyfforddi newydd a hyfforddi mannequin newydd mewn amser file, gan gyflymu ein nod o adeiladu’r mannequin iaith gorau erioed ar gyfer y Gymraeg,” meddai Pontus Stenetorp, yr athro prosesu iaith naturiol a dirprwy gyfarwyddwr y Ganolfan Deallusrwydd Artiffisial yn College Faculty London. “Ein nod yw cymryd y mewnwelediadau a gafwyd o’r mannequin Cymraeg a’u cymhwyso i ieithoedd lleiafrifol eraill, yn y DU ac ar attracts y byd.

Manteisio ar Seilwaith Deallusrwydd Artiffisial Sofran ar gyfer Datblygu Mannequin 

Mae’r mannequin newydd ar gyfer y Gymraeg yn seiliedig ar NVIDIA Nemotron, teulu o fodelau ffynhonnell agored sy’n cynnwys pwysau, setiau knowledge a ryseitiau agored.Mae’r tîm datblygu DU-LLM wedi manteisio ar fodel 49-biliwn-paramedr Llama Nemotron Tremendous a mannequin 9-biliwn-paramedr Nemotron Nano, gan eu hôl hyfforddi ar ddata iaith Gymraeg.

O’i gymharu ag ieithoedd fel Saesneg neu Sbaeneg, mae llai o ddata ffynhonnell ar gael yn y Gymraeg ar gyfer hyfforddiant Deallusrwydd Artiffisial. Felly, er mwyn creu set ddata hyfforddi Cymraeg ddigon mawr, defnyddiodd y tîm ficrowasanaethau NVIDIA NIM ar gyfer gpt-oss-120b a DeepSeek-R1 i gyfieithu setiau knowledge agored NVIDIA gyda dros 30 miliwn o gofnodion o’r Saesneg i’r Gymraeg.

Defnyddion nhw glwstwr GPU drwy blatfform NVIDIA DGX Cloud Lepton ac yn harneisio cannoedd o Uwchsglodion NVIDIA GH200 Grace Hopper ar Isambard-AI — uwchgyfrifiadur mwyaf pwerus y DU, gyda chefnogaeth £225 miliwn o fuddsoddiad gan y llywodraeth ac wedi’i leoli ym Mhrifysgol Bryste — i gyflymu eu llwythi gwaith cyfieithu a hyfforddi.

Mae’r set ddata newydd hon yn ategu knowledge presennol yr iaith Gymraeg o ymdrechion blaenorol y tîm.

Cipio Naws Ieithyddol Gyda Gwerthusiad Gofalus

Mae Prifysgol Bangor, sydd wedi’i lleoli yng Ngwynedd — y sir gyda’r ganran uchaf o siaradwyr Cymraegs — yn cefnogi datblygiad y mannequin newydd gydag arbenigedd ieithyddol a diwylliannol.

Mae Prys, o ganolfan Gymraeg y brifysgol, yn dod â thua dau ddegawd o brofiad gyda thechnoleg iaith ar gyfer y Gymraeg i’r cydweithrediad. Mae ef a’i dîm yn helpu i wirio cywirdeb knowledge hyfforddi a gyfieithir gan beiriannau an information gwerthuso a gyfieithir â llaw, yn ogystal ag asesu sut mae’r mannequin yn ymdrin â naws Gymraeg y mae Deallusrwydd Artiffisial fel arfer yn cael trafferth â nhw — megis y ffordd y mae cytseiniaid ar ddechrau geiriau Cymraeg yn newid yn seiliedig ar eiriau cyfagos.

Disgwylir i’r mannequin, yn ogystal â’r setiau knowledge hyfforddiant a gwerthuso’r Gymraeg, fod ar gael i fentrau a’r sector cyhoeddus eu defnyddio, gan gefnogi ymchwil ychwanegol, hyfforddiant modelu a datblygu rhaglenni.

“Mae’n un peth cael y gallu Deallusrwydd Artiffisial hwn yn bodoli yn y Gymraeg, ond mae’n beth arall ei wneud yn agored ac yn hygyrch i bawb,” meddai Prys. “Gall y gwahaniaeth cynnil hwnnw fod y gwahaniaeth rhwng y dechnoleg hon yn cael ei defnyddio ai peidio.”

Defnyddio Modelau Deallusrwydd Artiffisial Sofran Gyda NVIDIA Nemotron, Microwasanaethau NIM

Gall y fframwaith a ddefnyddiwyd i ddatblygu mannequin DU-LLM ar gyfer y Gymraeg fod yn sylfaen ar gyfer datblygu Deallusrwydd Artiffisial amlieithog ledled y byd.

Mae modelau, knowledge a ryseitiau Nemotron, sy’n cyrraedd y brig, ar gael yn gyhoeddus i ddatblygwyr er mwyn iddynt adeiladu modelau rhesymu sydd wedi’u teilwra i bron unrhyw iaith, parth a llif gwaith. Wedi’u pecynnu fel microgwasanaethau NVIDIA NIM, mae modelau Nemotron wedi’u hoptimeiddio ar gyfer cyfrifiadura cost-effeithiol a rhedeg yn unrhyw le, o liniadur i’r cwmwl.

Bydd mentrau Ewrop yn gallu rhedeg modelau agored, sofran ar y peiriant chwilio Perplexity wedi’i bweru gan Ddeallusrwydd Artiffisial.

Dewch i ddechrau arni gyda NVIDIA Nemotron.

Tags: BringingLanguagesNemotronNVIDIA
Admin

Admin

Next Post
Save $40 on a Handmade Dutch Coffeemaker That is Constructed for Life

Save $40 on a Handmade Dutch Coffeemaker That is Constructed for Life

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Trending.

Safety Amplified: Audio’s Affect Speaks Volumes About Preventive Safety

Safety Amplified: Audio’s Affect Speaks Volumes About Preventive Safety

May 18, 2025
Reconeyez Launches New Web site | SDM Journal

Reconeyez Launches New Web site | SDM Journal

May 15, 2025
Apollo joins the Works With House Assistant Program

Apollo joins the Works With House Assistant Program

May 17, 2025
Discover Vibrant Spring 2025 Kitchen Decor Colours and Equipment – Chefio

Discover Vibrant Spring 2025 Kitchen Decor Colours and Equipment – Chefio

May 17, 2025
Flip Your Toilet Right into a Good Oasis

Flip Your Toilet Right into a Good Oasis

May 15, 2025

TechTrendFeed

Welcome to TechTrendFeed, your go-to source for the latest news and insights from the world of technology. Our mission is to bring you the most relevant and up-to-date information on everything tech-related, from machine learning and artificial intelligence to cybersecurity, gaming, and the exciting world of smart home technology and IoT.

Categories

  • Cybersecurity
  • Gaming
  • Machine Learning
  • Smart Home & IoT
  • Software
  • Tech News

Recent News

Streamline entry to ISO-rating content material modifications with Verisk ranking insights and Amazon Bedrock

Streamline entry to ISO-rating content material modifications with Verisk ranking insights and Amazon Bedrock

September 17, 2025
New Shai-hulud Worm Infecting npm Packages With Hundreds of thousands of Downloads

New Shai-hulud Worm Infecting npm Packages With Hundreds of thousands of Downloads

September 17, 2025
  • About Us
  • Privacy Policy
  • Disclaimer
  • Contact Us

© 2025 https://techtrendfeed.com/ - All Rights Reserved

No Result
View All Result
  • Home
  • Tech News
  • Cybersecurity
  • Software
  • Gaming
  • Machine Learning
  • Smart Home & IoT

© 2025 https://techtrendfeed.com/ - All Rights Reserved