Neural networks for the treatment of flexible natural languages (Q3056361)

Modernu IKT risinājumu izveide nav iedomājama bez valodu tehnoloģijām, kuras nodrošina risinājumu pieejamību lietotāja dzimtajā valodā. META-NET veiktais pētījums (Rehm&Uszkoreit, 2013) norāda uz būtiskām atšķirībām valodas tehnoloģiju attīstības līmenī. Pasaules lielajām valodām tiek piešķirts vērtējums “no laba līdz izcilam”, bet valodām ar nelielu runātāju skaitu - “no vidēja līdz zemam”. Šīs plaisas mazināšana ir liels izaicinājums, mūsdienu valodu tehnoloģijās izmantotajām metodēm nepieciešami lieli datu apjumi, kuru īpaši trūkst mazajām fleksīvām valodām ar bagātu morfoloģiju. Viena no jaunākajām, daudzsološākajām tendencēm IKT risinājumu veidošanā ir neironu tīklu tehnoloģiju (NT) izmantošana. NT tiek veiksmīgi izmantoti tādām valodām kā angļu, franču, spāņu un ķīniešu valodām, taču dziļi pētījumi nelielām valodām ar nepietiekamiem valodas resursiem, piemēram, baltu, somu-ugru un slāvu valodām vēl nav veikti. Nepieciešamība mazināt plaisu valodu tehnoloģiju attīstībā un perspektīvā NT tehnoloģijas izmantošana paver lielu iespēju projekta komandai Rīgā, Latvijā. Šis projekts un tā sagaidāmie rezultāti Latvijai ļaus ieņemt vadošo pozīciju valodu tehnoloģiju izpētē un inovatīvu risinājumu radīšanā. Latvija jau šobrīd ir sasniegusi augstu līmeni šajā jomā, to nodrošinājusi cieša akadēmiskās vides un komerciālo struktūru sadarbība. Latvija līdzdarbojas pētniecībā ar citām Eiropas nācijām un ir piemērota valodu tehnoloģiju izpētes un inovāciju reģionālā centra izveidei.Projektā tiks pētīti, analizēti un novērtēti (gan ar automātiskām metodēm, gan manuāli) dažādi NT algoritmi, lai atrastu piemērotākos risinājumus vairākiem nozīmīgiem valodas tehnoloģiju uzdevumiem:- rakstītā teksta apstrādē – tekstu sintaktiskai analīzei, gramatiskuma novērtēšanai un gramatisko kļūdu labošanai (šos komponentus varēs izmantot arī automatizētas tulkošanas, runātās valodas un cilvēka-datora saziņas problēmu risināšanā); - runas tehnoloģijās – efektīvā runas atpazīšanā un runas sintēzē, jaunu datu ieguvē un sagatavošanā, ja pieejamais datu apjoms ir neliels;- automatizētā tulkošanā – labākā dažādu teksta vienību tulkojumu izveidē; automatizētā vārdnīcu izveidē;- cilvēka-datora saziņas modelēšanā – dabiskās valodas sapratnes, dialoga pārvaldības un dabiskās valodas ģenerēšanas uzdevumos. Tiks pētīti arī dialoga modelēšanas, zināšanu izguves un pārvaldības kognitīvie aspekti un to modelēšana neironu tīklos. Tiks pētītas dažādas NT arhitektūras, tostarp dziļie rekursīvie (angliski: deep recurrent), dziļie vienvirziena (angliski: feed-forward) NT, un efektīvi NT apmācības algoritmi, lai risinātu datu skrajuma, morfoloģiskās daudzveidības un resursu nepietiekamības problēmas. Tiks veikta kognitīvo procesu – valodas, vizuālās uztveres, semantiskās apstrādes – empīriska (korelatīva, eksperimentāla) izpēte. Iegūtās zināšanas tiks izmantotas inovatīvu fleksīvo valodu apstrādes metožu radīšanā. Pētījuma teorētiskie rezultāti tiks apkopoti vismaz 15 Web of Science vai SCOPUS datubāzēs iekļauto žurnālu vai konferenču rakstu krājumu publikācijās.Pētījuma teorētiskie rezultāti tiks pārbaudīti laboratoriskos prototipos (TRL 4), kā arī apvienoti prototipā cilvēka-datora saziņai (TRL 5). Veiksmīgākie risinājumi tiks publicēti kā atvērtā koda risinājumi.Plānoto pētījumu rezultātiem ir liels komercializācijas potenciāls, tāpēc pēc projekta beigām turpināsies to attīstīšana un stabilizēšana, lai izveidotu inovatīvus programmproduktus Latvijas un Eiropas tirgum.Projekts ir ar saimniecisko darbību saistīts projekts, tas tiks īstenots sadarbībā starp komersantu (Tilde) un pētniecības institūciju (Latvijas Universitāte) un ilgs no 2016. gada 1. novembra līdz 2019. gada 31. oktobrim. Projekta kopējas izmaksas ir 690 672,13 EUR, t.sk., ERAF atbalsts 484 384,04 EUR. Projekta pētniecības kategorija - eksperimentālā izstrāde un rūpnieciskie pētījumi. Projekts pēc Frascati rokasgrāmatas atbilst IKT zinātnes nozarei “1.2 Computer and information sciences” (Latvian)

0 references

The development of modern ICT solutions is inconceivable without language technologies ensuring the availability of solutions in the user’s mother tongue. The meta-NET study (Rehm&Uszkoreit, 2013) points to significant differences in the level of development of language technologies. The world’s great languages are rated “good to excellent”, while languages with a small number of speakers are rated “from medium to low”. Bridging this gap is a major challenge, the methods used in modern language technologies require large data penetrations, which are particularly lacking for the small flexive languages with rich morphology. One of the most promising developments in ICT solutions is the use of Neural Network Technologies (NT). NTs are successfully used in languages such as English, French, Spanish and Chinese, but deep studies have not yet been carried out in small languages with insufficient linguistic resources, such as white, Finnish-Ugric and Slavic languages. The need to bridge the gap in the development of language technologies and the prospective use of NT technology opens a great opportunity for the project team in Riga, Latvia. This project and its expected results will allow Latvia to take a leading position in the research of language technologies and the creation of innovative solutions. Latvia has already reached a high level in this field, ensured by close cooperation between the academic environment and commercial structures. Latvia participates in research with other European nations and is suitable for the creation of a regional centre for research and innovation of language technologies.The project will explore, analyse and evaluate various NT algorithms (both automatic methods and manually) to find the most suitable solutions for several important tasks of language technologies:- processing of written text – syntactic analysis of texts, evaluation of grammar and correction of grammar errors (these components will also be used for automated translation, spoken language and human-computer communication problems); In speech technologies – effective speech recognition and speech synthesis, acquisition and preparation of new data where the amount of data available is small;- automated translation – for the creation of the best translations of different text units; Automated dictionaries;- human-computer communication modelling – tasks of understanding the natural language, managing dialogue and generating natural language. Cognitive aspects of dialogue modelling, knowledge retrieval and management and their modelling in neural networks will also be studied. Different NT architectures will be studied, including deep recursive (in English: Deep Recurrent), deep one-way (in English: Feature-forward) NT, and efficient NT training algorithms to address data scratching, morphological diversity and resource scarcity problems. Cognitive processes – language, visual perception, semantic processing – empirical (correlative, experimental) will be studied. The acquired knowledge will be used to create innovative flexible language processing methods. The theoretical results of the study will be summarised in the publications of at least 15 journals or conference articles from Web of Science or SCOPUS databases.The theoretical results of the study will be tested in laboratory prototypes (TRL 4), as well as pooled human-computer communication (TRL 5). The most successful solutions will be published as open source solutions.The results of the planned research have great potential for commercialisation, therefore after the end of the project their development and stabilisation will continue in order to create innovative software products for the Latvian and European market.The project is a project related to economic activities, it will be implemented in cooperation between the merchant (Tilde) and research institution (University of Latvia) and will run from November 1, 2016 to October 31, 2019. The total cost of the project is EUR 690672,13, including ERDF support EUR 484384,04. Research category of the project – experimental development and industrial research. The project, according to the Frascati handbook, corresponds to the ICT science industry “1.2 Computer and information sciences” (English)

point in time

15 July 2021

0 references

Le développement de solutions TIC modernes est inconcevable sans les technologies linguistiques qui rendent les solutions disponibles dans la langue maternelle de l’utilisateur. L’étude méta-NET (Rehm&Uszkoreit, 2013) fait état de différences significatives dans le niveau de développement des technologies linguistiques. Les principales langues du monde sont classées «de bonne à excellente» et «moyenne à faible» pour les langues avec un petit nombre de locuteurs. Combler cette lacune est un défi majeur, les méthodes utilisées dans les technologies du langage moderne exigent de grandes quantités de données, qui font particulièrement défaut dans les petites langues flexibles avec une morphologie riche. L’une des tendances les plus récentes et les plus prometteuses en matière de solutions TIC est l’utilisation des technologies de réseau neuronal (NT). Le NT est utilisé avec succès pour des langues telles que l’anglais, le français, l’espagnol et le chinois, mais des recherches approfondies sur les petites langues dont les ressources linguistiques sont insuffisantes, telles que le blanc, le finnois et le slave, n’ont pas encore été menées. La nécessité de combler l’écart dans le développement des technologies linguistiques et l’utilisation future de la technologie NT ouvre une excellente occasion à l’équipe de projet à Riga, en Lettonie. Ce projet et les résultats escomptés permettront à la Lettonie de prendre une position de premier plan dans la recherche des technologies linguistiques et la création de solutions innovantes. La Lettonie a déjà atteint un niveau élevé dans ce domaine, grâce à une coopération étroite entre les universités et les organismes commerciaux. La Lettonie participe à la recherche avec d’autres pays européens et convient à la création d’un centre régional de recherche et d’innovation dans le domaine des technologies linguistiques. Le projet explorera, analysera et évaluera divers algorithmes NT (à la fois automatiques et manuellement) afin de trouver les solutions les plus appropriées à un certain nombre de défis importants en matière de technologie linguistique:- traitement de texte écrit — analyse syntaxique des textes, évaluation grammaticale et correction des erreurs grammaticales (ces composants peuvent également être utilisés pour résoudre les problèmes de traduction automatisée, de langue orale et de communication entre l’homme et l’ordinateur); — dans les technologies de la parole — reconnaissance vocale efficace et synthèse vocale, l’acquisition et la production de nouvelles données lorsque la quantité de données disponibles est faible;- dans la traduction automatisée, la meilleure traduction de différentes unités de texte; création automatisée de vocabulaire;- modélisation de la communication homme-ordinateur — compréhension du langage naturel, gestion du dialogue et tâches de génération de langage naturel. Les aspects cognitifs de la modélisation du dialogue, de l’extraction et de la gestion des connaissances ainsi que de leur modélisation dans les réseaux neuronaux seront également étudiés. Diverses architectures NT, y compris les architectures profondes récursives, seront explorées (en anglais: profonde récurrente), profonde à sens unique (en anglais: en amont) NT, et des algorithmes de formation NT efficaces pour traiter de la numérisation des données, de la diversité morphologique et de la rareté des ressources. Les processus cognitifs — langage, perception visuelle, traitement sémantique — empiriques (correlatifs, expérimentaux) seront étudiés. Les connaissances acquises seront utilisées pour créer des méthodes de traitement linguistique flexibles et innovantes. Les résultats théoriques de l’étude seront recueillis dans au moins 15 publications de revues ou d’articles de conférence inclus dans le Web of Science ou les bases de données SCOPUS. Les résultats théoriques de l’étude seront testés dans des prototypes de laboratoire (TRL 4), ainsi que combinés pour le prototypage de la communication homme-ordinateur (TRL 5). Les solutions les plus réussies seront publiées comme des solutions open source. Les résultats de la recherche planifiée ont un grand potentiel de commercialisation, donc après la fin du projet, ils continueront à être développés et stabilisés afin de créer des produits logiciels innovants pour le marché letton et européen. Le projet est un projet lié à l’activité économique, il sera mis en œuvre en coopération entre le commerçant (Tilde) et l’institution de recherche (Université de Lettonie) et durera du 1er novembre 2016 au 31 octobre 2019. Le coût total du projet s’élève à 690 672,13 EUR, y compris le soutien du FEDER 484 384,04 EUR, catégorie de recherche de projet — développement expérimental et recherche industrielle. Le projet du Manuel Frascati correspond au secteur des TIC «1.2 Informatique et sciences de l’information» (French)

point in time

25 November 2021

0 references

Die Entwicklung moderner IKT-Lösungen ist ohne Sprachtechnologien unvorstellbar, die Lösungen in der Muttersprache des Nutzers zur Verfügung stellen. Die Meta-NET-Studie (Rehm&Uszkoreit, 2013) deutet auf erhebliche Unterschiede im Niveau der Sprachtechnologieentwicklung hin. Die wichtigsten Sprachen der Welt werden als „von gut bis hervorragend“ und „mittel bis niedrig“ für Sprachen mit einer kleinen Anzahl von Sprechern bewertet. Diese Lücke zu überbrücken ist eine große Herausforderung, denn die Methoden der modernen Sprachtechnologien erfordern große Datenmengen, die in den kleinen, flexiblen Sprachen mit reicher Morphologie besonders fehlen. Einer der neuesten und vielversprechendsten Trends bei IKT-Lösungen ist der Einsatz neuronaler Netzwerktechnologien (NT). NT wird erfolgreich für Sprachen wie Englisch, Französisch, Spanisch und Chinesisch verwendet, aber es wurde noch keine gründliche Erforschung kleiner Sprachen mit unzureichenden Sprachressourcen wie weißer, finnischer und slawischer Sprache durchgeführt. Die Notwendigkeit, die Lücke bei der Entwicklung von Sprachtechnologien und dem künftigen Einsatz von NT-Technologien zu überbrücken, eröffnet dem Projektteam in Riga (Lettland) eine große Chance. Dieses Projekt und die erwarteten Ergebnisse werden es Lettland ermöglichen, eine führende Position bei der Erforschung von Sprachtechnologien und der Entwicklung innovativer Lösungen zu übernehmen. Lettland hat in diesem Bereich bereits ein hohes Niveau erreicht, dank der engen Zusammenarbeit zwischen Wissenschaft und Wirtschaft. Lettland beteiligt sich an der Forschung mit anderen europäischen Nationen und eignet sich für die Schaffung eines regionalen Zentrums für Sprachtechnologieforschung und -innovation. Das Projekt wird verschiedene NT-Algorithmen (sowohl automatische Methoden als auch manuell) untersuchen, analysieren und bewerten, um die am besten geeigneten Lösungen für eine Reihe wichtiger sprachlicher technologischer Herausforderungen zu finden:- schriftliche Textverarbeitung – syntaktische Analyse von Texten, grammatikalische Auswertung und Korrektur von grammatikalischen Fehlern (diese Komponenten können auch zur Lösung von Problemen der automatisierten Übersetzung, gesprochener Sprache und Mensch-Computer-Kommunikation verwendet werden); — in Sprachtechnologien – effektive Spracherkennung und Sprachsynthese, die Erfassung und Erstellung neuer Daten, bei denen die verfügbare Datenmenge gering ist;- in automatisierter Übersetzung die beste Übersetzung verschiedener Texteinheiten; automatisierte Vokabular-Erstellung;- Human-Computer-Kommunikationsmodellierung – natürliches Sprachverständnis, Dialogmanagement und natürliche Sprachgenerierung. Auch kognitive Aspekte der Dialogmodellierung, des Wissensabrufs und des Managements und deren Modellierung in neuronalen Netzwerken werden untersucht. Verschiedene NT-Architekturen, darunter tiefe rekursive, werden erkundet (auf Englisch: Tiefwiederholung), tiefe Einbahn (auf Englisch: NT und effektive NT-Trainingsalgorithmen zur Datenabtastung, morphologischer Vielfalt und Ressourcenknappheit. Kognitive Prozesse – Sprache, visuelle Wahrnehmung, semantische Verarbeitung – empirisch (korrelativ, experimentell) werden untersucht. Das erworbene Wissen wird genutzt, um innovative flexible Methoden der Sprachverarbeitung zu schaffen. Die theoretischen Ergebnisse der Studie werden in mindestens 15 Publikationen von Zeitschriften oder Tagungsartikeln in Web of Science oder SCOPUS-Datenbanken gesammelt. Die theoretischen Ergebnisse der Studie werden in Laborprototypen (TRL 4) sowie kombiniert zur Prototypierung der Mensch-Computer-Kommunikation (TRL 5) getestet. Die erfolgreichsten Lösungen werden als Open-Source-Lösungen veröffentlicht. Die Ergebnisse der geplanten Forschung haben ein großes Potenzial für die Kommerzialisierung, so dass sie nach Ende des Projekts weiter entwickelt und stabilisiert werden, um innovative Softwareprodukte für den lettischen und europäischen Markt zu schaffen. Das Projekt ist ein Projekt im Zusammenhang mit der Wirtschaftstätigkeit, es wird in Zusammenarbeit zwischen dem Händler (Tilde) und Forschungsinstitution (Universität Lettland) umgesetzt und wird vom 1. November 2016 bis zum 31. Oktober 2019 laufen. Die Gesamtkosten des Projekts belaufen sich auf 690 672,13 EUR, einschließlich EFRE-Unterstützung 484 384,04 EUR Projektforschung – experimentelle Entwicklung und industrielle Forschung. Das Frascati Handbook-Projekt entspricht dem IKT-Wissenschaftssektor „1.2 Informatik und Informatik“ (German)

point in time

28 November 2021

0 references

De ontwikkeling van moderne ICT-oplossingen is ondenkbaar zonder taaltechnologieën die oplossingen beschikbaar stellen in de moedertaal van de gebruiker. De meta-NET studie (Rehm&Uszkoreit, 2013) wijst op significante verschillen in het niveau van de ontwikkeling van taaltechnologie. „S werelds belangrijkste talen worden beoordeeld „van goed tot uitstekend” en „medium tot laag” voor talen met een klein aantal sprekers. Het overbruggen van deze kloof is een grote uitdaging, de methoden die worden gebruikt in moderne taaltechnologieën vereisen grote hoeveelheden gegevens, die vooral ontbreken in de kleine, flexibele talen met een rijke morfologie. Een van de nieuwste en meest veelbelovende trends in ICT-oplossingen is het gebruik van neurale netwerktechnologieën (NT). NT wordt met succes gebruikt voor talen als Engels, Frans, Spaans en Chinees, maar diep onderzoek naar kleine talen met onvoldoende taalbronnen zoals wit, Fins en Slavische talen is nog niet uitgevoerd. De noodzaak om de kloof in de ontwikkeling van taaltechnologieën en het toekomstige gebruik van NT-technologie te overbruggen, biedt een grote kans voor het projectteam in Riga, Letland. Dit project en de verwachte resultaten ervan zullen Letland in staat stellen een leidende positie in te nemen bij het onderzoek naar taaltechnologieën en het creëren van innovatieve oplossingen. Letland heeft op dit gebied al een hoog niveau bereikt dankzij nauwe samenwerking tussen de academische wereld en commerciële instanties. Letland neemt deel aan onderzoek met andere Europese landen en is geschikt voor de oprichting van een regionaal centrum voor taaltechnologieonderzoek en -innovatie. Het project zal verschillende NT-algoritmen (zowel automatische als handmatige) onderzoeken, analyseren en evalueren om de meest geschikte oplossingen te vinden voor een aantal belangrijke uitdagingen op het gebied van taaltechnologie:- geschreven tekstverwerking — syntactische analyse van teksten, grammaticale evaluatie en correctie van grammaticale fouten (deze componenten kunnen ook worden gebruikt om problemen op het gebied van geautomatiseerde vertaling, gesproken taal en mens-computercommunicatie op te lossen); — in spraaktechnologieën — effectieve spraakherkenning en spraaksynthese, de verwerving en productie van nieuwe gegevens waarbij de hoeveelheid beschikbare gegevens klein is;- in geautomatiseerde vertaling, de beste vertaling van verschillende teksteenheden; geautomatiseerde vocabulaire creatie;- mens-computer communicatie modellering — natuurlijke taal begrijpen, dialoog management en natuurlijke taal generatie taken. Ook de cognitieve aspecten van dialoogmodellering, kennisopvraging en -beheer en hun modellering in neurale netwerken zullen worden onderzocht. Verschillende NT architecturen, waaronder diepe recursieve, zullen worden onderzocht (in het Engels: diep terugkerend), diep eenrichtingsverkeer (in het Engels: feed-forward) NT, en effectieve NT-trainingsalgoritmen om gegevensscanning, morfologische diversiteit en hulpbronnenschaarste aan te pakken. Cognitieve processen — taal, visuele waarneming, semantische verwerking — empirisch (correlatief, experimenteel) zullen worden bestudeerd. De verworven kennis zal worden gebruikt om innovatieve flexibele taalverwerkingsmethoden te creëren. De theoretische resultaten van het onderzoek worden verzameld in ten minste 15 publicaties van tijdschriften of conferentieartikelen die zijn opgenomen in Web of Science of SCOPUS-databases. De theoretische resultaten van de studie worden getest in laboratoriumprototypes (TRL 4) en gecombineerd voor prototyping van mens-computercommunicatie (TRL 5). De meest succesvolle oplossingen zullen worden gepubliceerd als open source oplossingen. De resultaten van het geplande onderzoek hebben een groot potentieel voor commercialisering, dus na het einde van het project zullen ze verder worden ontwikkeld en gestabiliseerd om innovatieve softwareproducten te creëren voor de Letse en Europese markt. Het project is een project met betrekking tot economische activiteit, het zal worden uitgevoerd in samenwerking tussen de handelaar (Tilde) en onderzoeksinstelling (Universiteit van Letland) en zal duren van 1 november 2016 tot 31 oktober 2019. De totale kosten van het project bedragen 690 672,13 EUR, inclusief EFRO-steun 484 384,04 EUR. Projectonderzoek — experimentele ontwikkeling en industrieel onderzoek. Het Frascati Handbook-project komt overeen met de ICT-wetenschapssector „1.2 Computer- en informatiewetenschappen” (Dutch)

point in time

28 November 2021

0 references

Lo sviluppo di soluzioni TIC moderne è inconcepibile senza tecnologie linguistiche che rendano disponibili soluzioni nella lingua madre dell'utente. Lo studio meta-NET (Rehm&Uszkoreit, 2013) evidenzia differenze significative nel livello di sviluppo della tecnologia linguistica. Le principali lingue del mondo sono valutate "da buono a eccellente" e "da medio a basso" per le lingue con un piccolo numero di oratori. Colmare questa lacuna è una sfida importante, i metodi utilizzati nelle moderne tecnologie linguistiche richiedono grandi quantità di dati, che sono particolarmente carenti nei linguaggi piccoli e flessibili con una ricca morfologia. Una delle tendenze più recenti e più promettenti nelle soluzioni TIC è l'uso delle tecnologie di rete neurale (NT). NT è utilizzato con successo per lingue come l'inglese, il francese, lo spagnolo e il cinese, ma non è stata ancora effettuata una profonda ricerca sulle lingue piccole con risorse linguistiche insufficienti come le lingue bianche, finlandesi e slave. La necessità di colmare il divario nello sviluppo delle tecnologie linguistiche e l'uso prospettico della tecnologia NT apre una grande opportunità per il team di progetto a Riga, in Lettonia. Questo progetto e i risultati attesi consentiranno alla Lettonia di assumere una posizione di primo piano nella ricerca delle tecnologie linguistiche e nella creazione di soluzioni innovative. La Lettonia ha già raggiunto un livello elevato in questo settore, grazie a una stretta cooperazione tra il mondo accademico e gli organismi commerciali. La Lettonia partecipa alla ricerca con altre nazioni europee ed è idonea alla creazione di un centro regionale per la ricerca e l'innovazione nel campo delle tecnologie linguistiche. Il progetto esplorerà, analizzerà e valuterà vari algoritmi NT (sia metodi automatici che manualmente) al fine di trovare le soluzioni più adatte a una serie di importanti sfide tecnologiche linguistiche:- elaborazione del testo scritto — analisi sintattica dei testi, valutazione grammaticale e correzione di errori grammaticali (queste componenti possono essere utilizzate anche per risolvere problemi di traduzione automatica, lingua parlata e comunicazione uomo-computer); — nelle tecnologie vocali — riconoscimento vocale efficace e sintesi vocale, acquisizione e produzione di nuovi dati in cui la quantità di dati disponibili è limitata;- nella traduzione automatica, la migliore traduzione di diverse unità di testo; creazione automatica del vocabolario;- modellazione di comunicazione uomo-computer — comprensione del linguaggio naturale, gestione del dialogo e compiti di generazione del linguaggio naturale. Saranno inoltre esplorati gli aspetti cognitivi della modellizzazione del dialogo, del recupero e della gestione delle conoscenze e della loro modellazione nelle reti neurali. Saranno esplorate varie architetture NT, comprese quelle profonde ricorsive (in inglese: profondo ricorrente), profondo a senso unico (in inglese: feed-forward) NT, e algoritmi di addestramento NT efficaci per affrontare la scansione dei dati, la diversità morfologica e la scarsità di risorse. Verranno studiati processi cognitivi — linguaggio, percezione visiva, elaborazione semantica — empirici (correlativi, sperimentali). Le conoscenze acquisite saranno utilizzate per creare metodi innovativi e flessibili di elaborazione del linguaggio. I risultati teorici dello studio saranno raccolti in almeno 15 pubblicazioni di riviste o articoli di conferenza inclusi nelle banche dati Web of Science o SCOPUS. I risultati teorici dello studio saranno testati in prototipi di laboratorio (TRL 4) e combinati per la prototipazione della comunicazione uomo-computer (TRL 5). Le soluzioni di maggior successo saranno pubblicate come soluzioni open source. I risultati della ricerca pianificata hanno un grande potenziale di commercializzazione, quindi dopo la fine del progetto continueranno a essere sviluppati e stabilizzati al fine di creare prodotti software innovativi per il mercato lettone ed europeo. Il progetto è un progetto relativo all'attività economica, sarà attuato in collaborazione tra il commerciante (Tilde) e l'istituto di ricerca (Università di Lettonia) e durerà dal 1º novembre 2016 al 31 ottobre 2019. Il costo totale del progetto è di 690 672,13 EUR, compreso il sostegno FESR 484 384,04 EUR. Categoria di ricerca del progetto — sviluppo sperimentale e ricerca industriale. Il progetto Frascati Handbook corrisponde al settore della scienza delle TIC "1.2 Scienze informatiche e dell'informazione" (Italian)

point in time

11 January 2022

0 references

El desarrollo de soluciones TIC modernas es inconcebible sin tecnologías lingüísticas que hagan que las soluciones estén disponibles en la lengua materna del usuario. El estudio meta-NET (Rehm&Uszkoreit, 2013) apunta a diferencias significativas en el nivel de desarrollo de la tecnología lingüística. Los idiomas más importantes del mundo se clasifican «de bueno a excelente» y «medio a bajo» para los idiomas con un pequeño número de hablantes. Colmar esta brecha es un gran desafío, los métodos utilizados en las tecnologías del lenguaje moderna requieren grandes cantidades de datos, que son particularmente inexistentes en los lenguajes pequeños y flexibles con una rica morfología. Una de las tendencias más recientes y prometedoras en las soluciones de TIC es el uso de las tecnologías de redes neuronales (NT). NT se utiliza con éxito en idiomas como el inglés, el francés, el español y el chino, pero aún no se ha llevado a cabo una investigación profunda sobre lenguas pequeñas con recursos lingüísticos insuficientes como las lenguas blanca, finlandesa y eslava. La necesidad de colmar la brecha en el desarrollo de las tecnologías del lenguaje y el uso prospectivo de la tecnología NT abre una gran oportunidad para el equipo del proyecto en Riga, Letonia. Este proyecto y sus resultados esperados permitirán a Letonia asumir una posición de liderazgo en la investigación de las tecnologías lingüísticas y la creación de soluciones innovadoras. Letonia ya ha alcanzado un alto nivel en este ámbito, gracias a una estrecha cooperación entre el mundo académico y los organismos comerciales. Letonia participa en la investigación con otras naciones europeas y es adecuada para la creación de un centro regional de investigación e innovación en tecnología lingüística. El proyecto explorará, analizará y evaluará varios algoritmos NT (tanto automáticos como manualmente) con el fin de encontrar las soluciones más adecuadas para una serie de retos importantes en materia de tecnología lingüística:- procesamiento de textos escritos — análisis sintáctico de textos, evaluación gramatical y corrección de errores gramaticales (estos componentes también pueden utilizarse para resolver problemas de traducción automatizada, lenguaje hablado y comunicación humano-ordenador); — en tecnologías del habla — reconocimiento efectivo y síntesis del habla, la adquisición y producción de nuevos datos donde la cantidad de datos disponibles es pequeña;- en la traducción automatizada, la mejor traducción de diferentes unidades de texto; creación automatizada de vocabulario;- modelado de comunicación humano-ordenador — comprensión del lenguaje natural, gestión del diálogo y tareas de generación de lenguaje natural. También se explorarán los aspectos cognitivos de la modelización del diálogo, la recuperación y gestión del conocimiento y su modelización en las redes neuronales. Se explorarán varias arquitecturas NT, incluidas las arquitecturas recursivas profundas (en inglés: recidivante profundo), unidireccional profundo (en inglés: feed-forward) NT, y algoritmos efectivos de entrenamiento de NT para abordar el escaneo de datos, la diversidad morfológica y la escasez de recursos. Se estudiarán los procesos cognitivos — lenguaje, percepción visual, procesamiento semántico — empírico (correlativo, experimental). Los conocimientos adquiridos se utilizarán para crear métodos innovadores y flexibles de procesamiento de idiomas. Los resultados teóricos del estudio se recogerán en al menos 15 publicaciones de revistas o artículos de conferencia incluidos en bases de datos Web of Science o SCOPUS. Los resultados teóricos del estudio serán probados en prototipos de laboratorio (TRL 4), así como combinados para la creación de prototipos de comunicación humano-ordenador (TRL 5). Las soluciones más exitosas se publicarán como soluciones de código abierto. Los resultados de la investigación planificada tienen un gran potencial de comercialización, por lo que tras el final del proyecto continuarán siendo desarrollados y estabilizados con el fin de crear productos de software innovadores para el mercado letón y europeo. El proyecto es un proyecto relacionado con la actividad económica, se ejecutará en cooperación entre el comerciante (Tilde) y la institución de investigación (Universidad de Letonia) y tendrá una duración del 1 de noviembre de 2016 al 31 de octubre de 2019. El coste total del proyecto asciende a 690 672,13 EUR, incluida la ayuda del FEDER 484 384,04 EUR. El proyecto Frascati Handbook corresponde al sector de las TIC «1.2 Ciencias de la informática y la información» (Spanish)

point in time

12 January 2022

0 references

location (string)

Vienības gatve 75, Rīga, LV-1004

0 references

Raiņa bulvāris 19, Rīga, LV-1050

0 references

Identifiers

Latvian Kohesio ID

1.1.1.1/16/A/215

0 references

Neural networks for the treatment of flexible natural languages (Q3056361)

Statements

Identifiers

Navigation menu

Search