9 August 2016

BabelNet: a Wide-Coverage Multilingual Dictionary

BabelNet is the dictionary of the future: it provides the meanings of words with illustrations, and will soon come with videos and animation. It includes entities as well as words, so a search for apple produces results that contain a picture of the fruit as well as the famous corporate logo.

Its creator, Roberto Navigli, a computer scientist and associate professor at Sapienza University in Rome, calls it BabelNet after the biblical tower and the technology he believes can bridge the world’s languages.

“The idea is to put a lot of resources together, all the resources that people usually access separately,” he says in an interview published on Times.

BabelNet, with 14 million entries and information in 271 languages, is the largest multilingual encyclopedic dictionary and semantic network. It was created by integrating the largest multilingual Web encyclopedia, i.e. Wikipedia, with the most popular computational lexicon of English, i.e. WordNet, and with other lexical resources such as Wiktionary, OmegaWiki, Wikidata, Open Multilingual WordNet, Wikiquote, VerbNet, Microsoft Terminology, GeoNames, WoNeF, ImageNet, ItalWordNet, Open Dutch WordNet and FrameNet.

Version 3.7 comes with the following new features:
  1. New resource integrated: FrameNet (lexical units)
  2. More than 2500 Babel synsets identified as key concepts
  3. Mappings with several versions of WordNet now integrated (from 1.6 to 3.0)
  4. More than 2.6 million Babel synsets labeled with domains (was 1,558,806 in v3.6)

BabelNet is, and always will be, free for research purposes, including download. Babelscape, a Sapienza startup company, is BabelNet's commercial support arm, thanks to which the project will be continued and improved over time.

This is how concepts are displayed, one of the most beautiful features of BabelNet.


Sources:
BabelNet



2 August 2016

What would translation be like if Han Solo didn't exist?

Interview with Jochen Hummel on translation, terminology, and how he created TRADOS.

He is the person every translator has to blame, or be grateful to. Before Jochen Hummel, nobody would have dared to think that translation could have anything to do with CATs.

It was a great honour for me to interview Jochen and to learn his views on the translation industry, terminology and Han Solo...

So, 25 years have passed since you created TRADOS and yet I still don’t see the next big thing ...old system, new makeup. What do you think? It feels great...doesn’t it?

I hope the majority of translators are grateful. At least I know a few who have made big money using TRADOS. That indeed makes me happy. The fact that the basic translation memory architecture hasn’t changed much since TRADOS also makes me feel good. On the other hand, the lack of innovation is a bit scary for the industry. I mean, 25 years is an eternity for software.

There is a joke that the efficiency of the translation technology is greater when the linguists are not involved and indeed your background has nothing to do with translation. I’m sure there is an interesting story behind the creation of Trados...

There surely is. One story nicely illustrates the futility of planning innovation. TRADOS has a powerful function to compute the Translation Memory recycling rate for a given text. It has changed the industry. To this day, this feature defines the way translation services are contracted. I had originally coded it as my sales tool, because translators didn’t want to believe me about how repetitive their work actually is.

You gave superpowers to terms, turning them into “MultiTerms”. For the first time terminology was machine-readable. Unfortunately, today terminology management is still far from being fully adopted in the daily routine of translators. So what’s wrong with us? What’s wrong with terminology?

The same thing that seems to be wrong with other creative professions: people work under pressure and believe they cannot afford to invest in tasks which only pay off later. Big mistake!

While reading your articles and tweets I often encounter terms like “cross-border”, “interoperable”, “DSM”, they sound like gibberish to me! Let’s keep it simple and sweet, how are you going to break the language barriers now? What’s your plan?

I use EU speak because I am targeting professionals working on these issues. Look, breaking the language barrier is not a simple and sweet task. It’s hard and complex, which is also the reason why biz and eGov shy away from the language challenge, although there’s a lot of money to be made there, and for eGov it’s mandatory. I have only a piece of the plan and it would be too lengthy for here. If you want to learn more, go to the Multilingual Knowledge Blog.

You are active in so many other sectors: you are a startup mentor, you founded Metaversum, a 3D world video game and 3D chat community, and now you even run an affordable art gallery in Berlin, I’m impressed!

Oh, thank you. Yes, I am a generalist and interested in a lot of things. Entrepreneurs get quickly excited, be it about a virtual world mirroring real cities or about letting people experience Berlin subculture in a downtown art gallery[1]. On the other hand, though, it means I am not truly exceptional in anything.

Do you like Social Media? What’s your favourite social media platform? I don’t see CEOs using SM very often...

I like social media, as any entrepreneur and manager should. It’s a perfect way to get direct feedback from outside your cage and even to interact with customers or prospects. I am very curious. I have to be careful not to waste time on SM. Therefore Twitter works best for me.

Have you ever tasted “fauxmage[2]”?? Are you familiar with “Virabhadra[3]”?

I confess, I had to google both terms. No, I haven’t tasted fauxmage. Why should I, if I can have the real thing? And Virabhadra?? Ask me rather about Han Solo[4].

How do you feed your mind? Please share with us your secret! What do you read?

Asking questions and listening carefully. I read a lot. Social Media, news, magazines, science and biz books, novels. But top of the list is definitely SciFi. Always loved it. I am sure that many good, and sometimes premature ideas originated from these stories and movies.

Then I can ask you if Han Solo will come back...

A certain T-800 would answer: He’ll be back!

Jochen Hummel at LT Innovate

Entrepreneur/director/mentor with coder background, Jochen Hummel founded TRADOS, the world leader in computer-assisted translation, and Metaversum, a highly innovative startup combining Web 2.0 and virtual worlds.
He has built global organisations, raised venture capital, been involved in M&A on both sides, and held executive positions in development, sales, and general management, as well as board seats.
Jochen Hummel is CEO of ESTeam AB, a provider of advanced language technology and semantic solutions to EU organisations and corporations. He is founder and CEO of Coreon, the most advanced SaaS solution for multilingual knowledge bases. He serves as chairman of LT-Innovate, the Forum for Europe's Language Technology Industry.
You can follow him on Twitter and on Multilingual Knowledge Blog.






[2] Fake cheese: a blend of "faux" and "fromage". It stands for vegan cheese.
[3] Virabhadrasana (Warrior 1) is an asana, a yoga pose, commemorating the exploits of a mythical warrior. The name means “Great Hero”.
[4] Han Solo is a character in Star Wars, one of the greatest film heroes.

14 July 2016

This is why consistent terminology is crucial for user experience (UX)

The User eXperience (UX) describes the interaction of a user with a website. It refers to the communication between the visual and textual data represented on the screen of the computer and the user. One could say that the UX is ‘the smell of a website’.

How quickly a user can make decisions and how efficiently he/she can ‘navigate’ a website depends on various factors which are studied by the developers of the website. The developers’ aim is to create a friendly and easy environment for their consumers by paying attention not only to the images, colours, templates or other attractive visual features of their website but also to the textual representation. That means that UX is about the interface between graphics and content. A user is first attracted by the colours, the visual representations and the general feel of the website, but at the next and most important level he/she needs to obtain some information, complete a task and interact with the website. If we imagine a website consisting only of images and colourful boxes, it is beyond a shadow of a doubt that no effective interaction can take place.

How do you interact with this pop-up?


Text, thus, is crucial as it provides the most significant information for the user (e.g. login, payment, donate, cancel, etc.). The user needs the textual data. Nevertheless, the user does not want to think. He/she does not want to spend hours looking for information, completing a registration or making an electronic payment. He/she needs to save time, and that relies on the accuracy and the consistency of the terms which are used. As Bill Gates said, ‘Content is King’; however, as I often point out, ‘Terminology is Queen’.

The text that is displayed should be clear, simple, understandable, up to date and based on the perspective of the user. It should not cause any misunderstanding or confusion.



14 June 2016

Because I'm worth it! More on terminology research on the invoice

I received many comments on my last post about factoring terminology research into the translator's rate. Of all of them, I particularly liked Elisa Farina's contribution, and I decided it was worth turning into a post (with Elisa's permission, of course!).

According to Elisa, it might be more effective to build the terminology research factor into the per-word rate. As I wrote in one of my comments on Google+, sometimes you invest an hour searching for the right equivalent term, and this is a situation many of us find ourselves in very often.



The considerable time spent on terminology research is undoubtedly a handicap for anyone who calculates their fee per word.

But how can this aspect be included in the invoice? As a separate item, in line with what Debora proposed? And in what way? By adding an hourly rate based on an estimate of the time you expect to devote to research? Or one calculated afterwards, based on the time actually invested?

Both solutions seem risky. The most obvious drawback of the first option is that the translator rarely has time to read the entire source text at the quotation stage, so the estimate is unlikely to be accurate. The second approach, on the other hand, risks scaring the client (because there is no fixed quote before the job is confirmed) or infuriating them (if the final invoice amount is too far above expectations).

According to Elisa, then, a good idea might be to work on educating the client and raising their awareness, emphasising the terminological aspect to justify a higher per-word rate. That means making the client aware, at the quotation stage, of the intrinsic difficulties of translating the text, and of the risk (including the financial risk) of a wrong translation and an incorrect choice of terms. And, of course, depending on the case, of the other aspects mentioned by Debora (localisation, transcreation, etc.). In short, the battlefield needs to shift from the structure of the invoice to the pre-quote negotiations.

I also agree with the comment from dear Daniela Vellutino, who rightly adds that these cost items should also be included in the work of web curators and communicators in general.

And what do you think? You can post your comments on Google+!
22/06/2016: you can also contact me on Facebook! I joined a couple of days ago...

I found a wonderful post by Allison Wright, The terminological minefield, which I invite you to read, even several times!

Here is a summary of the passages I liked most:

Terminology goes well beyond the correct choice of the equivalent term in the target language. It is much more than selecting the right term from bilingual glossaries created by other translators or other organisations. It is much more than using what your sixth sense tells you is the most likely choice from a translation memory created by third parties in a CAT tool.

What you need in order to use the correct term is knowledge of the world in which these terms appear, the space those terms inhabit.

Unless you are an expert and highly qualified in the field being translated, you will never be able to guarantee terminological accuracy if you are in a hurry. Terminology research takes time: time to consult different sources, and time to ask for and receive suggestions and assistance from colleagues.


This image from her post sums up her point of view well, and mine too!