una difficile googleata
E’ proprio vero che chi si preoccupa di distribuire il proprio prodotto in tutto il mondo deve fare i conti con le più diverse etimologie: Daphne Dembo(Google search engineer) prova in questo articolo a condividere i punti cruciali della distribuzione di google search nell’universo mondo, sottolineando i punti critici di questa *conquista* della semantica (.. o almeno ci prova!!):
…Our first step in providing search support for a language is to train our language model on a large collection of documents in that language. This ensures that our language model is more precise and comprehensive — for example, it incorporates names, idioms, colloquial usage, and newly coined words not often found in static dictionaries. For instance, we recently started identifying Swahili, and used pages such as this one for the Parliament of Tanzania to train our system with the language’s nuances. Having a trained language model helps to categorize documents during crawling and indexing of the web and to parse the user’s query… read the full version
| Print article | This entry was posted by rp on November 24, 2008 at 2:37 pm, and is filed under Research, Technology. Follow any responses to this post through RSS 2.0. You can leave a response or trackback from your own site. |
