A BigG non interessa il linguaggio naturale: “ci interessa il significato delle parole”

L’intervista a Peter Norvig, Director for Search Quality di Google, pubblicata sulla MIT Technology review, ci fa capire due cose:

  1. BigG non crede alle query in linguaggio naturale (NLP)
  2. E’ più facile parlare di NLP quando si sa di cosa si sta parlando

Google, non crede alle query in linguaggio naturale, non pensa, come Powerset, Hakia e qualche altro che l’information retrieval si debba trasformare in question answering per avere successo:

we don’t think it’s a big advance to be able to type something as a question as opposed to keywords. Typing “What is the capital of France?” won’t get you better results than typing “capital of France”

Contemporaneamente però:

We think what’s important about natural language is the mapping of words onto the concepts that users are looking for [...] But understanding how words go together is important. To give some examples, “New York” is different from “York,” but “Vegas” is the same as “Las Vegas,” and “Jersey” may or may not be the same as “New Jersey.”

In effetti è proprio di questo, non di question answering, che si occupano i linguisti: di come certe relazioni di significato sono veicolate da certe espressioni linguistiche (e non da altre).

Tutto sommato è bello sapere che la più grande azienda al mondo che manipola, analizza, indicizza espressioni linguistiche (contenute in alcuni miliardi di documenti creati da alcuni milioni di esseri umani) sa esattamente che cosa sta facendo, e forse sta facendo la cosa giusta.

Nota a margine: Il lancio su slashdot di questa intevista in 24 ore ha accumulato quasi 150 commenti: l’argomento indubbiamente piace.

1 Risposta a “A BigG non interessa il linguaggio naturale: “ci interessa il significato delle parole””



  1. 1 Microsoft non si compra Yahoo, ma si compra Powerset « MediaMeter Trackback su Giugno 27, 2008 alle 8:30 pm

Lascia una Risposta




Subscribe to MediaMeter

RSS Web mining world

  • WolframAlpha: Lots of Potential, Short on Meaning Luglio 10, 2009
    I haven't been formally trained on WolframAlpha nor have I thoroughly investigated it. In fact, I've spent more time reading the hype about it than I have actually kicking the tires. But from the time I've spent, some things are already obvious. First and prominently, WolframAlpha does not rely on semantic technology, neither Semantic Web nor […]
    Array
  • Google's Chrome Operating System: A Revolution in the Making? Luglio 9, 2009
    Too little is known yet about Google's new Chrome OS, but based on the scant information available, it's beginning to look like (a) this might be Google's greatest gauntlet yet, and (b) it just might be a terrific ride for the rest of us as well. The fact is, operating system aficionados (I count myself in that group) have had very little to c […]
    Array
  • Will NYC Go with Google Analytics? Luglio 8, 2009
    I just read an interesting post on Tom Miller's blog summarizing NYC Mayor Bloomberg's keynote note from the Personal Democracy Forum. Miller's take on the speech was that the city may use Google Analytics for optimizing Web site content. From this passage about the Bloomberg presentation, I'd have to agree: The Mayor also announced that […]
    Array
  • Hasso Plattner Calls for In-Memory OLTP Column Stores Luglio 8, 2009
    Former SAP CEO Hasso Plattner has written a paper called A Common Database Approach for OLTP and OLAP Using an In-Memory Column Database, in association with a SIGMOD keynote address.* The approach Plattner advocates is an MPP in-memory column store, presumably somewhat akin to SAP's frequently renamed Business Warehouse Accelerator/Business Intelligenc […]
    Array
  • Summary of Hyperlocal Responses Luglio 8, 2009
    I asked – is Seattle an outlier for concentration of hyperlocal blogging? danah (@zephoria) helped to get the question out in Twitter. Responses included enthusiastic enumerations of cities as well as links to specific hyperlocal content. Take a look at...
    Matthew Hurst
  • Beware: Dynamic Analytics and Static Social Networking May Not Mix Luglio 6, 2009
    As recently reported by Mary Hayes Weier, the enterprise 2.0 social networking geeks at Jive Software (the folks behind the Clearspace wiki platform) have partnered with SAP to make use of BusinessObjects BI on demand offerings. Thus, Jive will enable anyone to embed dynamic analytic widgets into blogs, wikis and discussion forums as shown in this video from […]
    Array
  • Will SharePoint 2007's Success Keep 2010 from Leaving the Station? Luglio 6, 2009
    Those who have studied physics might be familiar with the "story" of the penny and the train. The story goes that if you place a single penny under each wheel of a train, you'll prevent it from moving forward. Essentially, the collective resistance from each penny prevents the train moving forward because it's the same as having the train […]
    Array
  • FeedBurner Help Group Luglio 6, 2009
    FeedBurner Help Group
    Array
  • Is Your City HyperLocal? Luglio 3, 2009
    Seattle resident’s are spoiled for choice when it comes to hyperlocal blogging. In addition to a couple of networks – Next Door Media {My Ballard, PhinneyWood, Magnolia Voice, Queen Anne View, Fremont Universe} and Neighborlogs {Capitol Hill Seattle, Central District...
    Matthew Hurst
  • Interactive Dashboards: US.Gov Aces, Wimbledon Double Faults Luglio 2, 2009
    The US government has unveiled a dashboard to track IT spending, which looks at first sight a shining example of the power and value of business intelligence. The Wimbledon tennis dashboard, on the other hand, appears less than impressive… As reported in The New York Times, the so-called IT Dashboard was put together by the US government in a remarkably sho […]
    Array

mybloglog

Blog Stats

  • 56,784 hits

License