Archive for Iraila, 2009

word sense disambiguation

Word sense disambiguation is the process used to identify which sense of the word is being used in each sentence, when a word has more than one sense. However, this has some problems.

The first problem is that the different meanings of the words sometimes are much closed, so it is difficult to know which one is being used. Another problem is that these systems are tested by humans, and humans don’t agree which the sense of each word is, so it’s impossible for the computer to know the right answer.

We have two different approaches, deep and shallow.

Deep approaches, give an explanation to each sense of the word, but this is impossible in computer format. Shallow approaches, however, analyses the words of the surroundings and decides which of the different meaning is, but it is a problem if words of more than one sense are arround.

References:

retrieved from wikipedia the free encyclopedia, sep. 06 11:29

Iraila 6, 2009 at 11:56 am Iruzkin bat utzi

CATEGORISATION

Categorisation is to recognize, differentiate and understand ideas. In this process objects that have the same relation are put in categories or groups. There are lots of categorisation techniques but the most general ones are:
*classical categorisation
*conceptual clustering
*prototype theory

Classical categorisation:
This type of categorisation started with Plato, who separates objects based on their similar properties. Then this method was also used by Aristotle, who uses it to separate living beings into groups. In this type of categorisation groups or categories should be defined and each object has to be in one of the groups, no one can be without category.

Conceptual clustering:
In this type of categorisation, first we describe the objects and then, according to their description we classify them. The difference from the classical one is that here, we have one description for each category. Here, objects can belong to more than one category.

Prototype theory:

In prototype theory, some things that are in the same category are more central than others, is more possible to say chair when asked for furniture and not a stool, or an eagle when asked for a bird and not a penguin. This is because we have models for each category.

In prototype categorisation, we have basic level categorisation, that is to say chair instead of kitchen chair or furniture.

References:
* wikipedia the free encyclopedia, article about categorisation, retriebed on sep. 05, 12:30
* wikipedia the free encyclopedia, article about prototype theory, retrieben on sep. 05, 12:50

Iraila 5, 2009 at 1:13 pm Iruzkin bat utzi

ANSWER EXTRACTION

Answer extraction or Question Answering (QA) is a way of information retrieval. When a quantity of documents is given, the system should be able to answer questions written in natural language. QA needs a more complicated technology of natural language processing than other types of document retrieval.

Question answering systems are one of the most complicated systems in the information retrieval, because this system has to find a fragment of text that answers to the question made in natural language. This systems have to recognise questions like who, how, why, ..

A good QA system needs a good search engine that selects the documents that contain the answer. If we are searching in the web, where we have lots of documents, it common to find parts of the answer in different documents, but this has its benefits, because we can choose the answers that appear more.

We have two different methods, deep and shallow.

Shallow: Some methods use keyword techniques to find passages and sentences in documents and filter based on the presence of the desired answer. They made the ranking based on syntactic characteristics like word order.

Deep: Sometimes using keyword searching is not enough, and we need to use the system that include named-entity recognition, word sense disambiguation,… If the question done is why or how, we will also need this system.

References:

retrieved from, wikipedia the free encylcopedia, sep. 05, 10:51

Iraila 5, 2009 at 11:30 am Iruzkin bat utzi

topics list (Q2)

In my opinion, these are the 10 topics that can be more interesting to write about:

• answer extraction
• spell checking
• topic detection
• word sense disambiguation
• speaker recognition
• automatic hyperlinking
• categorisation
• summarisation
• natural language parsing
• morphological analysis

References:

* Language Technology World’s page, retrieved, September 5th, 11:32
http://www.lt-world.org/

Iraila 5, 2009 at 10:00 am Iruzkin bat utzi


Atalak

 

Iraila 2009
As_Astelehena_haste-hitz Ar_Asteartea_haste-hitz Az_Asteazkena_haste-hitz Og_Osteguna_haste-hitz Ol_Ostirala_haste-hitz Lr_Larunbata_haste-hitz I_Igandea_haste-hitz
« Api_Apirila_laburdura    
 123456
78910111213
14151617181920
21222324252627
282930  

RSS Littera Deusto

  • El Mejor Data Mining Maiatza 30, 2012
    Un sistema de la UPV de ayuda a diagnosticar tumores cerebrales, mejor aportación tecnológica en unos premios sanitarios Parece que el uso de la tecnología rompe fronteras, y el caso de Data Mining o Minería de Datos sigue la misma estela. “El sistema CURIAM BT, desarrollado por investigadores del Grupo de Informática Biomédica (IBIME-ITACA) de [...] […]
    Itxaro González
  • Lenguas románicas Maiatza 30, 2012
    LENGUAS ROMANICAS: Laurentino Rodríguez Contreras explica de donde provienen las lenguas románicas: “La verdadera lengua matriz, que dio nacimiento a las lenguas romances, fue… el italiano, pero el italiano no proviene del latín como comúnmente se cree, si no que es, y esto forma parte también de su tesis, una lengua más antigua, desprendida en [...] […]
    Janire Campo
  • Sare sozialen eta identitate digitalen abantailak!! Maiatza 30, 2012
    Gaur egun, Internet ezinbestekoa bilakatu da. Izan ere, edonork dauka eskuragarri eta honen bidez, beharrezko dugun informazioa aurkitu dezakegu. Internet edozein gauzatarako erabil dezakegu, bai jentearekin kontaktuan jartzeko, bai lan mundurako eta bai aisialdi gisa erbailtzeko. Honen barruan, sare sozialak aurkitu ditzakegu. Denbora aurrera joan ahala, sa […]
    Jone Etxeandia
  • Informatika erabiltzen!! Maiatza 30, 2012
    Informatika ordenagailuen bidez egiten den informazioaren tratamendu automatikoa posible egiten duen ezagutza zientifiko eta teknikoen multzoa da. Hitz hau frantsesetik dator, frantsesek sortu baitzuten “informatique”-ren kontzeptua, hau da, informatika. Informatika garatzen joan da denbora aurrera joan ahala gizakiak lan arruntak egin ahal izate […]
    Jone Etxeandia
  • Mendeley: A new good tool for our computers Maiatza 30, 2012
    Mendeley is actually a very sophisticated research management tool and free to use. It has had a great deal of developments since it was invented until now. It was founded in November 2007 and is based in London. The first public beta version was released in August 2008. The team comprises researchers, graduates, and open [...]
    Edurne Sagarna
  • Hego Poloaren konkista Maiatza 30, 2012
    Roald Amundsen norvegiarrak Hego Polora heldu zen orain dela ehun urte, 1911ko abenduaren 14an, eta kontatzeko bueltatu zen. Bere lehiakidea, Robert Falcon Scott britaniarrak , Lurreko puntu australenera heldu zen hilabete bat geroaago baina bueltako bidean, gosea, neketasuna eta temperatura baxuak bera eta bere laguntzaileekin amaitu zuen. Hauek bi espedizi […]
    olatz rementeria
  • Twitter’s beginning Maiatza 30, 2012
    Haven´t you asked yourself about Twitter? Who created it or when it was made? I have found some pages talking about the answers to these questions. Firstly I found Techcrunch that talks about Jack Dorsey, one of the creators of Twitter. In that page we found the comments of Dorsey about the creation of the [...]
    Agueda Ruiz
  • Internet eta sare sozialen arriskuak Maiatza 30, 2012
    Sare sozialek badute bere alde txarra deskribatuta: erabilera desegokia, obsesibo edo irizpide gabekoa, auto-estima arazoak nabarmendu ditzake. Teknologia berrien berehalakotasuna eta ugaltzeko gaitasunak edozein egoera kontrolatzeko zailtasunak ematen ditu. “Marta G. 48 lagun ditu Facebooken”. “Hau 92 pertsonari gustatzen zaie”. “Nire profila gustatzen baza […]
    olatz rementeria
  • Sui Sin Far Maiatza 30, 2012
    Students of Modern Languages ​​2 we had to make some presentations on each character who appear in the book of Aitor Ibarrola “Entre dos mundos”. My group had to make the work about Sui Sin Far, Canadian short story writer, journalist, and essayist. The work was difficult because there is not a lot of information [...]
    Agueda Ruiz
  • Maya Angelou Maiatza 30, 2012
    “Puede que tengas que enfrentarte a muchas derrotas pero nunca debes acabar derrotado”. The hard life of this African-American woman is summarized in this phrase.  She was born in St. Louis (Missouri) the 1928. Her childhood was full of hard times. Her brother and she had to live for a long time with her ​​grandmother [...]
    Agueda Ruiz

Follow

Get every new post delivered to your Inbox.