Computational Linguistics Group
University of Vigo
Computational Linguistics Links
INDEX
Site administrator
- Introductions, directories, associations and publications
- Corpus linguistics
- Grammatical analysis, modeling and parsing
- Computational lexicology and lexicography
- Human-computer interaction
- Speech recognition and synthesis
- Machine translation and machine-aided translation
- Computer-aided language learning
- Information retrieval and document processing
- Integration of language technology resources
- Appendix: Software in Galician and R&D
![]()
Xavier Gómez Guinovart, 2/12/09
INTRODUCTIONS, DIRECTORIES, ASSOCIATIONS AND PUBLICATIONS
Introductions to computational linguistics
General collections of links
- [pdf] Manual de lingüística computacional (Xavier Gómez Guinovart)
- [pdf] Lingüística y tecnologías del lenguaje (Joaquim Llisterri)
- NLP FAQ: Natural Language Processing Frequently Asked Questions (Dragomir R. Radev)
- WWW Info on Computational Linguistics/Language Technology (Hans Uszkoreit)
- La ingeniería lingüística en la sociedad de la información (M. A. Martí e J. Llisterr)
- The NLP Dictionary (Bill Wilson)
Collections of links for Galician
- CLUK: Computational Linguistics UK (Martin Stower)
- The ACL NLP/CL Universe Association for Computational Linguistics
- Interactive online CL Demos (Martin Volk)
- Linguateca
- Natural Language Software Registry
Associations
- Lista de recursos de língua galega (Ciberirmandade da Fala)
- Recursos en galego e de lingua galega na rede (Calidonia)
- Recursos lingüísticos (Deputación da Coruña)
- Ferramentas lingüísticas (Mancomún)
- O galego nas novas tecnoloxías (Secretaría Xeral de Política Lingüística)
- De bolina (Consello da Cultura Galega)
- Recursos de galego (IES de Mos)
- En galego (Universitat de Barcelona)
- Recursos sobre a lingua galega (Loia.org)
Online publications and papers
- The Association for Computational Linguistics (ACL)
- Spanish Association for Natural Language Processing (SEPLN)
- Association for Computers and the Humanities (ACH)
- Spanish Association for Human-Computer Interaction (AIPO)
- ACL Wiki for Computational Linguistics (ACL)
- Procesamiento del Lenguaje Natural (SEPLN)
- ELSNews (ELSNET quarterly newsletter)
- Novatica (ATI, Asociación de Técnicos de Informática)
- Journal of Artificial Intelligence Research (JAIR)
- Artificial Intelligence (Elsevier)
- Revista Digital d'Humanitats (Universitat Oberta de Catalunya)
- Spanish Association for Natural Language Processing Electronic Newsletter (SEPLN)
- Revista Tradumàtica: Traducció i Tecnologies de la Informació (Universitat Autònoma de Barcelona)
- Translation Journal (Accurapid)
- Linguamática (Universidade do Minho e Universidade de Vigo)
Go to the top
CORPUS LINGUISTICS
Electronic texts
On-line corpora
- Project Gutenberg
- The Oxford Text Archive
- CETH - Directory of Electronic Text Centers
- The EServer: Accessible Online Publishing
- IPL Online Text Collection (Internet Public Library)
- BiVir - Virtual Library (Universal Literature in Galician language)
- Galician Virtual Library (Galician Literature)
- Vercial Project (Portuguese Literature)
Text analysis software and corpora catalogues
- Corpus Lingüístico da Universidade de Vigo (CLUVI) (University of Vigo Parallel Corpus of Galician)
- Tesouro Informatizado da Lingua Galega (TILGA) (Galician corpus)
- Corpus de Referencia do Galego Actual (CORGA)
- Corpus Técnico do Galego (CTG) (University of Vigo Technical Corpus of Galician)
- Tesouro Medieval Informatizado da Lingua Galega (TMILG)
- Arquivo Sonoro de Galicia - [ASG] - A nosa fala (Galician speech corpus)
- RILG: Recursos Integrados da Lingua Galega
- Acesso a corpora de português: Projecto AC/DC
- Corpus do Português [Davies/Ferreira]
- CQP Online Demos (Stuttgart Univ.)
- Concordances of Great Books
- British National Corpus (BNC)
- Collins WordbanksOnline English
- Lampeter Corpus of Early Modern English
- Corpus Textual Informatitzat de la Llengua Catalana
- Corpus del Español histórico y moderno (Mark Davies, Universidade de Illinois)
- Corpus de Referencia del Español Actual (CREA)
- Korpus 2000 (Danish corpus)
- The IPI PAN Corpus (Polish corpus)
- Corpus Paralelo COMPARA (Portuguese-English)
- NAT-QI: NATools Corpora Query Interface (pt-en,fr,es)
- OPUS - an open source parallel corpus
- CRATER Multilingual Aligned Annotated Corpus
- Penn Treebank Online(online interface to several Penn Treebank parsed corpora)
Corpus encoding standards and tagsets
- Text Analysis Computing Tools (TACT for MS-DOS, v. 2.1.5, 1995, free distribution)
- MonoConc (Athelstan)
- European Language Resources Association (ELRA)
- Linguistic Data Consortium (LDC)
Documents and collections of links
- TEI Consortium
- TEI Guidelines for Electronic Text Encoding and Interchange (P4)
- Corpus Encoding Standard (Eagles, EU)
- SGML (Joseba Abaitua)
- Recommendations for the Morphosyntactic Annotation of Corpora (Eagles, UE)
- Synopsis and Comparison of Morphosyntactic Phenomena Encoded in Lexicons and Corpora (Eagles, UE)
- The SLI Tagset for Galician Morphosyntactic Annotation of Corpora (SLI, Universidade de Vigo)
- Una propuesta de codificación morfosintáctica para corpus de referencia en lengua española (Aurora Martín de Santa Olalla)
- [pdf] Lingüística de Corpus: Histórico e Problemática (Tony Berber Sardinha)
- Corpus Linguistics (McEnery & Wilson)
- The London-Lund Corpus of Spoken English (Sidney Greenbaum & Jan Svartvik) (ICAME)
- London/Oslo/Bergen (LOB) Corpus Manual (Stig Johansson) (ICAME)
- The tagged LOB Corpus Users' Manual (Stig Johansson) (ICAME)
- The Helsinki Corpus of English Texts Manual (Merja Kytö) (ICAME)
- Archives of CORPORA (Text Corpora Distribution List)
- [pdf] Tratamiento de corpora bilingües (Joseba Abaitua)
- Corpora y otros depósitos textuales
- Corpus resources: Corpora and electronic text databases (Emily M. Bender)
- Gateway to Corpus Linguistics on the Internet (Yvonne Breyer)
- Well-known and influential corpora: A survey (R. Xiao)
Go to the top
GRAMMATICAL ANALYSIS, MODELING AND PARSING
On-line and interactive systems
Grammar models and formalisms
- Galician Freeling - Automatic linguistic analysis of Galician
- Freeling - An Open Source Suite of Language Analyzers (English, Spanish, Catalan, Italian, Galician)
- ENGCG: Constraint Grammar Parser of English
- Spanish morphological parser
- TnT -- Statistical Part-of-Speech Tagging
- U. of Illinois Part of Speech Tagging Online Demonstration
- Shallow Parsing Online Demonstration
- Link Grammar English Parser
- WinBrill (CNRS)
- TreeTagger
- Linguistic Demos - Xerox XRCE (tokenization, morphological analysis and disambiguation for Portuguese, Spanish, English, French, German and more)
Documents and collections of links
- PC-PATR (official home page)
- Lexical Functional Grammar (U. of Essex)
- Lexical Functional Grammar. The Stanford Web Site.
- LFG for Windows (Avery Andrews)
- HPSG Homepage
- The AGFL Grammar Work Lab
- Introduction to GB model
- Survey of the State of the Art of Human Language Technology: Language Analysis and Understanding
- A Web of On-line Grammars
- Online Resources for Studying English Syntax
- [pdf] Foundations of statistical natural language processing
Go to the top
COMPUTATIONAL LEXICOLOGY AND LEXICOGRAPHY
On-line dictionaries
Lexical formalisms and computational lexicons
- Dicionario de Galego Ir Indo
- Vocabulario Ortográfico da Lingua Galega (VOLGa) (VOLGa)
- Diccionario da Real Academia Galega (RAG)
- e-Estraviz - Dicionário da Língua Galego-Portuguesa
- Termoteca - Terminological Databank of the University of Vigo
- Neoteca - Neologisms Databank of the University of Vigo
- English-Galician CLUVI Dictionary
- Galician-German Dictionary (RAG)
- Diccionario de Galego (CRTVG)
- RILG: Recursos Integrados da Lingua Galega
- Dicionário Universal da Língua Portuguesa (Texto Editora e Priberam Informática)
- The Wordsmyth Educational Dictionary-Thesaurus
- Merriam-Webster Dictionary & Thesaurus
- ARTFL Project: Webster's Revised Unabridged Dictionary
- Gran diccionari de la llengua catalana (Enciclopèdia catalana)
- Diccionarios.com (Vox)
- ASL Dictionary
- ARTFL Project: Roget's Thesaurus, 1911 edition
- Diccionario Euskara-Inglés Morris Student Plus (Imaxin Software)
- Dicionário de Caboverdiano Português On-Line (Verbalis e Priberam)
- Glosario de Internet: Galego-inglés-castelán-portugués
- [pdf] Léxico panlatino de Internet. Catalán-español-francés-galego-italiano-portugués-romanés
- Fraseoloxía xurídico-administrativa: galego-alemán-inglés-castelán
- Vocabulario das artes gráficas (obradoiro de encadernación): galego-español-inglés
- Dicionario visual da construción
- Dicionario da vestimenta tradicional galega
- [pdf] Vocabulario de Química. Galego-español-inglés
- Dicionario Galego da Construción Naval
- Vocabulario de ecoloxía e ciencias ambientais
- IATE (Inter-Active Terminology for Europe) (repertorio terminolóxico da Unión Europea)
- Cercaterm do Termcat
- Neoloteca do Termcat
- Euskalterm: euskara-castelán-francés-inglés-latín
- RhymeZone
- Dictionnaires d'autrefois
Documents and collections of links
- The DATR Web Pages at Sussex
- The DATR Lexical Knowledge Representation Language
- WordNet Home Page
- WordNet 1.6 Interface at Kyoto
- EuroWordNet Home Page
- Web EuroWordNet Interface (v. 0.2)
- Wordnet MCR Interface Meaning
- The Acquilex projects
- Survey of the State of the Art of Human Language Technology: Language Resources
- Morphosyntactic Phenomena Encoded in Lexicons and Corpora (EAGLES Recommendations)
- ACL SIGLEX (Special Interest Group on the Lexicon)
- Consortium for Lexical Research
- Search Engine Plugins for Mozilla browsers
- Andel de traballos terminográficos do SNL-USC
- Diccionarios e glosarios (O Traductor Virtual)
- Lexicool - Directory of bilingual and multilingual dictionaries
- The Linguist List: Dictionaries
- List of Dictionaries
- yourDictionary.com
- Página de la Lengua Española: Diccionarios
- Diseño e implementación de un lexicón computacional para lexicografía y traducción automática (Antonio Moreno Ortiz)
- Explotación de los córpora textuales informatizados para la creación de bases de datos terminológicas basadas en el conocimiento (Chantal Pérez Hernández)
- Individuación e información Parte-Todo: Representación para el procesamiento computacional del lenguaje. (Salvador Climent Roca)
Go to the top
HUMAN-COMPUTER INTERACTION
Companies and products
On-line and interactive systems
Documents and collections of links
- BotSpot: ChatBots
- ELIZA
- Eliza en JAVA
- Zabaware - Bring Out the Person in your PC!
- The Simon Laven Page
- Asociación Interacción Persona-Ordenador
- Survey of the State of the Art of Human Language Technology: Discourse and Dialogue
- ACL SIGDIAL (Special Interest Group on Dialogue)
- ACM SIGCHI (Special Interest Group on Computer-Human Interaction)
Go to the top
SPEECH RECOGNITION AND SYNTHESIS
Companies and products
On-line and interactive systems
- IBM ViaVoice
- Dragon Systems
- Philips
- Lernout & Hauspie (Kurzweil)
- SpeechDepot (Unisys)
Documents and collections of links
- Cotovía (text-to-speech system for Galician and Spanish)
- Loquendo (síntese de fala do galego)
- Text-to-Speech Demo for Spanish (UPM)
- Interactive Multi-Lingual Demo (ATT Labs)
- Transcriptor fonético automático del español
- Speech at Carnegie Mellon University
- comp.speech FAQ (Frequently Asked Questions)
- Survey of the State of the Art of Human Language Technology: Spoken Language Input
- Survey of the State of the Art of Human Language Technology: Spoken Output Technologies
- Incorporación de conocimientos fonéticos a las tecnologías del habla (L. Aguilar, J.M. Garrido e J. Llisterri)
Go to the top
MACHINE TRANSLATION AND MACHINE-AIDED TRANSLATION
On-line and interactive systems
Companies and products
- Free Online Translation Spanish-Galician Galician-Spanish
- Tradutor español<->galego OpenTrad
- Tradutor inglés<->galego OpenTrad
- Tradutor español-galego da Secretaría Xeral de Política Lingüística
- Google Translate
- FreeTranslation.com
- Systran
- PROMT
- Voilà (Systran)
- AltaVista (Systran)
- FoxLingo toolbar for Mozilla browsers (interface to free online translation services for 37 languages)
- AJAX Translator (italiano, castelán, portugués, alemán, francés, inglés)
- OpenTrad (español-catalán, galego, euskara)
- Opentrad apertium prototype (español-catalán, galego, portugués)
- interNOSTRUM (Spanish-Catalan)
- AutomaticTrans (castelán-catalán, galego, portugués, francés, inglés)
- Translendium (castelán, catalán, alemán, francés, inglés, galego)
- N-II: a statistical machine translator (Spanish-Catalan)
- Proxecto Rianxo (Galician-Spanish)
- Traduce Gratis - Systran
- NewsTran: Multilingual MetaNews Translator (news via AltaVista)
- TextCat Language Guesser (Van Noord)
- Reconhecedor automático de idiomas (Insite)
- Atril Déjà Vu (translation memory, with demo)
- WordFast (translation memory, with demo)
- SALT (Spanish-Catalan freeware)
- Star Transit (translation memory)
- TRADOS Translator's Workbench (translation memory)
- PAHO - Machine Translation at PAHO (ACG/T)
- Traductor automático Traduza-g (Spanish-Galician)
- Babylon (PC-Internet)
- Xanadu (PC-Internet)
Documents and collections of links
- Machine Translation at Carnegie Mellon University
- European Association for Machine Translation (EAMT)
- Translation, theory and technology
- Quince años de traducción automática en España (J. Abaitua)
- Survey of the State of the Art of Human Language Technology: Multilinguality
- Traducción automática: presente e futuro (J. Abaitua)
- [pdf] Automation of Translation: Past, Presence, and Future (Karl-Heinz Freigang)
- Links to translation resources (N. Andión)
Go to the top
COMPUTER-AIDED LANGUAGE LEARNING
On-line and interactive systems
Documents and collections of links
- Hot Potatoes (U. of Victoria freeware)
- Galingua (curso de galego)
- é-galego (curso de galego)
- Curso de galego (Dorinda Castro Soliño e Rosa Gullón González)
- Etnonet 2.0 - Enciclopedia Interactiva da Cultura Galega (Xosé Antón Vicente Rodríguez - CEIP O Piñeiriño)
- Português para nós (Curso multimédia de português para galegos)
- The Virtual CALL Library (U. of Sussex at Brighton)
- Authoring
- Computer-Assisted Language Learning: an Introduction (Warschauer, M.)
Go to the top
INFORMATION RETRIEVAL AND DOCUMENT PROCESSING
On-line and interactive systems
Documents and collections of links
- Corrector ortográfico en liña de galego (SLI)
- Corrector ortográfico de galego para MS Office
- Corrector ortográfico de galego para OpenOffice
- Corrector gramatical de galego para OpenOffice
- Corrector ortográfico de galego para Mac OS X
- Corrector de linguaxe sexista do galego (Tagenata)
- Corrector ortográfico do português FLiP (Priberam)
- WebJspell (portugués, inglés) (Univ. do Minho)
- JSpell OnLine Spell Checker (English, Spanish, French)
- Hyphenation for Galician (Mancomún)
- Cloze Readability Test (Readability of a Financial Privacy Policy)
- SweSum - Automatic Text Summarize (Hercules Dalianis and Martin Hassel)
- TwentyOne demonstrator (multilingual retrieval from European documents about local sustainability)
- Google search and translate
- Web-based question answering system (MIT)
XML & HTML Manuals
- Oportunidades para la ingeniería lingüística (Piek Vossen)
- Sistemas de resumen automático de documentos (Salvador Climent)
- Information Retrieval & Natural Language Processing (Felisa Verdejo, Julio Gonzalo e Anselmo Peñas, UNED)
- ReGra (corrector gramatical para portugués de Brasil)
- How to write plain English (Rudolf Flesch)
- Measuring the reading age of books and other reading matter (Chris & Keith Johnson)
- Everything you ever wanted know about readability tests but were afraid to ask
- Search Engine Watch
- Search Engine Showdown
- Survey of the State of the Art of Human Language Technology: Document Processing
- A gentle introduction to XML
- Extensible Markup Language (XML) 1.0 (Second Edition) (W3C)
- The XML FAQ
- The XML Cover Pages
- Writing HTML
- Webmonkey
- HTML Home Page (World Wide Corsortium - W3C)
Go to the top
INTEGRATION OF LANGUAGE TECHNOLOGY RESOURCES
- RILG: Integrated Language Resources for Galician
- CLARIN-ES
- CLARIN: Common Language Resources and Technologies
Go to the top
APPENDIX: SOFTWARE IN GALICIAN AND NLP R&D
Software in Galician
NLP Research and Development
- Windows XP en galego
- Windows Vista en galego
- MS Office 2003 en galego
- MS Office 2007 en galego
- Linux Trisquel en galego
- Linux Galinux (versión orixinal) en galego
- Linux Galinux (versión de Mancomún) en galego
- OpenOffice en galego
- Firefox en galego
- Thunderbird en galego
- A Internet en galego: o dominio .gal
- Portal institucional de software libre
- Tradución de software libre ao galego
- Programas gratis en galego
- Programas libres en galego para Windows
- Programas libres en galego para Linux
- Programas libres en galego para Mac
Go to the top
Thank you for your interest