Семинары CDUD и SCAKD 2016

Совместно с международной конференцией CLA 2016 в московской Вышке проходит семинар по обнаружению понятий в неструктурированных данных (CDUD 2016) и семинар по «мягким» вычислениям и их приложениям (SCAKD 2016). Принятые работы публикуются в материалах CEUR-WS.org (Scopus, dblp).
Читать далее

Рубрика: Конференции, Лекции/Семинары | Добавить комментарий

школа по Digital Humanities

I Московско-Тартуская школа по цифровым гуманитарным исследованиям
4–7 июля 2016
Ясная Поляна, Тульская область
https://ling.hse.ru/digitalhum

#digitalhumanities #compling #datascience #digitaltolstoy

С начала XX века ученые мечтали о точных подходах к анализу текста, но до сих пор основанные на них исследования остаются на периферии как гуманитарных, так и компьютерных наук.
В цифровую эпоху появилось множество методов анализа больших данных. Применение современных компьютерных инструментов решает вопрос фальсификации гуманитарных теорий и служит источником множества интересных идей для специалистов. Анализ художественных текстов – настоящий вызов для современных методов текст-майнинга и больших данных.

Школа лингвистики НИУ ВШЭ и Тартуская лаборатория «СПЖК» при кафедре русской литературы объявляет набор на Московско-тартускую школу по цифровым гуманитарным исследованиям. Участникам предстоит научиться использовать точные методы для анализа больших корпусов текстов, обычно остающихся за пределами исследований, применяющих более традиционную оптику.

Школа пройдет 4–7 июля в Музее-усадьбе Л.Н. Толстого в Ясной поляне в Тульской области. Студенты школы в течение четырех дней будут слушать лекции в области Digital Humanities и работать вместе с профессорами и преподавателями над решением конкретных текстологических задач в рамках одного из тьюториалов.

К участию в школе приглашаются исследователи как c гуманитарным бэкграундом, так и с техническим. Никаких предварительных навыков компьютерного или филологического анализа текста не требуется. Чтобы стать участником школы, необходимо заполнить заявку и написать, почему вам интересно участие в школе и в каком тьюториале вы хотели бы работать. Список тьюториалов доступен на сайте: https://ling.hse.ru/digitalhum/program.

Заявки принимаются по адресу http://goo.gl/forms/nS1obsiu2Y до 1 июня 2016 г. Организаторы отберут участников школы к 10 июня 2016 и оповестят финалистов по электронной почте.

Финалисты конкурса получат грант, покрывающий расходы на проживание и участие во всех включенных в программу школы мероприятиях (грант не покрывает питание и транспортные расходы участников).

Подробнее на сайте https://ling.hse.ru/digitalhum .
Все вопросы о школе можно отправлять руководителю школы Анастасии Бонч-Осмоловской по адресу abonch[собачка]hse.ru

Рубрика: Курсы/Образование/Постдоки | Добавить комментарий

RuSSIR Young Scientist Conference — deadline extended

CALL FOR PAPERS : RuSSIR Young Scientist Conference 2016

RuSSIR Young Scientist Conference 2016 is collocated with the 10th Russian Summer School in Information Retrieval (RuSSIR 2016).

It will be held on August 22-26, 2016 in Saratov, Russia.

http://romip.ru/russir2016/call-for-papers/

=================================================================

The RuSSIR Young Scientist Conference has been created to provide an academic forum that fosters young researchers working on modern problems and methods in information retrieval and related disciplines to share their ideas and results with participants and lecturers of Russian Summer School in Informational Retrieval (RuSSIR). The conference will be held within main RuSSIR program in Saratov, Russia.

We invite submissions of full papers describing innovative research on a wide range of IR-related topics and especially encourage works about semantic search and utilization of knowledge repositories. Topics of interest include but are not restricted to the following:

• IR theory and models;
• Data structures, algorithms, and indexing;
• Web search systems and Web mining;
• Semantic Web and knowledge databases;
• Natural language processing;
• Text Corpora design and application;
• Social networks and graph analysis.

IMPORTANT DATES:

Application deadline: May 15, 2016. May 25, 2016!
Notification: June 19, 2016.
School: August 22-26, 2016.

We invite only original submissions: we will not accept any paper that is under review or has already been accepted for publication in a journal or another conference.

Papers must be in English, up to 12 pages long and formatted according to a Springer template.

IMPORTANT: Submissions should be only made electronically in PDF through EasyChair website.
Accepted papers will be invited for publication in Springer CCIS RuSSIR 2016 proceedings (Springer Communications in Computer and Information Science series). Papers would be presented by means of interactive posters during the conference. At least one of the authors should be less than 30 year old and be able to attend the conference.

Maria Eskevich and Yana Volkovich

RuSSIR Young Scientist Conference 2016 Organising Committee Co-chairs

Рубрика: Конференции, Курсы/Образование/Постдоки | Добавить комментарий

Psycholinguistic Investigations into Number and Quantification in Natural Language

Call Deadline: 15-May-2016

Meeting Description:

The purpose of the workshop is to bring together researchers interested in applying computational techniques to problems in morphology, phonology, and phonetics. Work that addresses orthographic issues is also welcome. Papers will be on substantial, original, and unpublished research on these topics, potentially including strong work in progress. Appropriate topics include (but are not limited to) the following as they relate to the areas of the workshop:

— New formalisms, computational treatments, or probabilistic models of existing linguistic formalisms
— Unsupervised, semi-supervised or machine learning of linguistic knowledge
— Models of psycholinguistic experiments
— Morpheme identification and word segmentation
— Algorithms, including finite-state methods
— Corpus linguistics
— Machine transliteration and back-transliteration
— Speech technologies relating to phonetics or phonology
— Speech science (both production and comprehension)
— Analysis or exploitation of multilingual, multi-dialectal, or diachronic data
— Instructional technologies for second-language learners
— Integration of morphology, phonology, or phonetics with other NLP tasks
— Tools and resources
— Approaches to orthographic variation

One of the missions of SIGMORPHON is to encourage interaction between work in computational linguistics and work in theoretical phonetics, phonology and morphology, and to ensure that each of these fields profits from the interaction. Our recent meetings have been successful in this regard, and we hope to see this continue in 2014. Many mainstream linguists studying phonetics, phonology and morphology are employing computational tools and models that are of considerable interest to computational linguists. Similarly, models and tools developed by and for computational linguists may be of interest to theoretical linguists working in these areas. This workshop provides a forum for these researchers to interact and become exposed to each others’ ideas and research.
Читать далее

Рубрика: Без рубрики | Добавить комментарий

GRAMMAR AND CORPORA

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
SECOND CALL FOR PAPERS for

THE 6th INTERNATIONAL CONFERENCE *GRAMMAR AND CORPORA* (GaC 2016)
Mannheim, Germany
9-11 November 2016
http://gac2016.ids-mannheim.de
Deadline for abstract submission: 31.05.2016
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

The organising committee of *GaC 2016* is pleased to invite
contributions to the sixth
international conference *Grammar and Corpora*, to be held at the
Institute for the German
Language in Mannheim on 9-11 November 2016.

In recent years, the availability of large annotated and searchable
corpora, together
with a new interest in the empirical foundation and validation of
linguistic theory and
description has sparked a surge of novel and interesting work using
corpus methods to
study the grammar of natural languages. However, a look at relevant
current research on
the grammar of German, English, or the Romance and Slavic languages
reveals a variety of
different theoretical approaches and empirical foci which can be
traced back to different
philological and linguistic traditions. Still, this state of affairs
should not be seen as
an obstacle but arguably provides an ideal basis for a fruitful
exchange of ideas between
different research paradigms.

In addition to deepening our knowledge and understanding of individual
languages,
corpus-oriented work on grammar has wider implications that concern
methodological as
well as theoretical aspects. Relevant topics and research questions
concern e.g. annotation
schemata for (larger) syntactic units and syntactic relations, the
increased use of (advanced)
statistical methods and models in linguistics, the relation and
boundary between grammar and
discourse, and more generally the interface between corpus linguistics
and linguistic theory.

We welcome submissions that explore the use of corpus methods in the
description and theoretical
analysis of the grammar of natural languages. Focal areas of interest
include, but are not limited to:

1. Corpus-based studies on the grammar of Germanic, Romance and Slavic
languages:

— The use of (large) corpora in the description of patterns of grammar
from both
a language-specific and a contrastive/cross-linguistic perspective
— The identification and formal modelling of (different types of)
synchronic linguistic variation
using corpus methods
— New insights into the connection between linguistic variation and
change made available
by inspecting ???language change in progress??? in large corpora
— The use of advanced corpus-linguistic and statistical methods in
historical linguistics
as a means to compensate for the relative scarcity of data

2. Theoretical and methodological issues pertaining to corpus-oriented
research on grammar:

— Tools, methods and techniques in corpus assembly, annotation and analysis
— The interaction between corpus linguistics and computational linguistics
— The interaction between corpus linguistics and linguistic theory
— The use of statistical and quantitative methods in detecting
patterns of grammar
— The impact of corpus-based vs. corpus-driven approaches on our
view/understanding of grammar

A subset of these issues will be the focus of five invited keynotes, a poster
session, and four tutorials on multilingual corpora, web corpora,
visualization in R, and
the IDS corpus analysis platform KorAP. Conference languages are
English and German.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
CONFIRMED KEYNOTE SPEAKERS
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Anne Abeill??
(Universit?? Paris Diderot — Paris 7, France)

Susan Conrad
(Portland State University, USA)

Anke Holler
(University of G??ttingen, Germany)

John Nerbonne
(Rijksuniversiteit Groningen, Holland / University of Freiburg, Germany)

Alexandr Rosen
(Charles University Prague, Czech Republic)
Читать далее

Рубрика: Конференции | Добавить комментарий

word2vec проапгрейдили

We are happy to announce the release of our new toolkit “MultiVec” for
computing continuous representations for text at different granularity
levels (word-level or sequences of words). MultiVec includes Mikolov et al.
[2013b]’s word2vec features, Le and Mikolov [2014]’s paragraph vector (batch
and online) and Luong et al. [2015]’s model for bilingual distributed
representations. MultiVec also includes different distance measures between
words and sequences of words. The toolkit is written in C++ and is aimed at
being fast (in the same order of magnitude as word2vec), easy to use, and
easy to extend. It has been evaluated on several NLP tasks: the analogical
reasoning task, sentiment analysis, and crosslingual document
classification. The toolkit also includes C++ and Python libraries, that you
can use to query bilingual and monolingual models.

The project is fully open to future contributions. The code is provided on
the project webpage ( <https://github.com/eske/multivec>
https://github.com/eske/multivec) with installation instructions and
command-line usage examples.

When you use this toolkit, please cite:

@InProceedings{MultiVecLREC2016,

Title                    = {{MultiVec: a Multilingual and MultiLevel
Representation Learning Toolkit for NLP}},

Author                   = {Alexandre Bérard and Christophe Servan and
Olivier Pietquin and Laurent Besacier},

Booktitle                = {The 10th edition of the Language Resources and
Evaluation Conference (LREC 2016)},

Year                     = {2016},

Month                    = {May}

}

The paper is available here:
<https://github.com/eske/multivec/raw/master/docs/Berard_and_al-MultiVec_a_M
ultilingual_and_Multilevel_Representation_Learning_Toolkit_for_NLP-LREC2016.
pdf
>
https://github.com/eske/multivec/raw/master/docs/Berard_and_al-MultiVec_a_Mu
ltilingual_and_Multilevel_Representation_Learning_Toolkit_for_NLP-LREC2016.p
df

Рубрика: Ресурсы/Софт | Добавить комментарий

гранты на поездку на школу по информационному поиску

Autumn School 2016 for Information Retrieval and Information Foraging (ASIRF)
Overview lectures on Advanced topics in Information Retrieval and Information Foraging.

2. – 8. October 2016 in Schloß Dagstuhl, Germany

https://www.uni-hildesheim.de/fb3/institute/iwist/veranstaltungen/asirf2016

The Autumn School for Information Retrieval and Information Foraging 2016 (ASIRF) provides unique opportunies to learn about the the latest developments in the area of Information Retrieval Models, Systems, Evaluation, as well as about Information Interaction and Human Information Foraging Behaviour. And meet other participants working in these areas!

Apply for a comprehensive student grant
Grants for students from outside of Germany will cover the costs for room and board and most of costs for travelling, depending on the rates defined for different countries by the DAAD. Some 20 international students can be invited.

https://www.uni-hildesheim.de/media/fb3/informationswissenschaft/Herbstschule/ASIRF_2016-Application_form.pdf

German students can register at a very competitive price.

ASIRF takes place in the historical environment at the Leibniz Center for Informatics Schloss Dagstuhl — www.dagstuhl.de

One tutor will be (almost sure) Prof. Dr. Norbert Fuhr (University of Duisburg-Essen), a Salton-Award winner.

Topics
— Models
— Evaluation
— Modelling Interactive IR
— Information Behavior
— IR interfaces and user oriented design

ASIRF will also take place in 2017 and it will again offer student grants.

Organisation

Prof. Thomas Mandl & Dr. Ben Heuwing
Universität Hildesheim, Germany
asirf att uni  —  hildesheim  dott  de

Dr. Ingo Frommholz
University of Bedfordshire, UK

German Special Interest Group IR

Рубрика: Гранты, Курсы/Образование/Постдоки | Добавить комментарий

COLING CFP

=============================================================
COLING 2016
The 26th International Conference on Computational Linguistics
——————————————————————————

December 11-16, 2016
Osaka, Japan.

http://coling2016.anlp.jp/ [1]

The International Committee on Computational Linguistics (ICCL) is
pleased to announce the 26th International Conference on Computational
Linguistics (COLING 2016), in Osaka, Japan, at the Osaka International
Convention Center (OICC) (located in Nakanoshima in the center of Osaka).

ABOUT COLING
-­-­-­-­-­-­-­-­-­-­-­—————

The COLING conference has a history that dates back to the 1960s. The
conference is held every two years and regularly attracts more than 700
delegates. The 1st conference was held in New York, 1965. Since then,
the conference has developed into one of the premier Natural Language
Processing conferences worldwide. The last five conferences were held in
Sydney (COLING -­ ACL 2006), Manchester (COLING 2008), Beijing (COLING
2010), Mumbai (COLING 2012), and Dublin (COLING 2014).

COLING covers a broad spectrum of technical areas related to natural
language and computation. The conference will include full papers
(presented as oral presentations or posters), demonstrations, tutorials,
and workshops.
Читать далее

Рубрика: Конференции | Добавить комментарий

Конференция ISMW FRUCT 2016

Intelligence, Social Media and Web (ISMW) — научно-практическая конференция по социальным СМИ, искусственному интеллекту и информационному поиску. Мероприятие пройдёт с 28 августа по 4 сентября 2016 г.

ISMW FRUCT Читать далее

Рубрика: Конференции | Добавить комментарий

First Call for Papers: EMNLP 2016

EMNLP в моей рекламе не нуждается, конечно, но пусть будет для порядка.
First Call for Papers: EMNLP 2016
Conference on Empirical Methods in Natural Language Processing
Austin, Texas, USA
November 2-6, 2016
SIGDAT, the Association for Computational Linguistics’ special interest group on linguistic data and corpus-based approaches to NLP, is pleased to announce that EMNLP 2016 will be held on November 2–6, 2016, in Austin, Texas, USA. The conference includes three days of full paper presentations and invited talks, plus two days consisting of eight workshops and six tutorials.
The conference invites the submission of long and short papers related to empirical methods in natural language processing. Accepted papers will be presented as oral talks or posters. As in recent years, the conference will also include presentations of selected papers accepted by the Transactions of the ACL (http://www.transacl.org/).
This year, EMNLP is collocated with AMTA 2016, hosted by the Association for Machine Translation in the Americas, held from October 29 to November 2. Also in Austin, HCOMP 2016, the 4th AAAI Conference on Human Computation and Crowdsourcing, will be held from October 30 to November 3.
We invite you to join us!
=== Topics ===
We solicit papers on all areas of interest to the SIGDAT community and aligned fields, including but not limited to:
— Computational Psycholinguistics
— Dialogue and Interactive Systems
— Discourse Analysis
— Generation
— Information Extraction
— Information Retrieval and Question Answering
— Language and Vision
— Linguistic Theories and Resources
— Machine Learning
— Machine Translation
— Multilinguality and Cross-linguality
— Segmentation, Tagging, and Parsing
— Semantics
— Sentiment Analysis and Opinion Mining
— Web, Social Media and Computational Social Science
— Spoken Language Processing
— Summarization
— Text Categorization and Topic Modeling
— Text Mining
=== Important Dates ===
Long Paper submission deadline: June 3, 2016
Short Paper submission deadline: June 3, 2016
Author response period: July 13-17, 2016
Acceptance notification: July 29, 2016
Camera-ready submission deadline: September 23, 2016
Workshops and tutorials: November 2 and 6, 2016
Main conference: November 3 — 5, 2016
All the above deadlines are 11:59pm Pacific Daylight Savings Time (UTC -7h).

Читать далее

Рубрика: Конференции | Добавить комментарий