The ALTA Institute

The Institute for Automated Language Teaching and Assessment (ALTA) is funded by Cambridge English Language Assessment to conduct research into improving linguistic proficiency of learners and into automated forms of assessment of language performance. The institute was founded in October 2013 and is administered by the Computer Laboratory. It also includes principal investigators from the Departments of Engineering and Theoretical and Applied Linguistics.

Groups

The Institute comprises researchers from the NLIP group, the DTAL group, the speech group at the machine intelligence laboratory, and the machine learning group.

People

The principal investigators are:

The institute consists of the following research associates, PhD students, and collaborators:

Former Members:

Videos, Links, Software

ALTA Publications (Most open access -- cut and paste to e.g. Google Scholar)

2016

Diane Litman, Steve Young, Mark Gales, Kate Knill, Karen Ottewell, Rogier van Dale and David Vandyke. Towards Using Conversations with Spoken Dialogue Systems in the Automated Assessment of Non-Native Speakers of English. SIGDIAL, 2016.

Marek Rei, Sampo Pyysalo, Gamal K.O. Attending to characters in neural sequence labeling models. COLING, 2016.

Christopher Bryant, Mariano Felice, Ted Briscoe. Automatic Extraction of Learner Errors in ESL Sentences Using Linguistically Enhanced Alignments. COLING, 2016.

Russell Moore, Andrew Caines, Calbert Graham, Paula Buttery. Automated speech-unit delimitation in spoken learner English. COLING, 2016.

A. Herbolet, E. Kochmar. Calling on the classical phone: a distributional model of adjective-noun errors in learners English. COLING, 2016.

Calbert Graham, Paula Buttery, Francis Nolan. Vowels characteristics in the assessment of L2 English pronunciation. INTERSPEECH, 2016.

A. Malinin, R.C. van Dalen, Y. Wang, M.J.F. Gales, K.M. Knill. Off-topic Response Detection for Spontaneous Spoken English Assessment. Association for Computational Linguistics, 2016.

C. Graham, F Nolan, A Caines, P Buttery. Toward an automated pronunciation training system for English L2 learners: theoretical and methodological issues. International Symposium on Applied Phonetics, 2016.

Andrew Caines, Christian Bentz, Calbert Graham, Tim Polzehl, Paula Buttery. Crowdsourcing a multilingual speech corpus: recording, transcription and annotation of the CrowdED Corpus. Proceedings of the Tenth International Conference on Language Resources and Evaluation, 2016.

Ronan Cummins, Meng Zhang,Ted Briscoe. Constrained Multi-Task Learning for Automated Essay Scoring. Association for Computational Linguistics, 2016.

Menglin Xia, Ekaterina Kochmar, Ted Briscoe. Text Readability Assessment for Second Language Learners. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

Marek Rei, Helen Yannakoudakis. Compositional Sequence Labeling Models for Error Detection in Learner Writing. Association for Computational Linguistics, 2016.

Marek Rei, Ronan Cummins. Sentence Similarity Measures for Fine-Grained Estimation of Topical Relevance in Learner Essays. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

Zheng Yuan, Ted Briscoe, Mariano Felice. Candidate re-ranking for SMT-based grammatical error correction. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

Zheng Yuan, Ted Briscoe. Grammatical error correction using neural machine translation. Association for Computational Linguistics, 2016.

Ekaterina Kochmar, Ekaterina Shutova. Cross-Lingual Lexico-Semantic Transfer in Language Learning. Association for Computational Linguistics, 2016.

Dimitrios Alikaniotis, Helen Yannakoudakis, Marek Rei. Automatic Text Scoring Using Neural Networks. Association for Computational Linguistics, 2016.

Ronan Cummins, Helen Yannakoudakis, Ted Briscoe. Unsupervised Modeling of Topical Relevance in L2 Learner Text. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

2015

Shixiang Gu, Zoubin Ghahramani, Richard E. Turner. Neural Adaptive Sequential Monte Carlo. Advances in Neural Information Processing Systems, 2015.

Nilesh Tripuraneni, Shane Gu, Hong Ge, Zoubin Ghahramani. Particle Gibbs for Infinite Hidden Markov Models. Advances in Neural Information Processing Systems, 2015.

Mariano Felice and Ted Briscoe. Towards a standard evaluation method for grammatical error detection and correction. Proceedings of NAACL-HLT, 2015.

C Graham, A Caines, P Buttery. Phonetic and prosodic features in automated spoken language assessment. Workshop on Phonetic Learner Corpora, International Congress of the Phonetic Sciences (ICPhS), 2015.

C Graham, F Nolan, A Caines, P Buttery. Vowel characteristics in automated assessment. International Phonetics & Phonology in Europe (PaPE) conference, 2015.

Ekaterina Kochmar and Ted Briscoe. Using Learner Data to Improve Error Correction in Adjective--Noun Combinations. Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015.

Russell Moore, Andrew Caines, Calbert Graham and Paula Buttery. Incremental Dependency Parsing and Disfluency Detection in Spoken Learner English. Proceedings of Text, Speech and Dialogue (TSD), 2015.

Marek Rei. Online Representation Learning in Recurrent Neural Language Models. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015.

Helen Yannakoudakis and Ronan Cummins. Evaluating the performance of Automated Text Scoring systems. Proceedings of the 10th workshop on Innovative Use of NLP for Building Educational Applications, NAACL, 2015.

R. C. van Dalen, K. M. Knill, and M. J. F. Gales. Automatically Grading Learners' English Using a Gaussian Process. Proceedings of SLaTE, 2015.

R. C. van Dalen, K. M. Knill, P. Tsiakoulis, and M. J. F. Gales. Improving Multiple-Crowd-Sourced Transcriptions Using a Speech Recogniser. Proceedings of ICASSP, 2015.

2014

Andrew Caines and Paula Buttery. The effect of disfluencies and learner errors on the parsing of spoken learner language. Proceedings of SPMRL-SANCL Workshop, co-located with COLING, 2014.

Ekaterina Kochmar and Ted Briscoe. Detecting Learner Errors in the Choice of Content Words Using Compositional Distributional Semantics. Proceedings of COLING, 2014.

Ng, H.T., S. Wu, Ted Briscoe, C. Hadiwinoto, R. Susanto and C. Bryant. The CoNLL-2014 Shared Task on Grammatical Error Correction. Proceedings of CoNLL, 2014.

Felice and Zheng Yuan. Generating artificial errors for grammatical error correction. Proceedings of the EACL 2014 Student Research Workshop, 2014.

Mariano Felice, Zheng Yuan, Øistein E. Andersen, Helen Yannakoudakis and Ekaterina Kochmar. Grammatical error correction using hybrid systems and type filtering. CoNLL Shared task Proceedings, 2014.

2013

Helen Yannakoudakis. Automated Assessment of English-learner Writing. University of Cambridge, Computer Laboratory, TR-842, 2013.

Øistein E. Andersen, Helen Yannakoudakis, Fiona Barker and Tim Parish. Developing and Testing a Self-Assessment and Tutoring System. In Proceedings of the NAACL 2013 workshop on Innovative Use of Natural Language Processing for Building Educational Applications, 2013.

Helen Yannakoudakis, Gad Lim, Øistein E. Andersen, Ted Briscoe and Fiona Barker. Automatic Writing Assessment and Feedback: An Approach to Improve Construct and Consequential Validity. In Proceedings of the Language Testing Research Colloquium, 2013.

Theodora Alexopoulou, Helen Yannakoudakis and Angeliki Salamoura. Classifying Intermediate Learner English: A Data-driven Approach to Learner Corpora. In S. Granger, G. Gilquin & F. Meunier (eds) Twenty Years of Learner Corpus Research: Looking back, Moving ahead. Corpora and Language in Use – Proceedings 1, Louvain-la-Neuve: Presses universitaires de Louvain, 2013.

Helen Yannakoudakis, Ted Briscoe and Theodora Alexopoulou. Automated Assessment models, Visual User Interfaces, and Second Language Acquisition research: an interdisciplinary perspective. In Language Sciences in the 21st Century: The interdisciplinary challenge, 2013.

Zheng Yuan and Mariano Felice. Constrained grammatical error correction using Statistical Machine Translation. CoNLL Shared task Proceedings, 2013.

Earlier Related Publications

English Profile Programme

NaturalLanguage/ALTA (last edited 2016-10-05 08:54:08 by host109-146-24-111)