The ALTA Institute

The Institute for Automated Language Teaching and Assessment (ALTA) is funded by Cambridge English Language Assessment to conduct research into improving the linguistic proficiency of learners and into automated forms of assessment of language performance. The Institute was founded in October 2013 and is administered by the Computer Laboratory. It also includes principal investigators from the Department of Engineering and the Department of Theoretical and Applied Linguistics.

Groups

The Institute comprises researchers from the NLIP group, the DTAL group, the speech group at the Machine Intelligence Laboratory, and the machine learning group.

People

The principal investigators are:

The institute consists of the following research associates, PhD students, and collaborators:

Former Members:

Videos, Links, Software

ALTA Publications (most are open access; cut and paste a title into e.g. Google Scholar)

2017

Christopher Bryant, Mariano Felice, and Ted Briscoe. Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction. Association for Computational Linguistics, 2017.

Andrew Caines. Spoken CALL Shared Task system description. Proceedings of SLaTE, 2017.

Andrew Caines, Michael McCarthy and Paula Buttery. Parsing transcripts of speech. Proceedings of SCNLP, 2017.

Andrew Caines, Emma Flint and Paula Buttery. Collecting fluency corrections for spoken learner English. Proceedings of BEA, 2017.

Xie Chen, Xunying Liu, Anton Ragni, Yu Wang, Mark Gales. Future Word Contexts in Neural Network Language Models. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2017.

Xie Chen, Anton Ragni, Xunying Liu, Mark Gales. Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition. INTERSPEECH, 2017.

Youmna Farag, Marek Rei, Ted Briscoe. An Error-Oriented Approach to Word Embedding Pre-Training. 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017.

Emma Flint, Elliot Ford, Olivia Thomas, Andrew Caines and Paula Buttery. A text normalisation system for non-standard English words. Proceedings of WNUT, 2017.

Graham, C. & Post, B. Second Language Acquisition of intonation: Peak alignment in English. Journal of Phonetics, 65, 160-174. 2017.

Graham, C., Caines, A., Buttery, P., Moore, R., Nolan, F. Automatic detection and feedback for lexical stress errors in non-native English. International Symposium on Applied Phonetics. 2018.

Kate Knill, Mark Gales, Konstantinos Kyriakopoulos, Anton Ragni, Yu Wang. Use of Graphemic Lexicons for Spoken Language Assessment. INTERSPEECH, 2017.

Konstantinos Kyriakopoulos, Kate Knill, Mark Gales. Automatic characterisation of the pronunciation of non-native English speakers using phone distance features. SLaTE, 2017.

Andrey Malinin, Kate Knill, Anton Ragni, Yu Wang, Mark Gales. An attention based model for off-topic spontaneous spoken response detection: An Initial Study. SLaTE, 2017.

Andrey Malinin, Anton Ragni, Mark Gales, Kate Knill. Incorporating Uncertainty into Deep Learning for spoken language assessment. Association for Computational Linguistics, 2017.

Marek Rei. Semi-supervised Multitask Learning for Sequence Labeling. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017.

Marek Rei and Helen Yannakoudakis. Auxiliary Objectives for Neural Error Detection Models. In Proceedings of the 12th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications, 2017.

Marek Rei, Mariano Felice, Zheng Yuan and Ted Briscoe. Artificial Error Generation with Machine Translation and Syntactic Patterns. In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017.

Marek Rei. Detecting Off-topic Responses to Visual Prompts. In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017.

Marek Rei, Luana Bulat, Douwe Kiela and Ekaterina Shutova. Grasping the Finer Point: A Supervised Similarity Network for Metaphor Detection. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017.

Ekaterina Shutova, Andreas Wundsam and Helen Yannakoudakis. Semantic frames and visual scenes: Learning semantic role inventories from image and video descriptions. In Proceedings of the 6th Joint Conference on Lexical and Computational Semantics: *SEM, 2017.

Helen Yannakoudakis, Marek Rei, Øistein E. Andersen and Zheng Yuan. Neural Sequence-Labelling Models for Grammatical Error Correction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2017.

2016

Diane Litman, Steve Young, Mark Gales, Kate Knill, Karen Ottewell, Rogier van Dalen and David Vandyke. Towards Using Conversations with Spoken Dialogue Systems in the Automated Assessment of Non-Native Speakers of English. SIGDIAL, 2016.

Marek Rei, Sampo Pyysalo, Gamal K.O. Crichton. Attending to characters in neural sequence labeling models. COLING, 2016.

Christopher Bryant, Mariano Felice, Ted Briscoe. Automatic Extraction of Learner Errors in ESL Sentences Using Linguistically Enhanced Alignments. COLING, 2016.

Russell Moore, Andrew Caines, Calbert Graham, Paula Buttery. Automated speech-unit delimitation in spoken learner English. COLING, 2016.

A. Herbelot, E. Kochmar. Calling on the classical phone: a distributional model of adjective-noun errors in learners' English. COLING, 2016.

Calbert Graham, Paula Buttery, Francis Nolan. Vowel characteristics in the assessment of L2 English pronunciation. INTERSPEECH, 2016.

A. Malinin, R.C. van Dalen, Y. Wang, M.J.F. Gales, K.M. Knill. Off-topic Response Detection for Spontaneous Spoken English Assessment. Association for Computational Linguistics, 2016.

C. Graham, F Nolan, A Caines, P Buttery. Toward an automated pronunciation training system for English L2 learners: theoretical and methodological issues. International Symposium on Applied Phonetics, 2016.

Andrew Caines, Christian Bentz, Calbert Graham, Tim Polzehl, Paula Buttery. Crowdsourcing a multilingual speech corpus: recording, transcription and annotation of the CrowdED Corpus. Proceedings of the Tenth International Conference on Language Resources and Evaluation, 2016.

Ronan Cummins, Meng Zhang, Ted Briscoe. Constrained Multi-Task Learning for Automated Essay Scoring. Association for Computational Linguistics, 2016.

Menglin Xia, Ekaterina Kochmar, Ted Briscoe. Text Readability Assessment for Second Language Learners. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

Marek Rei, Helen Yannakoudakis. Compositional Sequence Labeling Models for Error Detection in Learner Writing. Association for Computational Linguistics, 2016.

Marek Rei, Ronan Cummins. Sentence Similarity Measures for Fine-Grained Estimation of Topical Relevance in Learner Essays. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

Zheng Yuan, Ted Briscoe, Mariano Felice. Candidate re-ranking for SMT-based grammatical error correction. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

Zheng Yuan, Ted Briscoe. Grammatical error correction using neural machine translation. Association for Computational Linguistics, 2016.

Ekaterina Kochmar, Ekaterina Shutova. Cross-Lingual Lexico-Semantic Transfer in Language Learning. Association for Computational Linguistics, 2016.

Dimitrios Alikaniotis, Helen Yannakoudakis, Marek Rei. Automatic Text Scoring Using Neural Networks. Association for Computational Linguistics, 2016.

Ronan Cummins, Helen Yannakoudakis, Ted Briscoe. Unsupervised Modeling of Topical Relevance in L2 Learner Text. 11th Workshop on Innovative Use of NLP for Building Educational Applications, 2016.

2015

Shixiang Gu, Zoubin Ghahramani, Richard E. Turner. Neural Adaptive Sequential Monte Carlo. Advances in Neural Information Processing Systems, 2015.

Nilesh Tripuraneni, Shane Gu, Hong Ge, Zoubin Ghahramani. Particle Gibbs for Infinite Hidden Markov Models. Advances in Neural Information Processing Systems, 2015.

Mariano Felice and Ted Briscoe. Towards a standard evaluation method for grammatical error detection and correction. Proceedings of NAACL-HLT, 2015.

C Graham, A Caines, P Buttery. Phonetic and prosodic features in automated spoken language assessment. Workshop on Phonetic Learner Corpora, International Congress of the Phonetic Sciences (ICPhS), 2015.

C Graham, F Nolan, A Caines, P Buttery. Vowel characteristics in automated assessment. International Phonetics & Phonology in Europe (PaPE) conference, 2015.

Ekaterina Kochmar and Ted Briscoe. Using Learner Data to Improve Error Correction in Adjective--Noun Combinations. Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015.

Russell Moore, Andrew Caines, Calbert Graham and Paula Buttery. Incremental Dependency Parsing and Disfluency Detection in Spoken Learner English. Proceedings of Text, Speech and Dialogue (TSD), 2015.

Marek Rei. Online Representation Learning in Recurrent Neural Language Models. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015.

Helen Yannakoudakis and Ronan Cummins. Evaluating the performance of Automated Text Scoring systems. Proceedings of the 10th workshop on Innovative Use of NLP for Building Educational Applications, NAACL, 2015.

R. C. van Dalen, K. M. Knill, and M. J. F. Gales. Automatically Grading Learners' English Using a Gaussian Process. Proceedings of SLaTE, 2015.

R. C. van Dalen, K. M. Knill, P. Tsiakoulis, and M. J. F. Gales. Improving Multiple-Crowd-Sourced Transcriptions Using a Speech Recogniser. Proceedings of ICASSP, 2015.

2014

Andrew Caines and Paula Buttery. The effect of disfluencies and learner errors on the parsing of spoken learner language. Proceedings of SPMRL-SANCL Workshop, co-located with COLING, 2014.

Ekaterina Kochmar and Ted Briscoe. Detecting Learner Errors in the Choice of Content Words Using Compositional Distributional Semantics. Proceedings of COLING, 2014.

Ng, H.T., S. Wu, Ted Briscoe, C. Hadiwinoto, R. Susanto and C. Bryant. The CoNLL-2014 Shared Task on Grammatical Error Correction. Proceedings of CoNLL, 2014.

Mariano Felice and Zheng Yuan. Generating artificial errors for grammatical error correction. Proceedings of the EACL 2014 Student Research Workshop, 2014.

Mariano Felice, Zheng Yuan, Øistein E. Andersen, Helen Yannakoudakis and Ekaterina Kochmar. Grammatical error correction using hybrid systems and type filtering. CoNLL Shared task Proceedings, 2014.

2013

Helen Yannakoudakis. Automated Assessment of English-learner Writing. University of Cambridge, Computer Laboratory, TR-842, 2013.

Øistein E. Andersen, Helen Yannakoudakis, Fiona Barker and Tim Parish. Developing and Testing a Self-Assessment and Tutoring System. In Proceedings of the NAACL 2013 workshop on Innovative Use of Natural Language Processing for Building Educational Applications, 2013.

Helen Yannakoudakis, Gad Lim, Øistein E. Andersen, Ted Briscoe and Fiona Barker. Automatic Writing Assessment and Feedback: An Approach to Improve Construct and Consequential Validity. In Proceedings of the Language Testing Research Colloquium, 2013.

Theodora Alexopoulou, Helen Yannakoudakis and Angeliki Salamoura. Classifying Intermediate Learner English: A Data-driven Approach to Learner Corpora. In S. Granger, G. Gilquin & F. Meunier (eds) Twenty Years of Learner Corpus Research: Looking back, Moving ahead. Corpora and Language in Use – Proceedings 1, Louvain-la-Neuve: Presses universitaires de Louvain, 2013.

Helen Yannakoudakis, Ted Briscoe and Theodora Alexopoulou. Automated Assessment models, Visual User Interfaces, and Second Language Acquisition research: an interdisciplinary perspective. In Language Sciences in the 21st Century: The interdisciplinary challenge, 2013.

Zheng Yuan and Mariano Felice. Constrained grammatical error correction using Statistical Machine Translation. CoNLL Shared task Proceedings, 2013.

Earlier Related Publications

English Profile Programme

NaturalLanguage/ALTA (last edited 2017-10-18 09:56:05 by host81-158-249-105)