Publications
CDT students are shown in bold in the list below.
2023
R Flynn, A Ragni (2023). Leveraging Cross-Utterance Context For ASR Decoding. Proceedings of INTERSPEECH 2023, Dublin, Ireland, 20-24 August 2023.
S Vincent, R Flynn, C Scarton (2023). MTCUE: Learning Zero-Shot Control of Extra-Textual Attributes by Leveraging Unstructured Context in Neural Machine Translation. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 9-14, 2023.
G Huang, RK Moore (2023). Using social robots for language learning: are we there yet? Journal of China Computer-Assisted Language Learning.
T Goldsack, Z Lou, Q Xie, C Scarton, M Shardlow, S Ananiadou, C Lin (2023). Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles. Proceedings of the 22nd Workshop on Biomedical Language Processing co-located at the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 9-14, 2023.
E Gow-Smith, D Sánchez Villegas (2023). Sheffield's Submission to the AmericasNLP Shared Task on Machine Translation into Indigenous Languages. Proceedings of Third Workshop on NLP for Indigenous Languages of the Americas, co-located at the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 9-14, 2023. Best-performing submission overall to the AmericasNLP 2023 Shared Task. Code and models available.
JA Sivakumar, NS Moosavi (2023). FERMAT: An Alternative to Accuracy for Numerical Reasoning. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 9-14, 2023.
W Ravenscroft, S Goetze, T Hain (2023). On Data Sampling Strategies for Training Neural Network Speech Separation Models. Proceedings of the 31st European Signal Processing Conference (EUSIPCO 2023), Helsinki, Finland.
H Yusufali, S Goetze, RK Moore (2023). Bridging the Communication Rate Gap: Enhancing Text Input for Augmentative and Alternative Communication (AAC). Proceedings of the 25th International Conference on Human-Computer Interaction (HCII) 2023, Copenhagen Denmark.
G Close, T Hain, S Goetze (2023) PAMGAN+/-: Improving Phase-Aware Speech Enhancement Performance via Expanded Discriminator Training. Proceedings of the 154th Audio Engineering Society (AES) Convention, Espoo, Helsinki, Finland, May 13–15, 2023. Winner of the Student Technical Paper Award.
T Loakman, C Tang, C Lin (2023). TwistList: Resources and Baselines for Tongue Twister Generation. Proceedings of the 61st Conference of the Association for Computational Linguistics (ACL 2023, Toronto, Short Papers)
C Tang, H Zhang, T Loakman, C Lin and F Guerin (2023). Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation. Proceedings of the 61st Conference of the Association for Computational Linguistics (ACL 2023, Toronto, Long Papers)
T Pickard, T Loakman, and M Pandya (2023). shefnlp at SemEval-2023 Task 10: Compute-Efficient Category Adapters. Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Toronto, Canada. Association for Computational Linguistics.
A Savary, C Ben Khelil, C Ramisch et al. (including T Pickard) (2023). PARSEME corpus release 1.3. Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023), Dubrovnik, Croatia, May 2023.
M Thomas, S Hollands, D Blackburn, H Christensen (2023). Towards disfluency features for speech technology based automatic dementia classification. Proceedings of the 20th International Congress of the Phonetic Sciences (ICPhS 2023), Prague Congress Centre, Czech Republic, 7-11 August 2023.
M Thomas, N Pevy, T Walker (2023). Disfluencies and Cognitive Decline: An Investigative Study. Proceedings of the biennial symposium of the International Clinical Phonetics and Linguistics Association (ICPLA), University of Salzburg, 4-7 July 2023.
M Hewitt, T Rodríguez Muñoz, G Huang (2023). Melodica: An Affordable Music Companion. In Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction (HRI '23). Association for Computing Machinery, New York, NY, USA, 806–809.
C Tang, H Zhang, T Loakman, C Lin, F Guerin (2023). Terminology-aware Medical Dialogue Generation. Proceedings of the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023).
S Ellis, S Goetze, H Christensen (2023). Moving Towards Non-Binary Gender Identification Via Analysis of System Errors in Binary Gender Classification. Proceedings of the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023).
W Ravenscroft, S Goetze, T Hain (2023). Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation. Proceedings of the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023).
G Close, W Ravenscroft, T Hain, S Goetze (2023). Perceive and predict: self-supervised speech representation based loss functions for speech enhancement. Proceedings of the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023).
T Goldsack, Z Zhang, C Lin, C Scarton (2023). Domain-driven and Discourse-guided Scientific Summarisation. Proceedings of the 45th European Conference on Information Retrieval (ECIR 2023).
2022
E Gow-Smith, H Tayyar Madabushi, C Scarton, A Villavicencio (2022). Improving Tokenisation by Alternative Treatment of Spaces. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022).
T Goldsack, Z Zhang, C Lin, C Scarton (2022). Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022).
H Huang, C Tang, T Loakman, F Guerin, C Lin (2022). Improving Chinese Story Generation via Awareness of Syntactic Dependencies and Semantics. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 178–185. Association for Computational Linguistics.
C Tang, Z Zhang, T Loakman, C Lin, F Guerin (2022). NGEP: A Graph-based Event Planning Framework for Story Generation. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 186–193. Association for Computational Linguistics.
G Huang, Roger K. Moore (2022). Is honesty the best policy for mismatched partners? Aligning multi-modal affordances of a social robot: An opinion paper. Frontiers in Virtual Reality, 16 September 2022.
T Rodríguez Muñoz, E Ip, G Huang, R K Moore (2022). Interactivism in Spoken Dialogue Systems. Proceedings of the 26th Workshop on the Semantics and Pragmatics of Dialogue (SemDial 2022), August, 22-24, 2022, Dublin, pp 263-265.
W Ravenscroft, S Goetze, T Hain (2022). Utterance Weighted Multi-Dilation Temporal Convolution Networks for Monaural Speech Dereverberation. Proceedings of the 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022), Bamberg, Germany.
S Hollands, D Blackburn, H Christensen (2022). Evaluating the Performance of State-of-the-Art ASR Systems on Non-Native English using Corpora with Extensive Language Background Variation. Proceedings of INTERSPEECH 2022 (Incheon, Korea).
G Close, S Hollands, S Goetze, T Hain (2022). Non-intrusive Speech Intelligibility Metric Prediction for Hearing Impaired Individuals. Proceedings of INTERSPEECH 2022 (Incheon, Korea).
S Vincent, L Barrault, C Scarton (2022). Controlling Extra-Textual Attributes about Dialogue Participants: A Case Study of English-to-Polish Neural Machine Translation. Proceedings of the 23rd Annual Conference of the European Association for Machine Translation (EAMT 2022).
S Vincent, L Barrault, C Scarton (2022). Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022. Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022).
D Phelps, X Fan, E Gow-Smith, H Tayyar Madabushi, C Scarton, A Villavicencio. Sample Efficient Approaches for Idiomaticity Detection. Proceedings of the 18th Workshop on Multiword Expressions, colocated with LREC 2022 (Marseille, France).
E Gow-Smith, M McConville, W Gillies, J Scott, R Ó Maolalaigh. Use of Transformer-Based Models for Word-Level Transliteration of the Book of the Dean of Lismore. Proceedings of the 4th Celtic Language Technology Workshop, colocated with LREC 2022 (Marseille, France).
G Close, T Hain, S Goetze (2022). MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. Proceedings of 30th European Signal Processing Conference (EUSIPCO 2022).
W Ravenscroft, S Goetze, T Hain (2022). Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. Proceedings of 30th European Signal Processing Conference (EUSIPCO 2022).
X Ao, D Sánchez Villegas, D Preoţiuc-Pietro, N Aletras (2022). Combining Humor and Sarcasm for Improving Political Parody Detection. Proceedings of NAACL 2022.
TAF Green, D Maynard, C Lin (2020). Development of a Benchmark Corpus to Support Entity Recognition in Job Descriptions. Proceedings of LREC 2022.
H Tayyar Madabushi, E Gow-Smith, M Garcia, C Scarton, M Idiart, A Villavicencio (2022). SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding. 16th International Workshop on Semantic Evaluation (SemEval-2022) co-located with NAACL.
W Ravenscroft, S Goetze and T Hain (2022). Att-TasNet: Attending to Encodings in Time-Domain Audio Speech Separation of Noisy, Reverberant Speech Mixtures. Frontiers in Signal Processing: Signal Processing Theory.
A Alajrami and N Aletras (2022). How does the pre-training objective affect what large language models learn about linguistic properties? Proceedings of ACL 2022
M Thomas (2022). 'Speech and the Effects of the Comorbidity of Depression and MCI'. Proceedings of Sheffield Dementia Conference 2022.
2021
H Tayyar Madabushi, E Gow-Smith, C Scarton and A Villavicencio (2021). AStitchInLanguageModels: Dataset and methods for the exploration of idiomaticity in pre-trained language models. Findings of EMNLP 2021.
D Sánchez Villegas and N Aletras (2021). Point-of-interest type prediction using text and images. Proceedings of EMNLP 2021.
D Sánchez Villegas, S Mokaram, and N Aletras (2021). Analysing online political advertisements. Proceedings of ACL Findings 2021.
S Vincent (2021). Towards personalised and document-level machine translation of dialogue. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop.
P Vickers, N Aletras, E Monti and L Barrault (2021). In factuality: Efficient integration of relevant facts for visual question answering. Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021).
P Vickers, R Wainwright, H Tayyar Madabushi and A Villavicencio (2021). CogNLP-Sheffield at CMCL 2021 shared task: Blending cognitively inspired features with transformer-based language models for predicting eye tracking patterns. Proceedings of the Cognitive Modelling and Computational Linguistics (CMCL) Workshop 2021, 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL).
2020
A Maronikolakis, D Sanchez Villegas, D Preoţiuc-Pietro, N Aletras (2020). Analysing political parody in social media. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 4373-4384.
D Sánchez Villegas, D Preoţiuc-Pietro and N Aletras (2020). Point-of-interest type inference from social media text. Proceedings of the First Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (AACL).