Publications

25th IET Irish Signals & Systems Conference 2014 and 2014 China-Ireland International Conference on Information and Communities Technologies (ISSC 2014/CIICT 2014)

DOI URL

Randomness and the reverberation time, RT $<$ inf, of acoustic responses

Ian J. Kelly, Frank Boland

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

DOI URL

Prediction Quality Assessment

Matjaž Kukar

Conformal Prediction for Reliable Machine Learning , pp. 145--166

DOI URL

Effect of long-term ageing on i-vector speaker verification

David van Leeuwen, Finnian Kelly, Rahim Saeidi, Naomi Harte

InterSpeech 2014

Virtual 5.1 Surround Sound Localization using Head-Tracking Devices

B.C. O'Toole, L. O'Sullivan, Ian J. Kelly, Frank Boland, Marcin Gorzel et al.

25th IET Irish Signals & Systems Conference 2014 and 2014 China-Ireland International Conference on Information and Communities Technologies (ISSC 2014/CIICT 2014)

DOI URL

Assessment of Audio/Video synchronisation in streaming media

François Pitié, Damien Kelly, Thierry Foucu, Naomi Harte, Anil Kokaram

2014 Sixth International Workshop on Quality of Multimedia Experience (QoMEX)

DOI URL

Towards Automated Classification of Seabed Substrates in Underwater Video

Matthew Pugh, Bernard Tiddeman, Hannah Dee, Philip Hughes

2014 ICPR Workshop on Computer Vision for Analysis of Underwater Imagery

DOI URL

Bleed-Through Document Image Restoration

Róisín Rowley-Brooke

Mosaics for Nephrops detection in underwater survey videos

Ken Sooknanan, Jennifer Doyle, Colm Lordan, James Wilson, Anil Kokaram et al.

2014 Oceans - St. John's

DOI URL

Classification of Seabed Type from Underwater Video

Steven Tyner, James Wilson, David Corrigan

Irish Machine Vision and Image Processing Conference (IMVIP)

Automated registration of low and high resolution atomic force microscopy images using scale invariant features

Yun-Feng Wang, Jason I. Kilpatrick, Suzanne Jarvis, Frank Boland, Anil Kokaram et al.

2014 IEEE International Conference on Image Processing (ICIP)

DOI URL

2013

Exploiting randomness in acoustic impulse responses to achieve headphone compensation through deconvolution

Ian J. Kellyand Frank Boland

The Journal of the Acoustical Society of America 133 (5) , vol. 133 , no. 5 , pp. 2778--2787

Depth perception of audio sources in stereo 3D environments

David Corrigan, Marcin Gorzel, John Squires, Frank Boland

Stereoscopic Displays and Applications XXIV

DOI URL

Creaky Voice and the Classification of Affect

Ailbhe Cullen, John Kane, Thomas Drugman, Naomi Harte

Workshop on Affective Social Speech Signals (WASSS)

Late Integration of Features for Acoustic Emotion Recognition

Ailbhe Cullen, Naomi Harte

European Signal Processing Conference (EUSIPCO)

Blotch and scratch removal in archived film using a semi-transparent corruption model and a ground-truth generation technique

Mohamed A Elgharib, François Pitié, Anil Kokaram

Journal on Image and Video Processing , vol. 2013 , no. 1

DOI URL

User-assisted reflection detection and feature point tracking

Mohamed A. Elgharib, François Pitié, Anil Kokaram and Venkatesh Saligrama

Proceedings of the 10th European Conference on Visual Media Production - CVMP '13

DOI URL

Identifying new bird species from differences in birdsong.

Naomi Harte, Sadhbh Murphy, David J. Kelly, Nicola M. Marples

Interspeech , pp. 2900--2904

Detailed comparative analysis of PESQ and VISQOL behaviour in the context of playout delay adjustments introduced by VOIP jitter buffer algorithms

Andrew Hines, Peter Pocta, Hugh Melvin

2013 Fifth International Workshop on Quality of Multimedia Experience (QoMEX)

DOI URL

Monitoring the Effects of Temporal Clipping on VoIP Speech Quality

Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte

Interspeech 2013

Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA

Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte

2013 IEEE International Conference on Acoustics, Speech and Signal Processing

DOI URL

Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition

Finnian Kelly, Naomi Harte

InterSpeech 2013

Eigenageing Compensation for Speaker Verification

Finnian Kelly, Niko Brummer, Naomi Harte

InterSpeech 2013

Exploiting randomness in acoustic impulse responses to achieve headphone compensation through deconvolution

Ian J. Kelly, Frank Boland

The Journal of the Acoustical Society of America , vol. 133 , no. 5 , pp. 2778--2787

DOI URL

Speaker verification in score-ageing-quality classification space

Finnian Kelly, Andrzej Drygajlo, Naomi Harte

Computer Speech $&$ Language , vol. 27 , no. 5 , pp. 1068--1084

DOI URL

The impact of ageing on speech-based biometric systems

Finnian Kelly, Naomi Harte

'Age Factors in Biometric Processing'

Shape Models for Image Segmentation in Microscopy

Kangyu Pan

Adaptive video stabilisation with dominant motion layer estimation for home video and TV broadcast

Félix Raimbault, Yalcin Incesu

2013 IEEE International Conference on Image Processing

DOI URL

User-assisted sparse stereo-video segmentation

Félix Raimbault, François Pitié, Anil Kokaram

Proceedings of the 10th European Conference on Visual Media Production - CVMP '13

DOI URL

A Non-parametric Framework for Document Bleed-through Removal

Róisín Rowley-Brooke, François Pitié, Anil Kokaram

2013 IEEE Conference on Computer Vision and Pattern Recognition

DOI URL

Degraded manuscript restoration: A case study

Róisín Rowley-Brooke, François Pitié, Anil Kokaram

Annual Conference of the Society for Musicology in Ireland (SMI)

Nonrigid recto-verso registration using page outline structure and content preserving warps

Róisín Rowley-Brooke, François Pitié, Anil Kokaram

Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing - HIP '13

DOI URL

Residual Life Prediction of Rotating Machines Using Acoustic Noise Signals

Patricia Scanlon, Darren F. Kavanagh, Frank Boland

IEEE Transactions on Instrumentation and Measurement , vol. 62 , no. 1 , pp. 95--108

DOI URL

Mosaics For Burrow Detection in Underwater Surveillance Video

Ken Sooknanan, Jennifer Doyle, James Wilson, Naomi Harte, Anil Kokaram et al.

Oceans 2013

2012

Phoneme-to-Viseme Mapping for Visual Speech Recognition

Luca Cappelletta, Naomi Harte

International Conference on Patter Recognition Applications and Methods (ICPRAM) , vol. 2 , pp. 322--329

Algorithms for the Digital Restoration of Torn Films

David Corrigan, Anil Kokaram, Naomi Harte

IEEE Transactions on Image Processing , vol. 21 , no. 2 , pp. 573--587

DOI URL

Lower and upper bounds for approximation of the Kullback-Leibler divergence between Gaussian Mixture Models

J.-L. Durrieu, J.-Ph. Thiran, Finnian Kelly

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

DOI URL

Distance Perception in Virtual Audio-Visual Environments

Marcin Gorzel, David Corrigan, Gavin Kearney, John Squires and Frank Boland

25th AES UK Conference: Spatial Audio in Today's 3D World

Improved Speech Intelligibility with a Chimaera Hearing Aid Algorithm

Andrew Hines, Naomi Harte

InterSpeech 2012

Predicting Speech Intelligibility

Andrew Hines

Speech intelligibility prediction using a Neurogram Similarity Index Measure

Andrew Hines, Naomi Harte

Speech Communication , vol. 54 , no. 2 , pp. 306--320

DOI URL

ViSQOL: The Virtual Speech Quality Objective Listener

Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte

International Workshop on Acoustic Signal Enhancement (IWAENC)

Distance Perception in Interactive Virtual Acoustic Environments using First and Higher Order Ambisonic Sound Fields

Gavin Kearney, Marcin Gorzel, Henry Rice, Frank Boland

Acta Acustica united with Acustica , vol. 98 , no. 1 , pp. 61--71

DOI URL

On loudspeaker rendering of auditory distance in higher order Ambisonics

Gavin Kearney, Marcin Gorzel, Frank Boland

Acoustics 2012

On Phase and Randomness in Head Related Impulse Responses

Ian J. Kelly, Frank Boland

9th IMA International Conference on Mathematics in Signal Processing

Speaker verification with long-term ageing data

Finnian Kelly, Andrzej Drygajlo, Naomi Harte

2012 5th IAPR International Conference on Biometrics (ICB)

DOI URL

HRIR Order Reduction Using Approximate Factorization

Claire Masterson, Gavin Kearney, Marcin Gorzel, Frank Boland

IEEE Transactions on Audio, Speech, and Language Processing , vol. 20 , no. 6 , pp. 1808--1817

DOI URL

A wavelet-based Bayesian framework for 3D object segmentation in microscopy

Kangyu Pan, David Corrigan, Jens Hillebrand, Mani Ramaswami and Anil Kokaram

Three-Dimensional and Multidimensional Microscopy: Image Acquisition and Processing XIX

DOI URL

Stereo video completion for rig and artefact removal

Félix Raimbault, François Pitié, Anil Kokaram

2012 13th International Workshop on Image Analysis for Multimedia Interactive Services

DOI URL

Stereo-video inpainting

Félix Raimbault

J. Electron. Imaging , vol. 21 , no. 1 , pp. 011005

DOI URL

A Ground Truth Bleed-Through Document Image Database

Róisín Rowley-Brooke, François Pitié, Anil Kokaram

Theory and Practice of Digital Libraries , pp. 185--196

DOI URL

Bleed-through removal in degraded documents

Róisín Rowley-Brooke, Anil Kokaram

Document Recognition and Retrieval XIX

DOI URL

Improving underwater visibility using vignetting correction

Ken Sooknanan, Anil Kokaram, David Corrigan, Gary Baugh, James Wilson et al.

Visual Information Processing and Communication III

DOI URL

Indexing and selection of well-lit details in underwater video mosaics using vignetting estimation

Ken Sooknanan, Anil Kokaram, David Corrigan, Gary Baugh, Naomi Harte et al.

2012 Oceans - Yeosu

DOI URL

Restoration of high-resolution AFM images captured with broken probes

Y. F. Wang, David Corrigan, C. Forman, Suzanne Jarvis, Anil Kokaram

Three-Dimensional and Multidimensional Microscopy: Image Acquisition and Processing XIX

DOI URL

2011

Motion Estimation for Regions of Reflections through Layer Separation

Mohamed Abdelaziz Ahmed, François Pitié, Anil Kokaram

2011 Conference for Visual Media Production

DOI URL

Reflection detection in image sequences

Mohamed Abdelaziz Ahmed, François Pitié, Anil Kokaram

Cvpr 2011

DOI URL

An Extended Multiresolution Approach to Mouth Specific AAM Fitting for Speech Recognition

Craig Berry, Anil Kokaram, Naomi Harte

European Signal Processing Conference (Eusipco)

Viseme Definitions Comparison for Visual-Only Speech Recognition

Luca Cappelletta, Naomi Harte

European Signal Processing Conference (Eusipco)

Restoration of Image Burnout in 3D-Stereoscopic Media Using Inter-View Gradient Interpolation

David Corrigan, François Pitié, Anil Kokaram

European Signal Processing Conference (Eusipco)

Restoring Image Burnout in 3D-Stereoscopic Media using Temporally Consistent Disparity Maps

David Corrigan, François Pitié, Anil Kokaram

Irish Signals and Systems Conference

Handling Transparency in Digital Video

Mohamed A. Elgharib

On the Perception of Dynamic Sound Sources in Ambisonic Binaural Renderings

Marcin Gorzel, Gavin Kearney, Henry Rice, Frank Boland

AES 41st International Conference

Comparing hearing aid algorithm performance using Simulated Performance Intensity Functions

Andrew Hines, Naomi Harte

Speech perception and auditory disorders, Int. Symposium on Audiological and Auditory Research (ISAAR)

Reproduction of the performance/intensity function using image processing and a computational model (A)

Andrew Hines, Naomi Harte

Int J Audiol , vol. 50 , no. 10 , pp. 723

DOI URL

Simulated performance intensity functions

Andrew Hines, Naomi Harte

2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society

DOI URL

Real-time walkthrough auralisation of the acoustics of Christ Church cathedral Dublin

Gavin Kearney, Marcin Gorzel, Frank Boland, F. Smyth, D. Lennon et al.

Proc of the Institute of Acoustics , vol. 33 , pp. 244--258

Effects of Long-Term Ageing on Speaker Verification

Finnian Kelly, Naomi Harte

Lecture Notes in Computer Science , pp. 113--124

DOI URL

Cellsnake: A new active contour technique for cell/fibre segmentation

Kangyu Pan, Anil Kokaram, Kerry Gilmore, Michael J. Higgins and Robert Kapsa, Gordon G. Wallace

2011 18th IEEE International Conference on Image Processing

DOI URL

Stereo video inpainting

Félix Raimbault, Anil Kokaram

Stereoscopic Displays and Applications XXII

DOI URL

Bleed-Through Removal in Degraded Manuscripts

Róisín Rowley-Brooke, Anil Kokaram

Irish Signals and Systems Conference

Degraded Document Bleed-Through Removal

Róisín Rowley-Brooke, Anil Kokaram

2011 Irish Machine Vision and Image Processing Conference

DOI URL

2010

Semi-automatic motion based segmentation using long term motion trajectories

Gary Baugh, Anil Kokaram

2010 IEEE International Conference on Image Processing

DOI URL

Nostril detection for robust mouth tracking

Luca Cappelletta, Naomi Harte

IET Irish Signals and Systems Conference (ISSC 2010)

DOI URL

A Video Database for the Development of Stereo-3D Post-Production Algorithms

David Corrigan, François Pitié, Valerie Morris, Andrew Rankin, M. Linnane et al.

2010 Conference on Visual Media Production

DOI URL

Evaluating Sensorineural Hearing Loss With An Auditory Nerve Model Using: A Mean Structural Similarity Measure

Andrew Hines, Naomi Harte

European Signal Processing Conference (EUSIPCO '10)

Speech intelligibility from image processing

Andrew Hines, Naomi Harte

Speech Communication , vol. 52 , no. 9 , pp. 736--752

DOI URL

Depth perception in interactive virtual acoustic environments using higher order ambisonic soundfields

Gavin Kearney, Marcin Gorzel, H. Rice, Frank Boland

2nd International Ambisonics and Spherical Acoustics Symposium

A Comparison of Auditory Features for Robust Speech Recognition

Finnian Kelly, Naomi Harte

European Signal Processing Conference (EUSIPCO '10)

Auditory Features Revisited for Robust Speech Recognition

Finnian Kelly, Naomi Harte

2010 20th International Conference on Pattern Recognition

DOI URL

Training GMMs for speaker verification

Finnian Kelly, Naomi Harte

IET Irish Signals and Systems Conference (ISSC 2010)

DOI URL

HRIR Factorisation: A Regularised Approach

C. Masterson, Gavin Kearney, Frank Boland

Euspico 2010 , vol. 2 , pp. 751--755

Optimised virtual loudspeaker reproduction

C. Masterson, Gavin Kearney, Marcin Gorzel, H. Rice, Frank Boland

IET Irish Signals and Systems Conference (ISSC 2010)

DOI URL

Content-Based Media Processing

Deirdre O'Regan

Gaussian mixture models for spots in microscopy using a new split/merge em algorithm

Kangyu Pan, Anil Kokaram, Jens Hillebrand, Mani Ramaswami

2010 IEEE International Conference on Image Processing

DOI URL

Gaussian Mixtures for Intensity Modeling of Spots in Microscopy

Kangyu Pan, Jens Hillebrand, Mani Ramaswami, Anil Kokaram

International Symposium on Biomedical Imaging (ISBI'10) , pp. 121--124

DOI URL

Matting with a depth map

François Pitié, Anil Kokaram

2010 IEEE International Conference on Image Processing

DOI URL

2008

François Pitié, Anil Kokaram, Rozenn Dahyot

CRC Press , pp. 295--321

DOI URL

Publications

2026

The Influence of Binauralizer and HRTF Preprocessing on Objective Loudness in Ambisonics

2025

2024

A Dictionary Based Approach for Removing Out-of-Focus Blur

A Sharpness Based Loss Function for Removing Out-of-Focus Blur

Demystifying the use of Compression in Virtual Production

A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024

Comparative Analysis of Subjective Evaluations for Traditional and Neural-Based Video Enhancement Techniques

Unravelling the Power of Single-Pass Look-Ahead in Modern Codecs for Optimized Transcoding Deployment

2023

Learnable Frontends That Do Not Learn: Quantifying Sensitivity To Filterbank Initialisation

Pushing the Limits of the Wiener Filter in Image Denoising

Fine Grained Spoken Document Summarization Through Text Segmentation

Learnt Deep Hyperparameter Selection in Adversarial Training for Compressed Video Enhancement with a Perceptual Critic

Comparison of HDR quality metrics in Per-Clip Lagrangian multiplier optimisation with AV1

Recommendations for Verifying HDR Subjective Testing Workflows

Subjective Assessment of the Impact of a Content Adaptive Optimiser for Compressing 4K HDR Content With AV1

2022

Learnable Acoustic Frontends in Bird Activity Detection

An Empirical Approach for Optimising the Impact of a Preprocessor in a Transcoding Pipeline

Robo-Identity: Exploring Artificial Identity and Emotion via Speech Interactions

Back to the Future: Extending the Blizzard Challenge 2013

Production characteristics of obstruents in WaveNet and older TTS systems

A Deep Learning post-processor with a perceptual loss function for video compression artifact removal

Direct optimisation of λ for HDR content adaptive transcoding in AV1

2021

Low Resource Species Agnostic Bird Activity Detection

An articulatory study of differences and similarities between stuttered disfluencies and non-pathological disfluencies

Phonetic accommodation in interaction with a virtual language learning tutor: A Wizard-of-Oz study

Synthesizing a Human-like Voice is the Easy Way

Will synthetic speech provide a suitable voice for robots?

Mind your p's and k's -- Comparing obstruents across TTS voices of the Blizzard Challenge 2013

CNN-Based Video Codec Classifier For Multimedia Forensics

A differentiable estimator of VMAF for Video

Near optimal per-clip lagrangian multiplier prediction in hevc

Per-clip and per-bitrate adaptation of the Lagrangian multiplier in video coding

Liaison and Pronunciation Learning in End-to-End Text-to-Speech in French

2020

A Bayesian View of Frame Interpolation and a Comparison with Existing Motion Picture Effects Tools

Can Auditory Nerve models tell us what's different about WaveNet vocoded speech?

Investigation of Auditory Nerve Model Based Analysis for Vocoded Speech Synthesis

Per-clip adaptive Lagrangian multiplier optimisation with low-resolution proxies

Per Clip Lagrangian Multiplier Optimisation for HEVC

Should robots have accents?

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

2019

Articulatory behaviour during disfluencies in stuttered speech

2018

2017

A No-Reference Video Quality Predictor For Compressed Videos

Towards predicting dialog acts from previous speakers non-verbal cues

2016

Anatomy from the outside in: a new on-line surface anatomy guide

The ADAPT entry to the Blizzard Challenge 2016

2015

Direct optimisation of $λ$ for HDR content adaptive transcoding in AV1