Research Fellow in Generative Audio AI

University of Surrey


View All Vacancies
Imaginative and prescient, Speech & Sign Processing
Location:  Guildford
Wage: 
£36,024 to £38,205
each year
Publish Kind:  Full Time
Closing Date: 
23.59 hours BST on Sunday 21 April 2024
Reference:  013724

The College of Surrey is a worldwide neighborhood of concepts and other people, devoted to life-changing training and analysis. 

We’re bold and have a daring imaginative and prescient of what we wish to obtain – shaping ourselves into among the finest universities on this planet, which we’re reaching via the skills and endeavour of each worker.  

Our tradition empowers folks to attain this goal and to collectively, and individually, make an actual distinction.  

 The function

Functions are invited for a Analysis Fellow (RF) place for 12 months inside the Centre for Imaginative and prescient Speech and Sign Processing (CVSSP), College of Surrey, UK, to work within the space of generative AI for audio era with textual content and video prompts. 

 The submit is funded by a number one generative AI startup. The main target shall be to develop generative machine studying fashions and sign processing algorithms for sound era, given prompts from textual content and/or video.  This work is constructed on the current contributions of CVSSP within the generative AI fashions for audio era, corresponding to AudioLDM and Re-AudioLDM, with a concentrate on scaling up the fashions with extra datasets and increasing the fashions to incorporate extra modalities corresponding to video. 

 The post-holder shall be based mostly in CVSSP, and work underneath the path of the Principal Investigator Prof Wenwu Wang, with co-supervision by Prof Mark Plumbley, and in collaboration with the economic associate.

 About you

The post-holder is predicted to have a PhD diploma (or equal) within the space of machine studying, generative AI, acoustic sign processing, cross-modal processing amongst audio, textual content and video, or a associated space in digital engineering, utilized arithmetic, pc science, and statistics. The post-holder is predicted to have robust analytical abilities and programming abilities in Python, Matlab or C/C++.  Choice shall be given to those that have expertise on generative AI fashions, audio era, cross modal translations (corresponding to, textual content to audio, video to audio), however candidates who’ve expertise in machine studying and audio-visual processing are welcome to use.  

 The right way to apply

 Please submit a CV and canopy letter along with your software, on the College web site. For casual inquiries, please contact Prof Wenwu Wang (E-mail: [email protected] ; Net: https://personalpages.surrey.ac.uk/w.wang/ ).

Please word, interviews scheduled to happen week commencing twenty ninth April. 

CVSSP is an Worldwide Centre of Excellence for analysis in Audio-Visible Machine Notion and AI, with over 180 researchers. The Centre has state-of-the-art audio and video seize and evaluation amenities supporting analysis in real-time video and audio processing and visualisation. CVSSP has a compute facility with 200 GPUs and >2PB of high-speed safe storage.

Please word, it’s College Coverage to supply a beginning wage equal to Degree 3.6 (£34,980) to profitable candidates who’ve been awarded, however are but to obtain, their PhD certificates.  As soon as the unique PhD certificates has been submitted to the native HR Division, the wage shall be elevated to Degree 4.1 (£36,024).

View or Apply
To assist us monitor our recruitment effort, please point out in your cowl/motivation letter the place (globalvacancies.org) you noticed this job posting.

Job Location