大象传媒

COMMA

A Cloud Platform for Metadata Extraction

Published: 1 January 2011

The 大象传媒 has a vast catalogue of old TV and Radio programmes it would like to make searchable. COMMA is a platform for processing these cheaply and at scale.

Project from 2013 - 2015

What we are doing

COMMA was a 2-year project funded by the to develop a prototype platform for the extraction of metadata from media archives. The project completed in 2015 and the platform is now in use by the 大象传媒.

It was funded as part of the TSB's initiative. This aims to encourage innovation in the digital economy by funding partnerships between the public sector, business and academia. Our commercial partners in the project were , a London-based content design and creation company, and , an internet development consultancy.

Why it matters

There are many cultural institutions, commercial archives and content creators who have audio, film, photos and video that they would like to put to new uses.

The first step is usually digitisation, but the danger with a big digitisation project is you simply swap out an under-used physical archive for its digital equivalent. Without easy ways to navigate the data there's no way for your users to get to the bits they want.

Luckily, help is at hand in the shape of technologies like , , and a host of other metadata extraction algorithms. These can help unlock the value in media collections by making specific bits within the video or audio instantly findable.

COMMA is a platform that can help process content through algorithms like this cheaply and at scale. It is easy-to-use, fault-tolerant and flexible. It is currently being used in-house by the 大象传媒 for a variety of metadata processing tasks.

Outcomes

The project has now completed. However, we're interested in the needs of other public sector bodies or commercial companies who think they could benefit from this platform.

If you have any press or business-related questions about the project feel free to contact the COMMA team.

-

大象传媒 R&D - Using Algorithms to Understand Content

大象传媒 R&D - Artificial Intelligence In Broadcasting

大象传媒 R&D - Content Analysis Toolkit

Project Team

  • Matt Haynes

    Matt Haynes

    Principal Web Developer
  • James Harrison

    James Harrison

    Software Engineer
  • Dan Nuttall

    Dan Nuttall

    Software engineer
  • Chris Needham

    Chris Needham

    Principal Software Engineer
  • Tristan Ferne

    Tristan Ferne

    Lead Producer
  • Rob Cooper

    Rob Cooper

    Producer
  • Internet Research and Future Services section

    The Internet Research and Future Services section is an interdisciplinary team of researchers, technologists, designers, and data scientists who carry out original research to solve problems for the 大象传媒. 大象传媒 focuses on the intersection of audience needs and public service values, with digital media and machine learning. We develop research insights, prototypes and systems using experimental approaches and emerging technologies.

Rebuild Page

The page will automatically reload. You may need to reload again if the build takes longer than expected.

Useful links

Theme toggler

Select a theme and theme mode and click "Load theme" to load in your theme combination.

Theme:
Theme Mode: