大象传媒

Natural language processing

Automated tools for text processing and analysis

Published: 1 January 2011

Natural Language Processing (NLP) applies the power of computing to the complexity and nuance of human language. At 大象传媒 R&D, we are exploring how NLP can help us better understand and serve our audiences.

Project from 2011 - present


What we are doing 

Our research focuses on a variety of NLP applications, such as semantic search, summarisation and sentiment analysis. We are interested in both established NLP techniques and emerging methods based on Large Language Models (LLMs).

The 大象传媒 absorbs and creates large amounts of textual material during its day-to-day operations. To help staff exploit this information 大象传媒 R&D has developed several text tools.

Tool Name  

Function

Description

Starfruit Tag suggestion Tag suggestion system based on previous choices by journalists
Citron Quote extraction Quote extraction and attribution system
Vox Abuse detection Detects personal abuse and offensive comments
Emo Sentiment  analysis Predicts the emotional impact of news articles
Yuzu Topic segmentation   Segments news bulletins and magazine programmes by topic
Primo Semantic search & analysis    Applies Large Language Models to document collections

How it works

We use state-of-the-art  techniques and apply Large Language Models to news articles, subtitle streams and speech-to-text transcripts.

Project Team

  • Chris Newell

    Chris Newell

    Lead R&D Engineer
  • Internet Research and Future Services section

    The Internet Research and Future Services section is an interdisciplinary team of researchers, technologists, designers, and data scientists who carry out original research to solve problems for the 大象传媒. 大象传媒 focuses on the intersection of audience needs and public service values, with digital media and machine learning. We develop research insights, prototypes and systems using experimental approaches and emerging technologies.

Rebuild Page

The page will automatically reload. You may need to reload again if the build takes longer than expected.

Useful links

Theme toggler

Select a theme and theme mode and click "Load theme" to load in your theme combination.

Theme:
Theme Mode: