Newswise — Researchers at Soochow University have developed an advanced artificial intelligence model called Dual-view Prompt and Element Correlation (DPEC). This model has outperformed leading methods in accurately extracting spatial relationships from written descriptions. This breakthrough marks a significant advance in natural language processing—a key technology behind self-driving cars, digital mapping, and smart personal assistants.
Decoding Spatial Descriptions: How AI Sees Its Surroundings
Spatial relation extraction is the process by which an AI understands how different objects or locations are positioned relative to each other. For example, the AI can interpret descriptions like “the car behind the building” or “the plane above the clouds.” Such understanding is essential for precise navigation and real-time decision-making in many technology areas.
AI Meets the Autonomous Driving Boom
At the same time, the global autonomous driving market is projected to reach hundreds of billions of dollars by 2030, highlighting the urgent need for highly accurate natural language understanding tools. DPEC’s improved performance indicates that this technology could benefit the rapidly growing autonomous driving industry.
DPEC Outshines Legacy Models
One of the study’s achievements is DPEC’s ability to significantly outperform older models such as R-BERT (a widely-used pre-trained language model) and HMCGR (a leading hybrid model) when tested on the well-known SpaceEval dataset. In addition to its strong performance, DPEC shows exceptional accuracy in interpreting spatial relationships, effectively resolving ambiguous descriptions that have long challenged existing systems. Its strength is accurately identifying dynamic spatial relations— which is critical for real-world applications like autonomous driving, drone navigation, and advanced digital mapping services.
Prof. Qiaoming Zhu, the lead researcher, commented, “Our innovative approach with DPEC not only overcomes the limitations of existing models but also opens up a whole new realm of possibilities in AI-driven spatial understanding, ensuring that technology can reliably interpret even the most complex spatial cues.”
The Dual-View and Element Correlation Advantage
The innovative design of DPEC includes two complementary modules. The first, known as the “Dual-view Prompt,” consists of a “Link Prompt” that helps the model understand the context and a “Confidence Prompt” that clears up ambiguities in the description. The second component, “Element Correlation,” evaluates the consistency among the spatial elements within a description, significantly reducing the misinterpretations common in older approaches.
By directly addressing the weaknesses of earlier methods and offering a clearer understanding of spatial relationships, DPEC sets a new standard in natural language processing. This advancement opens up new opportunities to improve products and services that rely on spatial interpretation, such as autonomous vehicles, location-based applications, and sophisticated digital assistants. The complete study is accessible via DOI: 10.1007/s11704-023-3305-4.
MEDIA CONTACT
Register for reporter access to contact detailsArticle Multimedia

Credit: Feng WANG
Caption: An example of spatial relation extraction where three spatial relations are extracted from the sentences

Credit: Feng WANG
Caption: Overall structure of our model DPEC

Credit: Feng WANG
Caption: Candidate triplet extraction

Credit: Feng WANG
Caption: Three sub-relation distances of spatial elements for BERT (upper line) and DPEC (bottom line)

Credit: Feng WANG
Caption: Trigger distribution of QSLINKs and MOVELINKs
CITATIONS