Skip to content
标签
Theory of Mind
LLM
Intention Analysis
Words
248 字
Reading Time
2 分钟

The idea below is rough and still in its early stages, as it originated entirely from the insights in paper [1]. It requires a thorough literature review and discussions with experts to further develop it.

Background:

Paper [1] provides a exciting categorization of the belief inference and goal inference in your proposed question types, which defines the atomic inference in complex actions. In reality, human intentions are often complex and layered, involving hierarchical goals (e.g. “find the water glass” as part of “boil the milk”). This complexity presents challenges that humans do not always approach tasks in a strictly subgoal-by-subgoal manner. Instead, they frequently adapt, reprioritize, and make adjustments based on context. This poses challenges to the perceive and adapt to the contextual shift.

Key Challenges:

  • Hierarchical Goals: Recognizing that goals often contain nested subgoals (like “finding a water glass” as part of “boiling milk”) provides a realistic depiction of human goal-seeking
  • Real-time Goal Shifts: The intentions and goals are often flexible, and contextually driven rather than rigidly following a linear subgoal process. Given this complexity, achieving adaptive systems requires a model that can account for contextual shifts, reprioritization, and situational awareness

Method:

Build (Modify) the benchmark first...

References

[1] MMToM-QA: Multimodal Theory of Mind Question Answering, 202401, JHU


It is highly relevant to the multi-agent social simulation work I'm interested in. I'll continue it in my spare time.

--Edited on 2024/10/30

Contributors

The avatar of contributor named as Hua Tang Hua Tang

File History

Written by Normal Person

布局切换

调整 VitePress 的布局样式,以适配不同的阅读习惯和屏幕环境。

全部展开
使侧边栏和内容区域占据整个屏幕的全部宽度。
全部展开,但侧边栏宽度可调
侧边栏宽度可调,但内容区域宽度不变,调整后的侧边栏将可以占据整个屏幕的最大宽度。
全部展开,且侧边栏和内容区域宽度均可调
侧边栏宽度可调,但内容区域宽度不变,调整后的侧边栏将可以占据整个屏幕的最大宽度。
原始宽度
原始的 VitePress 默认布局宽度

页面最大宽度

调整 VitePress 布局中页面的宽度,以适配不同的阅读习惯和屏幕环境。

调整页面最大宽度
一个可调整的滑块,用于选择和自定义页面最大宽度。

内容最大宽度

调整 VitePress 布局中内容区域的宽度,以适配不同的阅读习惯和屏幕环境。

调整内容最大宽度
一个可调整的滑块,用于选择和自定义内容最大宽度。

聚光灯

支持在正文中高亮当前鼠标悬停的行和元素,以优化阅读和专注困难的用户的阅读体验。

ON开启
开启聚光灯。
OFF关闭
关闭聚光灯。

聚光灯样式

调整聚光灯的样式。

置于底部
在当前鼠标悬停的元素下方添加一个纯色背景以突出显示当前鼠标悬停的位置。
置于侧边
在当前鼠标悬停的元素旁边添加一条固定的纯色线以突出显示当前鼠标悬停的位置。