Home
Softono
EgoLife

EgoLife

Open source Python
436
Stars
19
Forks
12
Issues
8
Watchers
1 year
Last Commit

About EgoLife

EgoLife is a CVPR 2025 research project focused on developing an egocentric life assistant. It features a comprehensive multimodal dataset capturing daily activities of six participants over a week using Meta Aria glasses, synchronized third-person cameras, and mmWave sensors. This rich resource supports long-term video understanding and real-world AI applications. The project introduces two core models: EgoGPT, an omni-modal vision-language model fine-tuned for egocentric scenarios that performs continuous video captioning to extract key events, actions, and context from first-person video and audio streams; and EgoRAG, a retrieval-augmented generation module enabling long-term reasoning and memory reconstruction. EgoRAG utilizes a hierarchical memory bank with hourly and daily summaries to retrieve time-stamped relevant past events for context-aware question answering. The system assists users with memory support, habit tracking, event recall, and task management. EgoLife provides public access to the datas

Platforms

Web Self-hosted

Languages

Python

The EgoLife Project

           

teaser.png

Figure 1. The Overview of EgoLife Project. EgoLife is an ambitious egocentric AI project capturing multimodal daily activities of six participants over a week. Using Meta Aria glasses, synchronized third-person cameras, and mmWave sensors, it provides a rich dataset for long-term video understanding. Leveraging this dataset, the project enables AI assistantsโ€”powered by EgoGPT and EgoRAGโ€”to support memory, habit tracking, event recall, and task management, advancing real-world egocentric AI applications.

๐Ÿš€ News

๐Ÿคน 2025-02: We provide [HuggingFace gradio demo]() and [self-deployed demo]() for EgoGPT.

๐ŸŒŸ 2025-02: The EgoLife video is released at HuggingFace and uploaded to Youtube as video collection.

๐ŸŒŸ 2025-02: We release the EgoIT-99K dataset at HuggingFace.

๐ŸŒŸ 2025-02: We release the first version of EgoGPT and EgoRAG codebase.

๐Ÿ“– 2025-02: Our arXiv submission is currently on hold. For an overview, please visit our academic page.

๐ŸŽ‰ 2025-02: The paper is accepted to CVPR 2025. Please be invited to our online EgoHouse.

What is in this repo?

๐Ÿง  EgoGPT: Clip-Level Multimodal Understanding

EgoGPT is an omni-modal vision-language model fine-tuned on egocentric datasets. It performs continuous video captioning, extracting key events, actions, and context from first-person video and audio streams.

Key Features:

  • Dense captioning for visual and auditory events.
  • Fine-tuned for egocentric scenarios (optimized for EgoLife data).

๐Ÿ“– EgoRAG: Long-Context Question Answering

EgoRAG is a retrieval-augmented generation (RAG) module that enables long-term reasoning and memory reconstruction. It retrieves relevant past events and synthesizes contextualized answers to user queries.

Key Features:

  • Hierarchical memory bank (hourly, daily summaries).
  • Time-stamped retrieval for context-aware Q&A.

๐Ÿ“‚ Code Structure

EgoLife/
โ”‚โ”€โ”€ assets/                # General assets used across the project
โ”‚โ”€โ”€ EgoGPT/                # Core module for egocentric omni-modal model
โ”‚โ”€โ”€ EgoRAG/                # Retrieval-augmented generation (RAG) module
โ”‚โ”€โ”€ README.md              # Main documentation for the overall project

Please dive in to the project of EgoGPT and EgoRAG for more details.

๐Ÿ“ข Citation

If you use EgoLife in your research, please cite our work:

@misc{yang2025egolifeegocentriclifeassistant,
      title={EgoLife: Towards Egocentric Life Assistant}, 
      author={Jingkang Yang and Shuai Liu and Hongming Guo and Yuhao Dong and Xiamengwei Zhang and Sicheng Zhang and Pengyun Wang and Zitang Zhou and Binzhu Xie and Ziyue Wang and Bei Ouyang and Zhengyu Lin and Marco Cominelli and Zhongang Cai and Yuanhan Zhang and Peiyuan Zhang and Fangzhou Hong and Joerg Widmer and Francesco Gringoli and Lei Yang and Bo Li and Ziwei Liu},
      year={2025},
      eprint={2503.03803},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.03803}, 
}

๐Ÿ“ License

This project is licensed under the S-Lab license. See the LICENSE file for details.

Star History

Star History Chart