Peixi Xiong

Peixi Xiong

Staff AI Research Scientist/Engineer & Analog Film Photographer

Ph.D. in Computer Vision from Northwestern University, advised by Prof. Ying Wu. Currently a Staff AI Research Scientist/Engineer at Intel Labs, working on multimodal learning, structured reasoning, and agentic AI. Also an analog film photographer shooting medium format landscapes.

Research & Academic

Focused on multimodal learning, structured reasoning, and foundation model applications, including Retrieval-Augmented Generation and agentic AI systems.

  • Published at CVPR, ECCV, EMNLP, ICLR, AAAI, ICCV, and WACV
  • Multiple U.S. patents in visual reasoning, RAG, and generative AI
  • Editorial Advisory Board Member, Information Processing & Management (Elsevier)

Analog Film Photography

Landscape-focused analog photography with ongoing collaborations with film brands, labs, manufacturers, and community features.

  • Medium format and 35mm analog film work
  • Self-developed C-41, E-6, and ECN-2 film processes
  • Collaborations with film brands, labs, manufacturers, and community features

Research Summary

My research sits at the intersection of computer vision and natural language processing. I work on multimodal learning, structured visual reasoning, and foundation model applications — including Retrieval-Augmented Generation (RAG) and agentic AI systems. Currently a Staff AI Research Scientist/Engineer at Intel Labs.

Agentic AI Structured Reasoning Multimodal Learning Visual Question Answering Computer Vision Retrieval-Augmented Generation

Selected Publications

A selection of recent and representative work. Click below to see the full list.

Patents

U.S. patents in visual reasoning, retrieval-augmented generation, and generative AI.

Selected Industry Experience

Prior research internships in computer vision and multimodal AI.

Research Intern
Samsung Research America — Mountain View, CA · Summer 2021
Proposed a hierarchical Transformer for VQA reasoning (MGA-VQA) and developed a decision fusion module for effective multi-Transformer collaboration.
Research Intern — Computer Vision
Microsoft Corporation — Redmond, WA · Summer 2020
Proposed a graph-based architecture for VQA reasoning (SA-VQA) and incorporated multi-head attention mechanisms for enhanced multimodal alignment.
Research Intern
SAIC Innovation Center — San Jose, CA · Summer 2018
Proposed a novel architecture for LiDAR-based autonomous driving performing simultaneous object detection and instance segmentation, with attention-enhanced small object detection.

Academic Service

Editorial Board

  • Editorial Advisory Board Member, Information Processing & Management (Elsevier)

Invited Reviewer

  • CVPR, ICCV, ECCV
  • NeurIPS, ICLR, AAAI, WACV
  • T-PAMI, IJCV

Invited Speaker

  • AIM-2024
  • AIM-2025

Education

Ph.D. in Computer Vision
Northwestern University — Evanston, IL · Advisor: Prof. Ying Wu
2018 – 2022 · Terminal Year Fellowship Recipient
M.S. in Electrical Engineering
Northwestern University — Evanston, IL
2016 – 2018
B.Eng. in Electronic Engineering
The University of Manchester — Manchester, UK
2013 – 2016

Featured Works

Landscape-focused analog photography. Collaborations with film brands, labs, manufacturers, and community features.

More on Instagram →

Gear & Process

Cameras

  • Hasselblad 203FE
  • Pentax 67II

Film Stocks

  • Kodak
  • Fujifilm
  • Harman
  • Lomography

Self-Developed Film

  • C-41
  • E-6
  • ECN-2

Development & Scanning

  • JOBO CPE-3
  • Epson V850 Pro

Contact