SciCap_ai_banner.png

<aside> 🔗 Link to this page: SciCap.AI

</aside>

<aside> 🔗 This year, the challenge will be hosted in the 5th Workshop on Closing the Loop Between Vision and Language at ICCV 2023 (October 2-3, Paris, France).

</aside>

<aside> 🚨 We released our baseline code!

</aside>

<aside> 📢 The hidden test set was released — the Challenge Phase begins!

</aside>

<aside> 📢 Dear SciCap Challenge Participating Teams, please complete the Google Form to submit a 2-4 page report detailing your system. (Deadline: Sep 4 Sep 5, 2023 AOE) — Google Form link: https://forms.gle/17rhc8UWTq8pD4yD8

</aside>

1. Challenge Overview

Join the 1st Scientific Figure Captioning (SciCap) Challenge! We will supply each team with approximately 400,000 scientific figure images from various arXiv papers, including their respective captions and relevant paragraphs. Teams will then use these data to build computational models to generate captions for these images. Whether you are working alone or as a team, we welcome researchers, AI/NLP/CV practitioners, and anyone interested in computational models for generating useful text for visuals to participate and submit their results.

This year, the challenge will be hosted in the 5th Workshop on Closing the Loop Between Vision and Language at ICCV 2023 (October 2-3, Paris, France).

Check out the details of the challenge, including data, code, baselines, evaluation criteria, and important dates. We eagerly await your participation in the 1st SciCap Challenge!

For questions about the challenge, email us at [email protected].

Zoom Office Hour

The organizers will host a 30-minute-long Zoom office hour to answer all kinds of questions. Please do not hesitate to join us!

One Challenge, Two Phases

The challenge consists of two stages: the Test Phase and the Challenge Phase.