The test setup: video material and targeted questions

Post by Reddi2 »

The test used a video on renewable energy that is more than 43 minutes long (link to video). What makes this test distinctive is that the video presents information both in the narration and on screen, and some visual details are never mentioned verbally. Two targeted questions were asked to test the capabilities of Gemini 1.5:

Question number 1: “Where in the video can you see a green stone?”

This question tests Gemini 1.5's ability to identify visual elements in a long video and indicate exactly when that element appears.

Question number 2: “What color is the jacket of the woman with the red glasses and where can you find her in the video?”

This question aims to test the accuracy of Gemini 1.5 in identifying specific details about people in a video.

The answers provided by Gemini 1.5 were impressively precise:

The green stone was located at 19:30 in the video.
The woman with the red glasses was wearing a yellow jacket and was found at the time marks 27:18 and 28:36.

These results show that Gemini 1.5 can extract visual and contextual information from a long video and attribute it to specific time marks. The large context window allows the model to retain details and attribute them accurately across the full length of the video.
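
If you want to check answers like these yourself, it helps to pull the time marks out of the model's reply and convert them to seconds so you can jump to the right spot in the video. Below is a minimal sketch of that step. The helper names (`to_seconds`, `extract_marks`) and the `VIDEO_LENGTH_SECONDS` constant are my own assumptions for illustration, not part of the original test, and the video length is only known to be "more than 43 minutes".

```python
import re

# Assumption: the post only says the video is "more than 43 minutes" long,
# so 43 minutes is used here as a lower bound on its length.
VIDEO_LENGTH_SECONDS = 43 * 60


def to_seconds(mark: str) -> int:
    """Convert an "MM:SS" or "HH:MM:SS" time mark to a number of seconds."""
    parts = [int(p) for p in mark.split(":")]
    if not 2 <= len(parts) <= 3 or any(p >= 60 for p in parts[1:]):
        raise ValueError(f"not a valid time mark: {mark!r}")
    seconds = 0
    for part in parts:
        seconds = seconds * 60 + part
    return seconds


def extract_marks(answer: str) -> list[str]:
    """Find every time mark such as 19:30 or 1:02:03 in a text answer."""
    return re.findall(r"\b\d{1,2}:\d{2}(?::\d{2})?\b", answer)


if __name__ == "__main__":
    # The two answers as reported in the post.
    answers = [
        "The green stone was located at 19:30 in the video.",
        "The woman with the red glasses was found at 27:18 and 28:36.",
    ]
    for answer in answers:
        for mark in extract_marks(answer):
            seconds = to_seconds(mark)
            # A mark below the lower bound is certainly inside the video.
            inside = seconds <= VIDEO_LENGTH_SECONDS
            print(f"{mark} -> {seconds} s, within first 43 minutes: {inside}")
```

This only checks that the reported time marks are plausible positions in the video. Whether the green stone or the yellow jacket is really visible at those positions still has to be confirmed by watching the video.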