I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

296 points by iliashad 12 hours ago


TLDR: I had 2,207 GoPro videos, and I need to rewatch them to find interesting moments from my cycling journey. I built a project to index them locally on my M1 Max using open-source ML models, search for those moments, and send the best clips straight to my DaVinci Resolve timeline. I indexed 628 videos (668.68 GB, 15h 13m 18s of footage duration), more details in the metrics table in the last section of this article.

Full article: https://iliashaddad.com/blog/i-indexed-669-gb-of-my-gopro-videos-using-my-m1-max-computer

asenna - 8 hours ago

Funny this is almost EXACTLY what I did a few days ago on the same machine using very similar techniques and was on the front-page of HN as well:

https://news.ycombinator.com/item?id=48222733 https://blog.simbastack.com/indexed-a-year-of-video-locally/

I wasn't familiar with your project though, interesting stuff.

I'm trying to add more photography related features to Framedex but yeah there's so much we can do locally, exciting times.

justinram11 - 6 hours ago

Something I've enjoyed more than I expected is Google and Apple photos sending me photo memories and compilations of various things in my life and my kids lives over the last decade.

I'm really bullish on taking more video of my kids, with the thought that it will become easier and easier for AI to put them into little compilations I can enjoy later.

esjeon - 5 hours ago

> Then, run the frame analysis pipeline, which will divide the video into separate video scenes (1s each, or 1fps) > (…) > Frames analyzed 57,537

Aha, it makes total sense. This number sounds much more reasonable than “669 GB”, since the actual total size of processed frames would be like 10-30 GB.

(Not downplaying anything. Doing-at-home always requires some math on practicality)

> Total compute time 67h 40m 42s

I’m just curious tho — is there any paying options that can accelerate this kind of process? Just spin up GPU instances?

robrain - 8 hours ago

DaVinci 21 has indexing built-in (AI IntelliSearch). Not to diminish the work you did, but this is now available to many users (probably only Studio users since it has AI in the name)

Beijinger - 11 hours ago

Does it work for porn collections too?

WarOnPrivacy - 9 hours ago

I was surprised to learn that the

    M1 Max CPU is an ARM/SoC, comparable to an 11th gen Intel i9
Do I have it right? Would Windows ARM performance be similar for those cpu?

ref: https://www.cpubenchmark.net/compare/4585vs4245/Apple-M1-Max...

lgats - 11 hours ago

the link https://iliashaddad.com/blog/i-indexed-669-gb-of-my-gopro-vi...

cake-rusk - 6 hours ago

I have an RTX 5090 card but it only has 32 GB RAM, can something like this work on my machine?

tontonius - 7 hours ago

if anyone is interested in searching large video collections local and offline I suggest taking a look at Jumper https://docs.getjumper.io

comes with some nifty features like NLE- integrations, people search, MCP, API etc

Disclaimer: one of the co-founders

asdfasgasdgasdg - 4 hours ago

Cool build but the example videos you provide at the end are . . . not what I would hope for when thinking about the highlights of 2000+ videos of biking? For example the dog barking video only has one scene repeated two or three times and it's five seconds long?

fl0id - 9 hours ago

it is possible to use apple gpu with containers. either with podman + runkit + recent mesa or with recent vllm-metal from docker https://www.docker.com/blog/docker-model-runner-vllm-metal-m...

WhitneyLand - 8 hours ago

I’d like to see embedding of actual video clips become practical in this type of workflow.

Frame level embedding it covering a lot, but can miss out on a lot of action related searches.

wferrell - 7 hours ago

https://iliashaddad.com/blog/i-indexed-669-gb-of-my-gopro-vi...

rho138 - 11 hours ago

This would fit most best as a “Show HN:” post :)

PreownedPlaid - 4 hours ago

this is really cool. was looking to do something similar on mbp 64gb

iliashad - 10 hours ago

I would love your feedback and suggestions for new improvements or features you wanna have, either in the source available version, the desktop app or blog post itself?

synergy20 - 2 hours ago

can vlm be used instead or it's too heavy and slow

- 8 hours ago
[deleted]
m3kw9 - 9 hours ago

Grab frames, lower res, classify, combine meta data. Write to sql

nyxtom - 7 hours ago

Now this ^^ is an awesome use case!

Mawr - 3 hours ago

> Many of the videos I captured amazing moments, and sometimes it's kind of hard to watch the full videos to get those moments.

Yep. I had the same problem.

> Then, run the frame analysis pipeline [...] I have a face recognition plugin using my custom faces data, object detection, on-screen text, shot type, and scene description [...] we will have three vector DB collections that have all the information about our videos, like video location metadata, camera name, faces recognized, objects detected, on-screen text, transcription, description of each scene, and many more [...] we can get better indexed data if you use the advanced mode indexing to use the Qwen2.5-VL-7B-Instruct model to understand and describe your video much better, but at a slower indexing speed

Yeah, uhm... ok :)

If anyone else has a similar problem, the real solution is as follows:

1. When recording, if you witness an interesting moment worth saving later, press the power button — this will mark the current moment in the video as a chapter.

2. Find the chapters later when editing and cut them into clips.

3. You're done :)

This has two main benefits over the insanity above:

1. It's trivially simple instead of insanely complex and inefficient.

2. It will reliably catch all the stuff you find interesting, since you're the one doing the marking.

The downsides:

1. Doesn't work retroactively.

2. It may miss interesting stuff if you miss it at the time as well.

3. Only works for this use case.

4. Nerds won't salivate over your usage of cutting edge tech.

dadachi - 3 hours ago

[flagged]

tosief - 2 hours ago

[dead]

knightops_dev - 8 hours ago

[flagged]

knightops_dev - 9 hours ago

[flagged]

GreenSalem - 2 hours ago

A lawyer I know who specialises in rape, and is excellent at getting the obviously guilty exonerated, lost a case last year because of GoPro videos.

Her client was recording while committing the abhorrent crime. The criminal would otherwise have got off.

From my perspective, the GoPro camera produced a good outcome. Still, one has wonder why anyone to record their criminal actions.