Alex Martin

alexmartin1722

AI & ML interests

None yet

Recent Activity


New activity in hltcoe/wikivideo 3 months ago

update wrong videos

#3 opened 3 months ago by

updated a dataset 3 months ago

hltcoe/wikivideo

Viewer • Updated Nov 13, 2025 • 1.71k • 27 • 4

upvoted a paper 4 months ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10, 2025 • 51

liked a Space 5 months ago

FineVision: Open Data is All You Need

A new open-source dataset for training VLMs

upvoted a paper 7 months ago

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Paper • 2506.22724 • Published Jun 28, 2025 • 10

upvoted a paper 9 months ago

Certified Mitigation of Worst-Case LLM Copyright Infringement

Paper • 2504.16046 • Published Apr 22, 2025 • 13

upvoted 3 papers 10 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published Apr 7, 2025 • 15

Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10, 2025 • 30

liked a dataset 10 months ago

hltcoe/MultiVENT2.0

Preview • Updated 9 days ago • 172 • 7

upvoted a collection 10 months ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated 4 days ago • 78

authored 4 papers 10 months ago

MegaWika: Millions of reports and their sources across 50 diverse languages

Paper • 2307.07049 • Published Jul 13, 2023

Grounding Partially-Defined Events in Multimodal Data

Paper • 2410.05267 • Published Oct 7, 2024

MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval

Paper • 2410.11619 • Published Oct 15, 2024 • 1

WikiVideo: Article Generation from Multiple Videos

Paper • 2504.00939 • Published Apr 1, 2025 • 37

updated a collection 10 months ago

MultiVENT and MAGMAR Resources

Resources associated with the MultiVENT datasets, MAGMAR workshop, and other video retrieval and multimodal retrieval augmented generation • 5 items • Updated Apr 4, 2025 • 1

upvoted a collection 10 months ago

MultiVENT and MAGMAR Resources

Resources associated with the MultiVENT datasets, MAGMAR workshop, and other video retrieval and multimodal retrieval augmented generation • 5 items • Updated Apr 4, 2025 • 1
