Ethan's Homepage

2.

Date of Prediction: Feb 24, 2024
Status: Fulfilled
Content: There will be a visual tokenizer for NLP tasks
What Makes for Good Visual Tokenizers for Large Language Models?
We empirically investigate proper pre-training methods to build good visual tokenizers, making Large Language Models (LLMs) powerful Multimodal Large Language Models (MLLMs). In our benchmark,...
https://arxiv.org/abs/2305.12223
Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for...
The tokenizer, as one of the fundamental components of large models, has long been overlooked or even misunderstood in visual tasks. One key factor of the great comprehension power of the large...
https://arxiv.org/abs/2403.18593
 
Copyright 2025 Ethan Wang