Honeybee: Locality-enhanced Projector for Multimodal LLMdinov2, clipLLaVA-1.6: Improved reasoning, OCR, and world knowledgeLLaVA-MoE: Mixture of Experts for Large Vision-Language ModelsSPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsTableVQA-Bench: A Visual Question Answering Benchmarks on Multiple Table DomainsInterleaved data(InternLM-XComposer2 & MiniGPT..