Hi, thank you for the great work on this project!
I would like to ask whether there is an example or reference that demonstrates how different data modalities are represented and organized in practice, especially for multimodal settings.
If an existing example (e.g., a toy dataset, demo CSV file, or code snippet) illustrates this, a pointer to it would be very helpful.
Thank you very much for your time and help!