We presents MMICap, a deep-learning-based solution that uses multimodal inputs comprised of scientific images paired with detailed texts to automatically generate captions that effectively summarize the content depicted in these images.
-
Notifications
You must be signed in to change notification settings - Fork 0
We presents MMICap, a deep-learning-based solution that uses multimodal inputs comprised of scientific images paired with detailed texts to automatically generate captions that effectively summarize the content depicted in these images.
License
Prograf-UFF/MMICap
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
We presents MMICap, a deep-learning-based solution that uses multimodal inputs comprised of scientific images paired with detailed texts to automatically generate captions that effectively summarize the content depicted in these images.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published