Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.14 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)
Background: In this manuscript, we introduce SceneScribe, a versatile semi-automatic tool designed for annotating scene graphs within images. SceneScribe empowers human annotators by providing them with the capability to articulate and depict the intricate relationships among participants observed within visual scenes, effectively constructing directed graphs. This functionality serves as a foundational pillar for various downstream tasks in computer vision, including but not limited to image captioning, Visual Question Answering (VQA), and scene graph generation, fostering enhanced learning and reasoning capabilities in these domains.
Method: SceneScribe offers unparalleled flexibility in managing annotations across diverse image datasets. Annotators can choose to consolidate annotations into a single VG150 data-format file, seamlessly integrating with various scene graph models for enhanced compatibility. Alternatively, annotations can be segmented into separate files for individual images, enabling the creation of custom datasets tailored to specific research goals. Additionally, SceneScribe includes an intelligent rule-based algorithm that recommends relationships based on predefined criteria, streamlining the annotation process and reducing the burden associated with manual efforts. This combination of features enhances efficiency and productivity in scene graph annotation tasks.
Results: To demonstrate the practical utility and effectiveness of SceneScribe, we introduce Traffic Genome, a comprehensive scene graph dataset consisting of 1000 diverse traffic images. Through the creation and annotation of Traffic Genome using SceneScribe, we showcase the software's capability to facilitate accurate and efficient scene graph annotation across a range of real-world scenarios and applications.
Conclusion: Overall, SceneScribe represents a valuable tool in the realm of computer vision, offering a user-friendly and adaptable solution for annotating scene graphs in images, thereby advancing research and development in visual understanding and reasoning tasks.
Keywords:
Scene Graphs, Annotations, Relationships, Computer Vision, Automated Annotation System
Cite Article:
"SceneScribe: Automating Video Annotation with Textual Descriptions", International Journal of Science & Engineering Development Research (www.ijrti.org), ISSN:2455-2631, Vol.9, Issue 5, page no.332 - 344, May-2024, Available :http://www.ijrti.org/papers/IJRTI2405049.pdf
Downloads:
000205127
ISSN:
2456-3315 | IMPACT FACTOR: 8.14 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.14 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator