Knowledge Enhanced Zero-shot Visual Relationship Detection

Code for the paper 'Knowledge Enhanced Zero-shot Visual Relationship Detection'.

Introduction

The model comprises two modules: a logic tensor network (LTN) module that encodes negative-domain semantic and spatial knowledge, and a commonsense knowledge graph (CKG) module, updated by local spatial structure, that supplies positive-domain semantic knowledge. Predictions are further constrained by region connection calculus (RCC).
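To give a feel for what an RCC constraint looks at, here is a minimal sketch that classifies the topological relation between two axis-aligned bounding boxes using the RCC-8 relation names. It is an illustration only, not the repository's implementation: the choice of RCC-8 (rather than another RCC variant), the (x1, y1, x2, y2) box format, and the function name rcc8 are all assumptions.

```python
def rcc8(a, b):
    """Return the RCC-8 relation of box a to box b.

    Boxes are (x1, y1, x2, y2) with x1 < x2 and y1 < y2 (assumed format),
    treated as closed regions.
    """
    a, b = tuple(a), tuple(b)
    ax1, ay1, ax2, ay2 = a
    bx1, by1, bx2, by2 = b

    if a == b:
        return "EQ"      # identical regions
    # No point in common at all.
    if ax2 < bx1 or bx2 < ax1 or ay2 < by1 or by2 < ay1:
        return "DC"      # disconnected
    # Boundaries touch but interiors do not overlap.
    if ax2 == bx1 or bx2 == ax1 or ay2 == by1 or by2 == ay1:
        return "EC"      # externally connected

    a_in_b = bx1 <= ax1 and ax2 <= bx2 and by1 <= ay1 and ay2 <= by2
    b_in_a = ax1 <= bx1 and bx2 <= ax2 and ay1 <= by1 and by2 <= ay2
    # A contained box is "tangential" if it shares part of the outer boundary.
    touches = ax1 == bx1 or ax2 == bx2 or ay1 == by1 or ay2 == by2
    if a_in_b:
        return "TPP" if touches else "NTPP"
    if b_in_a:
        return "TPPi" if touches else "NTPPi"
    return "PO"          # partially overlapping
```

For example, rcc8((1, 1, 2, 2), (0, 0, 3, 3)) yields "NTPP" (the first box lies strictly inside the second). A constraint of this kind could, under these assumptions, rule out predicates that are implausible for a given spatial configuration.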

Using Code

The models folder contains the trained grounded theories from the experiments.
The Visual-Relationship-Detection-master folder contains the object detector model and the evaluation code from https://github.com/Prof-Lu-Cewu/Visual-Relationship-Detection, used to evaluate the phrase, relationship, and predicate detection tasks on the VRD dataset.
The data folder contains the dataset, which can be downloaded from https://cs.stanford.edu/people/ranjaykrishna/vrd/
ConceptNet (Numberbatch embeddings) can be downloaded from https://github.com/commonsense/conceptnet-numberbatch
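As a rough sketch of how the downloaded ConceptNet Numberbatch embeddings could be read, the snippet below parses the word2vec-style text format (a header line with the entry count and dimension, then one term and its vector per line). The function names load_numberbatch and cosine, the optional vocab filter, and the file path in the usage comment are assumptions for illustration; this is not how the repository itself loads ConceptNet.

```python
import math


def load_numberbatch(lines, vocab=None):
    """Parse word2vec-style text lines into a {term: [float, ...]} dict.

    lines: iterable of strings; the first is a header "<count> <dim>".
    vocab: optional set of terms to keep (hypothetical convenience filter).
    """
    it = iter(lines)
    dim = int(next(it).split()[1])
    vectors = {}
    for line in it:
        parts = line.rstrip().split(" ")
        term, values = parts[0], parts[1:]
        if vocab is not None and term not in vocab:
            continue
        if len(values) != dim:
            continue  # skip malformed rows
        vectors[term] = [float(v) for v in values]
    return vectors


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(u, v))
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(y * y for y in v))
    return dot / (nu * nv) if nu and nv else 0.0


# Usage with a downloaded file (the path is an assumption):
#   import gzip
#   with gzip.open("numberbatch-en.txt.gz", "rt", encoding="utf-8") as f:
#       vecs = load_numberbatch(f, vocab={"person", "horse"})
```

Passing a vocab set keeps memory use small when only the object and predicate names of the dataset are needed.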

Requirements

Install the packages needed for training with:

$ pip install -r requirements.txt

Training

Use the complete model:

$ python train_all.py

Use LTNs:

$ python train.py

Use the model without spatial knowledge:

$ python train_mul.py

Use the model without the CKG module:

$ python train_RCC.py

Trained models are saved in the models folder as KB_wc_2500.ckpt (with constraints). The number in the filename (2500) is the number of training iterations, which is set by a parameter in the code.
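The naming convention above can be sketched as a pair of small helpers: one builds the checkpoint path for a given iteration count, the other recovers the count from a filename. The helper names and the default models directory argument are hypothetical; the training scripts may construct the path differently.

```python
import re
from pathlib import Path


def checkpoint_path(iterations, models_dir="models"):
    """Build the checkpoint path for a run of the given length (assumed layout)."""
    return Path(models_dir) / f"KB_wc_{iterations}.ckpt"


def iterations_from_name(name):
    """Extract the iteration count from a KB_wc_<n>.ckpt filename, else None."""
    match = re.fullmatch(r"KB_wc_(\d+)\.ckpt", Path(name).name)
    return int(match.group(1)) if match else None
```

This makes it easy to check which iteration setting produced a checkpoint before pointing the evaluation scripts at it.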

Evaluating

To run the evaluation, use the following commands:

$ python predicate_detection_mul.py
$ python relationship_phrase_detection_mul.py

Then launch MATLAB, change into the Visual-Relationship-Detection-master folder, and run the scripts predicate_detection_LTN.m and relationship_phrase_detection_LTN.m to see the results.

Acknowledgement

This repository is based on references [3] and [5] of our paper:

[3] Chen, J., He, H., Wu, F., Wang, J.: Topology-aware correlations between relations for inductive link prediction in knowledge graphs. In: AAAI. vol. 35, pp. 6271–6278 (2021).

[5] Donadello, I., Serafini, L.: Compensating supervision incompleteness with prior knowledge in semantic image interpretation. In: IJCNN. pp. 1–8. IEEE (2019).
