You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+34-3
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# CapDec: Text-Only Training for Image Captioning using Noise-Injected CLIP
2
2
3
3
4
-
## Official implementation for the paper ["CapDec: Text-Only Training for Image Captioning using Noise-Injected CLIP"](https://arxiv.org/abs/2211.00575) (EMNLP 2022).
4
+
## Official implementation for the paper ["CapDec: Text-Only Training for Image Captioning using Noise-Injected CLIP"](https://arxiv.org/abs/2211.00575), EMNLP 2022 (findings).
parser.add_argument('--data', default='clip_embedding.pkl', help='path to clip embeddings of captions generated by the attached embeddings_generator script')
398
+
parser.add_argument('--val_pt', default='', help='path to clip embeddings of validations set')
399
+
parser.add_argument('--pretrain_weights', default='', help='path to pretrained weights, if not specified, will train from scratch')
400
+
parser.add_argument('--out_dir', default='./checkpoints', help='path to output directory')
401
+
parser.add_argument('--add_modality_offset', dest='add_modality_offset', action='store_true', default=False, help='train with modality offset that was pre calculated at others/CLIP_embeddings_centers_info.pkl')
402
402
parser.add_argument('--prefix', default='coco_prefix', help='prefix for saved filenames')
parser.add_argument('--only_prefix', dest='only_prefix', action='store_true', default=False, help='train only the mapper between CLIP and GPT, while GPT is frozen')
413
+
parser.add_argument('--mapping_type', type=str, default='transformer', help='type of architurctre between CLIP and GPT (mlp/transformer)')
414
+
parser.add_argument('--num_layers', type=int, default=8, help='number of layers in the mapper')
415
+
parser.add_argument('--is_not_rn', dest='is_not_rn', action='store_true', default=False, help='Choose the CLIP backbone: False for RN, True for ViT')
416
+
parser.add_argument('--use_image_embedding_as_clipcap', dest='use_image_embedding_as_clipcap', action='store_true', default=False, help='use image embedding as ClipCap')
0 commit comments