Repositories | Songyang Zhang

Sy-Zhang

Python

Code for the EMNLP 2022 paper "Learning a Grammar Inducer by Watching Millions of Instructional YouTube Videos."

EMNLP 2022

grammar induction instructional video

mugen-org

Python

Training, evaluation, and inference code for multimodal video-audio-text generation and retrieval baselines on MUGEN.

ECCV 2022

multimodal generation retrieval

Sy-Zhang

Python

Video-aided unsupervised grammar induction, awarded NAACL 2021 Best Long Paper.

NAACL 2021

grammar induction best paper

microsoft

Python

A collection of video cross-modal models, including code for expanding language-image pretrained models to video.

ECCV 2022

video recognition cross-modal learning

Sy-Zhang

Python

Code for the ACM MM 2019 paper "Exploiting Temporal Relationships in Video Moment Localization with Natural Language."

ACM MM 2019

moment localization video-language

Sy-Zhang

Lua

Code for the WACV 2017 paper "On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks."

WACV 2017

action recognition skeleton features

Sy-Zhang

MATLAB

Reimplementation of "LIME: A Method for Low-light Image Enhancement" from ACM MM 2016.

ACM MM 2016

image enhancement low-light imaging