Publications

Published October 17, 2022

MACRONYM: A Large-Scale Dataset for Multilingual and Multi-Domain Acronym Extraction

COLING 2022

Amir Pouran Ben Veyseh, Nicole Meister, David Seunghyun Yoon, Rajiv Jain, Franck Dernoncourt, Thien Huu Nguyen
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published October 17, 2022

Keyphrase Prediction from Video Transcripts: New Dataset and Directions

COLING 2022

Amir Pouran Ben Veyseh, Quan Hung Tran, David Seunghyun Yoon, Varun Manjunatha, Hanieh Deilamsalehy, Rajiv Jain, Trung Bui, Walter W. Chang, Franck Dernoncourt, Thien Huu Nguyen
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published October 14, 2022

MONOPOLY: Financial Prediction from MONetary POLicY Conference Videos Using Multimodal Cues

ACM Multimedia 2022

Puneet Mathur, Atula Tejaswi Neerkaje, Malika Chhibber, Ramit Sawhney, Fu-Ming Guo, Franck Dernoncourt, Sanghamitra Dutta, Dinesh Manocha
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Content Intelligence
  • Natural Language Processing

Published October 10, 2022

Show Me What I Like: Detecting User-Specific Video Highlights Using Content-Based Multi-Head Attention

ACM International Conference on Multimedia

Uttaran Bhattacharya, Gang Wu, Stefano Petrangeli, Vishy Swaminathan, Dinesh Manocha
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Content Intelligence

Published October 5, 2022

StylePortraitVideo: Editing Portrait Videos using StyleGAN

Pacific Graphics 2022

Kwanggyoon Seo, Seoung Wug Oh, Joon-Young Lee, Jingwan (Cynthia) Lu, Seonghyeon Kim, Junyong Noh
  • AI & Machine Learning
  • Computer Vision, Imaging & Video

Published October 2, 2022

ViSRE: A Unified Visual Analysis Dashboard for Proactive Cloud Outage Management

IEEE Working Conference on Software Visualization (VISSOFT)

Paula Kayongo, Jane Hoffswell, Shiv Saini, Shaddy Garg, Eunyee Koh, Haoliang Wang, Tom Jacobs
  • Human Computer Interaction

Published October 1, 2022

VRDoc: Gaze-based Interactions for VR Reading Experience

IEEE International Symposium on Mixed and Augmented Reality (ISMAR)

Geonsun Lee, Jennifer Healey, Dinesh Manocha
  • AR, VR & 360 Photography
  • Document Intelligence
  • Human Computer Interaction

Published September 22, 2022

DocLayoutTTS: Dataset and Baselines for Layout-informed Document-level Neural Speech Synthesis

Interspeech 2022

Puneet Mathur, Franck Dernoncourt, Quan Hung Tran, Jiuxiang Gu, Ani Nenkova, Vlad Morariu, Rajiv Jain, Dinesh Manocha
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published September 18, 2022

Audio Similarity is Unreliable as a Proxy for Audio Quality

Interspeech 2022

Pranay Manocha, Zeyu Jin, Adam Finkelstein
  • Audio

Published September 18, 2022

Filler Word Detection and Classification: A Dataset and Benchmark

23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)

Ge Zhu, Juan-Pablo Caceres, Justin Salamon
  • AI & Machine Learning
  • Audio

Published September 5, 2022

Meta-learning for Adaptive Filters with Higher-order Frequency Dependencies

IEEE International Workshop on Acoustic Signal Enhancement

Junkai Wu, Jonah Casebeer, Nicholas J. Bryan, Paris Smaragdis
  • AI & Machine Learning
  • Audio

Published September 1, 2022

Style Transfer of Audio Effects with Differentiable Signal Processing

Journal of the Audio Engineering Society

Christian Steinmetz, Nicholas J. Bryan, Joshua Reiss
  • AI & Machine Learning
  • Audio

Published August 9, 2022

Can one hear the shape of a neural network?: Snooping the GPU via Magnetic Side Channel

USENIX Security 2022

Henrique Teles Maia, Chang Xiao, Dingzeyu Li, Eitan Grinspun, Changxi Zheng
  • AI & Machine Learning
  • Audio
  • Systems & Languages

Published August 8, 2022

Neural Jacobian Fields: Learning Intrinsic Mappings of Arbitrary Meshes

SIGGRAPH

Noam Aigerman, Kunal Gupta, Vladimir (Vova) Kim, Siddhartha Chaudhuri, Jun Saito, Thibault Groueix
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Graphics (2D & 3D)

Published August 8, 2022

MatFormer: A Generative Model for Procedural Materials

ACM Transactions on Graphics (Proc. SIGGRAPH 2022)

Paul Guerrero, Miloš Hašan, Kalyan Sunkavalli, Radomír Měch, Tamy Boubekeur, Niloy J. Mitra
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Graphics (2D & 3D)

Published August 8, 2022

Moving Level-of-Detail Surfaces

ACM Transactions on Graphics (Proc. SIGGRAPH 2022)

Corentin Mercier, Thibault Lescoat, Pierre Roussillon, Tamy Boubekeur, Jean Thiery
  • Graphics (2D & 3D)

Published August 7, 2022

Clustered Vector Textures

SIGGRAPH 2022

Peihan Tu, Li-Yi Wei, Matthias Zwicker
  • Graphics (2D & 3D)

Published August 3, 2022

Active Exploration for Neural Global Illumination of Variable Scenes

ACM Transactions on Graphics (SIGGRAPH 2022)

Stavros Diolatzis, Julien Philip, George Drettakis
  • AI & Machine Learning
  • Graphics (2D & 3D)

Published August 1, 2022

Node Graph Optimization Using Differentiable Proxies

Siggraph proc.

Yiwei Hu, Paul Guerrero, Miloš Hašan, Holly Rushmeier, Valentin Deschaintre
  • AI & Machine Learning
  • Graphics (2D & 3D)

Published August 1, 2022

Fine Wrinkling on Coarsely Meshed Thin Shells

ACM Transactions on Graphics (SIGGRAPH 2022)

Zhen Chen, Hsiao-Yu Chen, Danny Kaufman, Mélina Skouras, Etienne Vouga
  • AR, VR & 360 Photography
  • Graphics (2D & 3D)
1 2 3 4 5 6 7 101