As an MSc student in VeRLab, advised by Prof. Erickson Nascimento, my main research focus is on multi-modal learning techniques that leverage the natural audio-visual correspondence present in many data formats, such as videos. More specifically, I am interested in tasks such as sound source localization, vision-based sound separation, and audio-based video summarization.
I am also part of the Semantic Hyperlapse project in our research group, whose objective is to fast-forward egocentric videos while preserving their semantic information. In our lab, I also happily contribute to projects on Medical Image Analysis and Sports Analytics.
Previously, I worked on extracting local features by learning local representations with CNNs [1] and on developing a platform-independent routing protocol that enables reliable and efficient any-to-any data traffic [2].
MSc in Computer Vision, Current
Universidade Federal de Minas Gerais
BSc in Computer Science, 2019
Universidade Federal de Minas Gerais
The rapid increase in the amount of published visual data and the limited time of users create a demand for processing untrimmed videos into shorter versions that convey the same information. Despite the remarkable progress made by summarization methods, most of them can only select a few frames or skims, which creates visual gaps and breaks the video context. In this paper, we present a novel methodology based on a reinforcement learning formulation to accelerate instructional videos. Our approach adaptively selects and removes frames that are not relevant to conveying the information, without creating gaps in the final video. Our agent is textually and visually oriented to select which frames to remove to shrink the input video. Additionally, we propose a novel network, the Visually-guided Document Attention Network (VDAN), able to generate a highly discriminative embedding space to represent both textual and visual data. Our experiments show that our method achieves the best performance in terms of F1 Score and coverage at the video segment level.
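To make the idea of a textually and visually oriented agent more concrete, the sketch below shows a minimal keep/drop policy that scores pre-extracted frame features against a document embedding in a shared space and samples binary actions, REINFORCE-style. This is not the paper's VDAN implementation; all names, dimensions, and the reward signal are illustrative assumptions.

```python
# Minimal sketch (not the paper's VDAN): a keep/drop policy over frames,
# conditioned on a document embedding, trained with a REINFORCE objective.
import torch
import torch.nn as nn

class FrameSelectionPolicy(nn.Module):
    def __init__(self, frame_dim=2048, text_dim=768, joint_dim=256):
        super().__init__()
        # Project both modalities into a shared (joint) embedding space.
        self.frame_proj = nn.Linear(frame_dim, joint_dim)
        self.text_proj = nn.Linear(text_dim, joint_dim)
        self.head = nn.Linear(2 * joint_dim, 1)  # keep/drop logit per frame

    def forward(self, frame_feats, doc_feat):
        # frame_feats: (T, frame_dim), doc_feat: (text_dim,)
        f = torch.tanh(self.frame_proj(frame_feats))            # (T, joint_dim)
        d = torch.tanh(self.text_proj(doc_feat)).expand_as(f)   # (T, joint_dim)
        logits = self.head(torch.cat([f, d], dim=-1)).squeeze(-1)
        return torch.distributions.Bernoulli(logits=logits)     # keep prob per frame

# Usage: sample a keep/drop mask and reinforce with a reward balancing semantic
# relevance against the target speed-up (the reward here is a placeholder).
policy = FrameSelectionPolicy()
frames, doc = torch.randn(120, 2048), torch.randn(768)
dist = policy(frames, doc)
keep_mask = dist.sample()                 # 1 = keep frame, 0 = drop frame
log_prob = dist.log_prob(keep_mask).sum()
reward = torch.tensor(0.5)                # placeholder reward signal
loss = -(reward * log_prob)               # REINFORCE objective
loss.backward()
```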
Despite the impressive progress being made in autonomous vehicles, human drivers will remain ubiquitous in the coming years. Therefore, intelligent hybrid vehicular systems must be aware of the interactions between humans and the environment (e.g., sound, vibration, speed, etc.). In this paper, we evaluate the effect of acoustic annoyance on drivers in a real-world driving study. We found significant differences in driving styles elicited by annoying acoustics and present an online classifier that uses measurements from the onboard inertial measurement unit to distinguish whether a driver is annoyed with 77% accuracy. Moreover, we directly measured the forces applied on the passenger with a pressure mat lining the car seat and empirically confirmed that our proposed passenger dynamics model is reasonable. However, because the acoustically induced driving styles were not polarizing enough, we were unable to show that passengers' self-reported ride comfort changed with acoustic annoyance.
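As a rough illustration of how an online classifier could be built from inertial measurement unit streams, the sketch below windows synthetic accelerometer and gyroscope signals, computes simple per-axis statistics, and fits a logistic-regression model. The feature set, window length, and labels are assumptions made for demonstration only, not the study's actual pipeline.

```python
# Illustrative sketch only (not the study's classifier): detecting "annoyed"
# driving windows from IMU statistics with a simple logistic-regression model.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def window_features(accel, gyro, window=100):
    """Split raw IMU streams into windows and compute per-axis mean/std features."""
    feats = []
    for start in range(0, len(accel) - window + 1, window):
        a, g = accel[start:start + window], gyro[start:start + window]
        feats.append(np.concatenate([a.mean(0), a.std(0), g.mean(0), g.std(0)]))
    return np.array(feats)

# Synthetic stand-in data: 3-axis accelerometer and gyroscope samples.
rng = np.random.default_rng(0)
accel, gyro = rng.normal(size=(10_000, 3)), rng.normal(size=(10_000, 3))
X = window_features(accel, gyro)
y = rng.integers(0, 2, size=len(X))  # 1 = annoyed window, 0 = baseline (placeholder labels)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print(f"held-out accuracy: {clf.score(X_te, y_te):.2f}")
```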