Current state-of-the-art image segmentation approaches use convolutional decoders to represent objects [1]. Inspite of their success, even when representing simple shapes, the standard decoders involve computing many intermediate feature representation (at different resolutions) and consist of 1M+ parameters. This results in siginificant increase in resource consumption (computation and GPU memory), specially when working with image volumes. As an alternative to convolutional decoders, we would like to explore the possibility of an alternative object representation technique in this project.
The idea is inspired by the Fourier transform in which we try to approximate a signal as a weighted sum of basis functions. Based on this idea, [2] use weighted sum of spherical harmonic functions to represent 3D objects. Extending the idea, [3, 4] uses improved version of the same representation. For instance, [3] demonstrate how an Amygdala (in human brain) can be approximated as a weighted sum of basis functions (see Fig. 1).
The project has two steps,
Datasets:
References:
Back to the project list.
The candidate should have programming experience, ideally in Python and Matlab. Previous experience with machine learning, singal processing and/or computer vision is a plus.
40% Theory, 30% Implementation, 30% Research and experiments
For further information,
send an e-mail.
Contacts:
Udaranga Wickramasinghe (office BC 304)