SIV864/PHM 961 Special Module on Multimedia Processing and Communication

Potential Projects

Suggested topics for projects: These are only suggested topics for the projects. One can choose other topics that are related to the theme of the course. The project can be done by a group of two (MAX) students. Each group must inform by email about the names of the students in the group, topic of your project and schedule/plan of execution by Nov 12, 2018 to pkkiitd@gmail.com. The outcome of the project can include at-least one of the following:

The evaluation of the project will be done on:

The following is only a suggested list of projects. You can select a project outside this list. PLEASE NOTE THAT YOUR SELECTION OF A PROJECT SHOULD NOT OVERLAP WITH YOUR OTHER PROJECTS

  1. Audio/Sound Enhancement: Implement and compare some basic audio restoration techniques (noise removal, echo cancellation); and survey commercially/public-domain available products and compare their performances.
  2. Digital Music Processing: Aspects related to representation, processing, analysis, synthesis and understanding of digital music. The project may indicate some public domain software tools which can be used for these operations. There are several places where a course on Digital Music is taught, the project may list some of these.
  3. Speaker Identification: Review the principle of speaker identification in the literature, survey products available commercially and compare their performances and applications.
  4. Speech Recognition: Review the principle of speech recognition in the literature, survey products available commercially and compare their performances and applications.
  5. JPEG Implementation: Implement JPEG image coding and compare with some existing encoders.
  6. JPEG 2000: Describe the principles of Image Coding in JPEG 2000 and compare features and intended applications vis-a-vis JPEG.
  7. Image Enhancement/Processing: Implement and compare some basic image processing techniques (noise removal, segmentation); and survey commercially (or public-domain) available products and compare their performances.
  8. Content Based Image Retrieval: Review the principle of content based image retrieval (techniques that use visual contents to search images) in the literature. Survey products available commercially and compare their performances and applications. A reference is here.
  9. Face Detection and Recognition: Survey algorithms for face detection and/or recognition and compare their performances. For face recognition can visit here.
  10. Study and Implementation of Motion Estimation (used in MPEG): Implement some algorithms for Motion Estimation and compare their performances.
  11. Low Bit Video Conferencing: Study and review of techniques for video conferencing on low bit rate for the task of face-to-face communication. A reference is here.
  12. Video on Internet: Implement a simple video application (e.g. streaming a video from one computer to another, with or without transcoding) using the Java Media Framework (http://java.sun.com/products/java-media/jmf). The JMF has libraries for many audio/video coding standards and RTP/RTCP. You must know JAVA programming.
  13. Multimedia Communication Systems: Survey commercial products for different video applications over different networks/access links (video-on-demand, video streaming, video conferencing, video telephony, over wired/wireless modem, ADSL, cable, satellite, LAN, fiber, etc.).
  14. QoS for Video Applications: Study and describe the differences in QoS requirements (delay vs. quality) for different video applications (video telephony, video conferencing, video streaming, video-on-demand), and surveying Internet Protocols enabling real-time applications (RTP/RSVP/RTSP, etc.)
  15. Multimedia Watermarking: A watermark is a code which can be embedded in the host data to help copyright protection. [ Reference gives a good overview]. The study may include discussion/implementation on a specific media (audio/image/video etc.) with some pointers to commercial products available.
  16. Multiple Descriptions Coding (MDC): Review the techniques and applications of MDC. MDC is a coding technique which fragments a single media stream into n independent sub streams (n >= 2) referred to as descriptions. The idea of MDC is to provide error resilience to media streams. A reference is here.
  17. Scalable Video Coding (SVC): Basic approach in SVC and its applications. A reference is here.
  18. Multimedia application development on smart phones:
  19. Geometry Processing and Compression: Review the principles of compression techniques for geometry data. A reference is here.
  20. Streaming Animation Data: Review the principles of streaming techniques for 3D animation data. A reference is here.
  21. SIP Related Project:

For selection of a topic you may refer to the following Journals:

1. The ACM Transactions on Multimedia Computing, Communications, and Applications
2. IEEE Transaction on Multimedia
3. IEEE Multimedia