Browsing by Author "Wong, Stephan"

Now showing 1 - 8 of 8

Citation - Scopus: 1
Adaptive Embedded Zero Tree for Scalable Video Coding
(int Assoc Engineers-iaeng, 2011) Choupanı, Roya; Choupani, Roya; Tolun, Mehmet Reşit; Wong, Stephan; Tolun, Mehmet R.; Bilgisayar Mühendisliği; Yazılım Mühendisliği
Video streaming over the Internet has gained popularity during recent years mainly due to the revival of video-conferencing and video-telephony applications and the proliferation of (video) content providers. However, the heterogeneous, dynamic, and best-effort nature of the Internet cannot always guarantee a certain bandwidth for an application utilizing the Internet. Scalability has been introduced to deal with such issues (up to a certain point) by adapting the video quality with the available bandwidth. In addition, wavelet based scalability combined with representation methods such as embedded zero trees (EZWs) provides the possibility of reconstructing the video even when only the initial part of the streams have been received. EZW prioritizes the wavelet coefficients based on their energy content. Our experiments however, indicate that giving more priority to low frequency content improves the video quality at a specific bit rate. In this paper, we propose a method to improve on the compression rate of the EZW by prioritizing the coefficients by combining each frequency sub-band with its energy content. Initial experimental show that the first two layers of the generated EZW are about 22.6% more concise.
A Combined Spatial and Frequency Based Texture Model for Organ Segmentation in Computed Tomography Examinations
(Amer Scientific Publishers, 2014) Hassanpour, Reza; Shahbahrami, Asadollah; Wong, Stephan; Jafari, Aref
The organ segmentation in computed tomography (CT) examination is a tedious and error prone task. The local similarity of the pixels from different organs, and the differences between the pixels of the same organ observed in different examinations are two most challenging problems affecting the segmentation process. In this study, statistical and spectral texture properties are combined with the a-priori knowledge about the human body to develop a model for reliably segmenting organs in CT examinations. The main goal of the developed model is fusing local and global statistics to support spatial-frequency analysis and to maximize the simultaneous localization of energy in both spatial and frequency domains. The feature space dimension is reduced by means of a wrapper technique applied as a pre-processing filter. The proposed classifier utilizes a linear combination (ensemble) of two support vector machines (SVM) where the first SVM classifies the input samples according to their textural information and the second one correct the results of the first classifier by searching the spatial information of those samples in a statistical atlas.
Citation - Scopus: 4
A Drift-Reduced Scheme for Hierarchical Wavelet Coding Scalable Video Transmissions
(Ieee, 2009) Choupani, Roya; Wong, Stephan; Tolun, Mehmet R.
Scalable video coding allows for the capability of (partially) decoding a video bitstream when faced with communication deficiencies such as low handwidth or loss of data resulting in lower video quality. As the encoding is usually based on perfectly reconstructed frames, such deficiencies result in differently decoded frames at the decoder than the ones used in the encoder and, therefore, leading to errors being accumulated in the decoder. This is commonly referred to as the drift error. Drift-free scalable video coding methods also suffer from the low performance problem as they do not combine the residue encoding scheme of the current standards such as MPEG-4 and H.264 with scalability characteristics. We propose a scalable video coding method which is based on the motion compensation and residue encoding methods found in current video standards combined with the scalability property of discrete wavelet transform. Our proposed method aims to reduce the drift error while preserving the compression efficiency. Our results show that the drift error has been greatly reduced when a hierarchical structure for frame encoding is introduced.
Hierarchical SNR Scalable Video Coding with Adaptive Quantization for Reduced Drift Error
(2015) Choupani, Roya; Wong, Stephan; Tolun, Mehmet
In video coding, dependencies between frames are being exploited to achieve compression by only coding the differences. This dependency can potentially lead to decoding inaccuracies when there is a communication error, or a deliberate quality reduction due to reduced network or receiver capabilities. The dependency can start at the reference frame and progress through a chain of dependent frames within a group of pictures (GOP) resulting in the so-called drift error. Scalable video coding schemes should deal with such drift errors while maximizing the delivered video quality. In this paper, we present a multi-layer hierarchical structure for scalable video coding capable of reducing the drift error. Moreover, we propose an optimization to adaptively determine the quantization step size for the base and enhancement layers. In addition, we address the trade-off between the drift error and the coding efficiency. The improvements in terms of average PSNR values when one frame in a GOP is lost are 3.70(dB) when only the base layer is delivered, and 4.78(dB) when both the base and the enhancement layers are delivered. The improvements in presence of burst errors are 3.52(dB) when only the base layer is delivered, and 4.50(dB) when both base and enhancement layers are delivered.
Citation - WoS: 4
Citation - Scopus: 7
Multiple Description Coding for Snr Scalable Video Transmission Over Unreliable Networks
(Springer, 2014) Choupani, Roya; Wong, Stephan; Tolun, Mehmet
Streaming multimedia data on best-effort networks such as the Internet requires measures against bandwidth fluctuations and frame loss. Multiple Description Coding (MDC) methods are used to overcome the jitter and delay problems arising from frame losses by making the transmitted data more error resilient. Meanwhile, varying characteristics of receiving devices require adaptation of video data. Data transmission in multiple descriptions provides the feasibility of receiving it partially and hence having a scalable and adaptive video. In this paper, a new method based on integrating MDC and signal-to-noise ratio (SNR) scalable video coding algorithms is proposed. Our method introduces a transform on data to permit transmitting them using independent descriptions. Our results indicate that on average 1.71dB reduction in terms of Y-PSNR occurs if only one description is received.
Citation - WoS: 2
Multiple Description Scalable Coding for Video Transmission Over Unreliable Networks
(Springer-verlag Berlin, 2009) Choupanı, Roya; Choupani, Roya; Tolun, Mehmet Reşit; Wong, Stephan; Tolun, Mehmet R.; Bilgisayar Mühendisliği; Yazılım Mühendisliği
Developing real time multimedia applications for best effort networks such as the Internet requires prohibitions against jitter delay and frame loss. This problem is further complicated in wireless networks as the rate of frame corruption or loss is higher in wireless networks while they generally have lower data rates compared to wired networks. On the other hand, variations of the bandwidth and the receiving device characteristics require data rate adaptation capability of the coding method. Multiple Description Coding (MDC) methods are used to solve the jitter delay and frame loss problems by making the transmitted data more error resilient, however, this results in reduced data rate because of the added overhead. MDC methods do not address the bandwidth variation and receiver characteristics differences. In this paper a new method based on integrating MDC and the scalable video coding extension of H.264 standard is proposed. Our method can handle both jitter delay and frame loss, and data rate adaptation problems. Our method utilizes motion compensating scheme and, therefore, is compatible with the current video coding standards such as MPEG-4 and H.264. Based on the simulated network conditions, our method shows promising results and we have achieved tip to 36dB for average Y-PSNR.
Citation - WoS: 1
Unbalanced Multiple Description Wavelet Coding for Scalable Video Transmission
(Spie-soc Photo-optical instrumentation Engineers, 2012) Choupani, Roya; Wong, Stephan; Tolun, Mehmet
Scalable video coding and multiple description coding are the two different adaptation schemes for video transmission over heterogeneous and best-effort networks such as the Internet. We propose a new method to encode video for unreliable networks with rate adaptation capability. Our proposed method groups three dimensional discrete wavelet transform coefficients in different descriptions and applies a modified embedded zero tree data for rate adaptation. The proposed method optimizes the bit-rates of the descriptions with respect to the channel bit rates and the maximum acceptable distortion. The experimental results in the presence of one description loss indicate that on average the videos at the rate of 1000 Kbit/s are reconstructed with Y-component of peak signal to noise ratio (Y-PSNR) value of 36.2 dB. The dynamic allocation of descriptions to the network channels is optimized for rate distortion minimization. The improvement in term of Y-PSNR achieved by rate distortion optimization has been between 0.7 and 5.3 dB in different bit rates. (c) 2012 SPIE and IS&T. [DOI: 10.1117/1.JEI.21.4.043006]
Using wavelet transform self-similarity for effective multiple description video coding
(IEEE, 2016) Choupani, Roya; Wong, Stephan; Tolun, Mehmet
Video streaming over unreliable networks requires preventive measures to avoid quality deterioration in the presence of packet losses. However, these measures result in redundancy in the transmitted data which is utilized to estimate the missing packets lost in the delivered portions. In this paper, we have used the self-similarity property if the discrete wavelet transform (DWT) to minimize the redundancy and improve the fidelity of the delivered video streams in presence of data loss. Our proposed method decomposes the video into multiple descriptions after applying the DWT. The descriptions are organized in such a way that when one of them is lost during transmission, it is estimated using the delivered portions by means of self-similarity between the DWT coefficients. In our experiments, we compare video reconstruction in the presence of data loss in one or two descriptions. Based on the experimental results, we have ascertained that our estimation method for missing coefficients by means of self-similarity is able to improve the video quality by 2.14dB and 7.26dB in case of one description and two descriptions, respectively. Moreover, our proposed method outperforms the state-of-the-art Forward Error Correction (FEC) method in case of higher bit-rates.