Recent RGB-D cameras can provide high-quality, synchronized color and depth images. With its advanced sensing capabilities, this technology offers an opportunity to substantially improve object detection performance. However, depth data has poor texture, and small, distant objects have low spatial resolution; both make it difficult to develop expressive features for the depth channel. Moreover, objects have a distinct depth contour and shape, which is different from ...