Abstract:With the rapid development of computer science and technology, the application of real-time image processing in embedded field is becoming more and more extensive. However, traditional hardware has not been parallelized because of its own structure. Referring to the issue that high performance image processing solution required on video surveillance, machine vision, video compression, medical image analysis, the solution based on OpenCL software model and FPGA heterogeneous mode is proposed to solve the problem of high performance computing, to realize the image display and OpenCL acceleration, the algorithm is used to analyze the parallelism based on the Sobel edge detection as the research object, using the OpenCL acceleration of the kernel algorithm, which is compared with basic ARM platform and OpenCL shared memory acceleration mechanism, launching performance test, then studying the acceleration effect. It is experimentally showed that the use of the system to deal with different resolution images, OpenCL accelerating subsystem processing will be about 100 times better performance improvement than based on the on-chip ARM hardcore software processing, under achieving the same functions.