<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html><head><meta http-equiv="Content-Type" content="text/html;charset=iso-8859-1"> <title>Sse_optimizations</title> <link href="doxygen.css" rel="stylesheet" type="text/css"> <link href="tabs.css" rel="stylesheet" type="text/css"> </head><body> <div align="left"><a href="http://www.mrpt.org/">Main MRPT website</a> > <b>C++ reference</b> </div> <div align="right"> <a href="index.html"><img border="0" src="mrpt_logo.png" alt="MRPT logo"></a> </div> <!-- Generated by Doxygen 1.7.5 --> <script type="text/javascript"> var searchBox = new SearchBox("searchBox", "search",false,'Search'); </script> <div id="navrow1" class="tabs"> <ul class="tablist"> <li><a href="index.html"><span>Main Page</span></a></li> <li><a href="pages.html"><span>Related Pages</span></a></li> <li><a href="modules.html"><span>Modules</span></a></li> <li><a href="namespaces.html"><span>Namespaces</span></a></li> <li><a href="annotated.html"><span>Classes</span></a></li> <li><a href="files.html"><span>Files</span></a></li> <li> <div id="MSearchBox" class="MSearchBoxInactive"> <div class="left"> <form id="FSearchBox" action="search.php" method="get"> <img id="MSearchSelect" src="search/mag.png" alt=""/> <input type="text" id="MSearchField" name="query" value="Search" size="20" accesskey="S" onfocus="searchBox.OnSearchFieldFocus(true)" onblur="searchBox.OnSearchFieldFocus(false)"/> </form> </div><div class="right"></div> </div> </li> </ul> </div> </div> <div class="header"> <div class="summary"> <a href="#func-members">Functions</a> </div> <div class="headertitle"> <div class="title">Sse_optimizations</div> </div> </div> <div class="contents"> <hr/><a name="details" id="details"></a><h2>Detailed Description</h2> <p>SSE optimized functions. </p> <table class="memberdecls"> <tr><td colspan="2"><h2><a name="func-members"></a> Functions</h2></td></tr> <tr><td class="memItemLeft" align="right" valign="top">void </td><td class="memItemRight" valign="bottom"><a class="el" href="group__sse__optimizations.html#ga4829c652864d5e5bfc8d91e1993930aa">image_SSSE3_scale_half_3c8u</a> (const uint8_t *in, uint8_t *out, int w, int h)</td></tr> <tr><td class="mdescLeft"> </td><td class="mdescRight">Subsample each 2x2 pixel block into 1x1 pixel, taking the first pixel & ignoring the other 3. <a href="#ga4829c652864d5e5bfc8d91e1993930aa"></a><br/></td></tr> <tr><td class="memTemplParams" colspan="2">template<bool IS_RGB> </td></tr> <tr><td class="memTemplItemLeft" align="right" valign="top">void </td><td class="memTemplItemRight" valign="bottom"><a class="el" href="group__sse__optimizations.html#ga23e43b45f41b2d80acf48341f986e4af">private_image_SSSE3_rgb_or_bgr_to_gray_8u</a> (const uint8_t *in, uint8_t *out, int w, int h)</td></tr> <tr><td class="memItemLeft" align="right" valign="top">void </td><td class="memItemRight" valign="bottom"><a class="el" href="group__sse__optimizations.html#ga55b830ae32a3393bf57af1c782096eed">image_SSSE3_bgr_to_gray_8u</a> (const uint8_t *in, uint8_t *out, int w, int h)</td></tr> <tr><td class="mdescLeft"> </td><td class="mdescRight">Convert a RGB image (3cu8) into a GRAYSCALE (1c8u) image, using Y=77*R+150*G+29*B. <a href="#ga55b830ae32a3393bf57af1c782096eed"></a><br/></td></tr> <tr><td class="memItemLeft" align="right" valign="top">void </td><td class="memItemRight" valign="bottom"><a class="el" href="group__sse__optimizations.html#ga45eca9cc60ed408bc72735f47c2f88d1">image_SSSE3_rgb_to_gray_8u</a> (const uint8_t *in, uint8_t *out, int w, int h)</td></tr> <tr><td class="mdescLeft"> </td><td class="mdescRight">Convert a RGB image (3cu8) into a GRAYSCALE (1c8u) image, using Y=77*R+150*G+29*B. <a href="#ga45eca9cc60ed408bc72735f47c2f88d1"></a><br/></td></tr> <tr><td class="memItemLeft" align="right" valign="top">void </td><td class="memItemRight" valign="bottom"><a class="el" href="group__sse__optimizations.html#ga2ea8698db2e2cb58774eac64b8db0721">image_SSE2_scale_half_1c8u</a> (const uint8_t *in, uint8_t *out, int w, int h)</td></tr> <tr><td class="mdescLeft"> </td><td class="mdescRight">Subsample each 2x2 pixel block into 1x1 pixel, taking the first pixel & ignoring the other 3. <a href="#ga2ea8698db2e2cb58774eac64b8db0721"></a><br/></td></tr> <tr><td class="memItemLeft" align="right" valign="top">void </td><td class="memItemRight" valign="bottom"><a class="el" href="group__sse__optimizations.html#gadd3e6ad655aed605c78f988688b8a986">image_SSE2_scale_half_smooth_1c8u</a> (const uint8_t *in, uint8_t *out, int w, int h)</td></tr> <tr><td class="mdescLeft"> </td><td class="mdescRight">Average each 2x2 pixels into 1x1 pixel (arithmetic average) <a href="#gadd3e6ad655aed605c78f988688b8a986"></a><br/></td></tr> <tr><td class="memItemLeft" align="right" valign="top">float </td><td class="memItemRight" valign="bottom"><a class="el" href="group__sse__optimizations.html#ga0546ed6bea3d509c18dabb603eb8bf94">KLT_response_optimized</a> ()</td></tr> <tr><td class="mdescLeft"> </td><td class="mdescRight">KLT score at a given point of a grayscale image. <a href="#ga0546ed6bea3d509c18dabb603eb8bf94"></a><br/></td></tr> </table> <hr/><h2>Function Documentation</h2> <a class="anchor" id="ga2ea8698db2e2cb58774eac64b8db0721"></a><!-- doxytag: member="CImage_SSE2.cpp::image_SSE2_scale_half_1c8u" ref="ga2ea8698db2e2cb58774eac64b8db0721" args="(const uint8_t *in, uint8_t *out, int w, int h)" --> <div class="memitem"> <div class="memproto"> <table class="memname"> <tr> <td class="memname">void image_SSE2_scale_half_1c8u </td> <td>(</td> <td class="paramtype">const uint8_t * </td> <td class="paramname"><em>in</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">uint8_t * </td> <td class="paramname"><em>out</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>w</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>h</em> </td> </tr> <tr> <td></td> <td>)</td> <td></td><td></td> </tr> </table> </div> <div class="memdoc"> <p>Subsample each 2x2 pixel block into 1x1 pixel, taking the first pixel & ignoring the other 3. </p> <ul> <li><b>Input format:</b> uint8_t, 1 channel</li> <li><b>Output format:</b> uint8_t, 1 channel</li> <li><b>Preconditions:</b> in & out aligned to 16bytes, w = k*16 (w=width in pixels), widthStep=w*1</li> <li><b>Notes:</b></li> <li><b>Requires:</b> SSE2</li> <li><b>Invoked from:</b> <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#a4081b970421194842f33e085e3d64208" title="Returns a new image scaled down to half its original size.">mrpt::utils::CImage::scaleHalf()</a> </li> </ul> <p>Definition at line <a class="el" href="_c_image___s_s_e2_8cpp_source.html#l00059">59</a> of file <a class="el" href="_c_image___s_s_e2_8cpp_source.html">CImage_SSE2.cpp</a>.</p> </div> </div> <a class="anchor" id="gadd3e6ad655aed605c78f988688b8a986"></a><!-- doxytag: member="CImage_SSE2.cpp::image_SSE2_scale_half_smooth_1c8u" ref="gadd3e6ad655aed605c78f988688b8a986" args="(const uint8_t *in, uint8_t *out, int w, int h)" --> <div class="memitem"> <div class="memproto"> <table class="memname"> <tr> <td class="memname">void image_SSE2_scale_half_smooth_1c8u </td> <td>(</td> <td class="paramtype">const uint8_t * </td> <td class="paramname"><em>in</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">uint8_t * </td> <td class="paramname"><em>out</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>w</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>h</em> </td> </tr> <tr> <td></td> <td>)</td> <td></td><td></td> </tr> </table> </div> <div class="memdoc"> <p>Average each 2x2 pixels into 1x1 pixel (arithmetic average) </p> <ul> <li><b>Input format:</b> uint8_t, 1 channel</li> <li><b>Output format:</b> uint8_t, 1 channel</li> <li><b>Preconditions:</b> in & out aligned to 16bytes, w = k*16 (w=width in pixels), widthStep=w*1</li> <li><b>Notes:</b></li> <li><b>Requires:</b> SSE2</li> <li><b>Invoked from:</b> <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#a70e6eb2c8d81eebf6acc189eb694f59d" title="Returns a new image scaled down to half its original size (averaging between every two rows) On odd s...">mrpt::utils::CImage::scaleHalfSmooth()</a> </li> </ul> <p>Definition at line <a class="el" href="_c_image___s_s_e2_8cpp_source.html#l00089">89</a> of file <a class="el" href="_c_image___s_s_e2_8cpp_source.html">CImage_SSE2.cpp</a>.</p> </div> </div> <a class="anchor" id="ga55b830ae32a3393bf57af1c782096eed"></a><!-- doxytag: member="CImage_SSE3.cpp::image_SSSE3_bgr_to_gray_8u" ref="ga55b830ae32a3393bf57af1c782096eed" args="(const uint8_t *in, uint8_t *out, int w, int h)" --> <div class="memitem"> <div class="memproto"> <table class="memname"> <tr> <td class="memname">void image_SSSE3_bgr_to_gray_8u </td> <td>(</td> <td class="paramtype">const uint8_t * </td> <td class="paramname"><em>in</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">uint8_t * </td> <td class="paramname"><em>out</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>w</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>h</em> </td> </tr> <tr> <td></td> <td>)</td> <td></td><td></td> </tr> </table> </div> <div class="memdoc"> <p>Convert a RGB image (3cu8) into a GRAYSCALE (1c8u) image, using Y=77*R+150*G+29*B. </p> <ul> <li><b>Input format:</b> uint8_t, 3 channels (BGR order)</li> <li><b>Output format:</b> uint8_t, 1 channel</li> <li><b>Preconditions:</b> in & out aligned to 16bytes, w = k*16 (w=width in pixels), widthStep=w*3 and w*1</li> <li><b>Notes:</b></li> <li><b>Requires:</b> SSSE3</li> <li><b>Invoked from:</b> <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#ad6c9a96cfc910c8a80a477e554f2094c" title="Returns a grayscale version of the image, or itself if it is already a grayscale image.">mrpt::utils::CImage::grayscale()</a>, <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#adb14c43c957e98285ec0e3750487aeee" title="Replaces the image with a grayscale version of it.">mrpt::utils::CImage::grayscaleInPlace()</a> </li> </ul> <p>Definition at line <a class="el" href="_c_image___s_s_e3_8cpp_source.html#l00227">227</a> of file <a class="el" href="_c_image___s_s_e3_8cpp_source.html">CImage_SSE3.cpp</a>.</p> </div> </div> <a class="anchor" id="ga45eca9cc60ed408bc72735f47c2f88d1"></a><!-- doxytag: member="CImage_SSE3.cpp::image_SSSE3_rgb_to_gray_8u" ref="ga45eca9cc60ed408bc72735f47c2f88d1" args="(const uint8_t *in, uint8_t *out, int w, int h)" --> <div class="memitem"> <div class="memproto"> <table class="memname"> <tr> <td class="memname">void image_SSSE3_rgb_to_gray_8u </td> <td>(</td> <td class="paramtype">const uint8_t * </td> <td class="paramname"><em>in</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">uint8_t * </td> <td class="paramname"><em>out</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>w</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>h</em> </td> </tr> <tr> <td></td> <td>)</td> <td></td><td></td> </tr> </table> </div> <div class="memdoc"> <p>Convert a RGB image (3cu8) into a GRAYSCALE (1c8u) image, using Y=77*R+150*G+29*B. </p> <ul> <li><b>Input format:</b> uint8_t, 3 channels (RGB order)</li> <li><b>Output format:</b> uint8_t, 1 channel</li> <li><b>Preconditions:</b> in & out aligned to 16bytes, w = k*16 (w=width in pixels), widthStep=w*3 and w*1</li> <li><b>Notes:</b></li> <li><b>Requires:</b> SSSE3</li> <li><b>Invoked from:</b> <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#ad6c9a96cfc910c8a80a477e554f2094c" title="Returns a grayscale version of the image, or itself if it is already a grayscale image.">mrpt::utils::CImage::grayscale()</a>, <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#adb14c43c957e98285ec0e3750487aeee" title="Replaces the image with a grayscale version of it.">mrpt::utils::CImage::grayscaleInPlace()</a> </li> </ul> <p>Definition at line <a class="el" href="_c_image___s_s_e3_8cpp_source.html#l00240">240</a> of file <a class="el" href="_c_image___s_s_e3_8cpp_source.html">CImage_SSE3.cpp</a>.</p> </div> </div> <a class="anchor" id="ga4829c652864d5e5bfc8d91e1993930aa"></a><!-- doxytag: member="CImage_SSE3.cpp::image_SSSE3_scale_half_3c8u" ref="ga4829c652864d5e5bfc8d91e1993930aa" args="(const uint8_t *in, uint8_t *out, int w, int h)" --> <div class="memitem"> <div class="memproto"> <table class="memname"> <tr> <td class="memname">void image_SSSE3_scale_half_3c8u </td> <td>(</td> <td class="paramtype">const uint8_t * </td> <td class="paramname"><em>in</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">uint8_t * </td> <td class="paramname"><em>out</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>w</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>h</em> </td> </tr> <tr> <td></td> <td>)</td> <td></td><td></td> </tr> </table> </div> <div class="memdoc"> <p>Subsample each 2x2 pixel block into 1x1 pixel, taking the first pixel & ignoring the other 3. </p> <ul> <li><b>Input format:</b> uint8_t, 3 channels (RGB or BGR)</li> <li><b>Output format:</b> uint8_t, 3 channels (RGB or BGR)</li> <li><b>Preconditions:</b> in & out aligned to 16bytes, w = k*16 (w=width in pixels), widthStep=w*3</li> <li><b>Notes:</b></li> <li><b>Requires:</b> SSSE3</li> <li><b>Invoked from:</b> <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#a4081b970421194842f33e085e3d64208" title="Returns a new image scaled down to half its original size.">mrpt::utils::CImage::scaleHalf()</a> </li> </ul> <p>Definition at line <a class="el" href="_c_image___s_s_e3_8cpp_source.html#l00056">56</a> of file <a class="el" href="_c_image___s_s_e3_8cpp_source.html">CImage_SSE3.cpp</a>.</p> </div> </div> <a class="anchor" id="ga0546ed6bea3d509c18dabb603eb8bf94"></a><!-- doxytag: member="CImage_SSE2.cpp::KLT_response_optimized" ref="ga0546ed6bea3d509c18dabb603eb8bf94" args="()" --> <div class="memitem"> <div class="memproto"> <table class="memname"> <tr> <td class="memname">float KLT_response_optimized </td> <td>(</td> <td class="paramname"></td><td>)</td> <td></td> </tr> </table> </div> <div class="memdoc"> <p>KLT score at a given point of a grayscale image. </p> <ul> <li><b>Requires:</b> SSE2</li> <li><b>Invoked from:</b> <a class="el" href="classmrpt_1_1utils_1_1_c_image.html#afd0dbc513f07e36a3764e674fdff6c1e" title="Compute the KLT response at a given pixel (x,y) - Only for grayscale images (for efficiency it avoids...">mrpt::utils::CImage::KLT_response()</a></li> </ul> <p>This function is not manually optimized for SSE2 but templatized for different window sizes such as the compiler can optimize automatically for that size.</p> <p>Only for the most common window sizes this templates are instantiated (W=[2-16] and W=32 ), falling back to a generic implementation otherwise. The next figure shows the performance (time for KLT_response() to compute the score for one single pixel) for different window sizes.</p> <div class="image"> <img src="KLT_response_performance_SSE2.png" alt="KLT_response_performance_SSE2.png"/> </div> </div> </div> <a class="anchor" id="ga23e43b45f41b2d80acf48341f986e4af"></a><!-- doxytag: member="CImage_SSE3.cpp::private_image_SSSE3_rgb_or_bgr_to_gray_8u" ref="ga23e43b45f41b2d80acf48341f986e4af" args="(const uint8_t *in, uint8_t *out, int w, int h)" --> <div class="memitem"> <div class="memproto"> <div class="memtemplate"> template<bool IS_RGB> </div> <table class="memname"> <tr> <td class="memname">void private_image_SSSE3_rgb_or_bgr_to_gray_8u </td> <td>(</td> <td class="paramtype">const uint8_t * </td> <td class="paramname"><em>in</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">uint8_t * </td> <td class="paramname"><em>out</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>w</em>, </td> </tr> <tr> <td class="paramkey"></td> <td></td> <td class="paramtype">int </td> <td class="paramname"><em>h</em> </td> </tr> <tr> <td></td> <td>)</td> <td></td><td></td> </tr> </table> </div> <div class="memdoc"> <p>Definition at line <a class="el" href="_c_image___s_s_e3_8cpp_source.html#l00109">109</a> of file <a class="el" href="_c_image___s_s_e3_8cpp_source.html">CImage_SSE3.cpp</a>.</p> <p>References <a class="el" href="_s_s_e__macros_8h_source.html#l00033">BUILD_128BIT_CONST</a>.</p> </div> </div> </div> <br><hr><br> <table border="0" width="100%"> <tr> <td> Page generated by <a href="http://www.doxygen.org" target="_blank">Doxygen 1.7.5</a> for MRPT 0.9.5 SVN: at Sun Sep 25 17:20:18 UTC 2011</td><td></td> <td width="100"> </td> <td width="150"> </td></tr> </table> </body></html>