<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0">
<channel>
<title><![CDATA[向东博客 专注WEB应用 构架之美 --- 构架之美，在于尽态极妍 | 应用之美，在于药到病除]]></title> 
<link>http://jackxiang.com/index.php</link> 
<description><![CDATA[赢在IT，Playin' with IT,Focus on Killer Application,Marketing Meets Technology.]]></description> 
<language>zh-cn</language> 
<copyright><![CDATA[向东博客 专注WEB应用 构架之美 --- 构架之美，在于尽态极妍 | 应用之美，在于药到病除]]></copyright>
<item>
<link>http://jackxiang.com/post//</link>
<title><![CDATA[如何去掉网页header信息]]></title> 
<author>jack &lt;xdy108@126.com&gt;</author>
<category><![CDATA[WEB2.0]]></category>
<pubDate>Wed, 28 Dec 2011 05:41:41 +0000</pubDate> 
<guid>http://jackxiang.com/post//</guid> 
<description>
<![CDATA[ 
	问：<br/>fsockopen - fputs - fget 后得到网页的内容,其中头部包括了那些headr信息的,请问如何能够把这些信息去掉?<br/>就是诸如<br/>HTTP/1.1 200 OK Date: Tue, 23 Mar 2004 19:29:43 GMT Server: Apache/1.3.22 Set-Cookie: BAIDUID=6E67AA72C34CD67E19FA4F0C8A58C9CB; expires=Tue, 23-Mar-34 19:29:43 GMT; path=/; domain=.baidu.com Cache-Control: max-age=86400 Expires: Wed, 24 Mar 2004 19:29:43 GMT Last-Modified: Wed, 17 Mar 2004 18:05:00 GMT ETag: &quot;3979-11ae-4058934c&quot; Accept-Ranges: bytes Content-Length: 4526 Connection: close Content-Type: text/html<br/>这些东西<br/>______________________________________________________________________________________________<br/>答1：<br/>&lt;?php <br/>$str = &#039;<br/>hello HTTP/1.1 200 OK Date: Tue, 23 Mar 2004 19:29:43 GMT Server: Apache/1.3.22 Set-Cookie: BAIDUID=6E67AA72C34CD67E19FA4F0C8A58C9CB; expires=Tue, 23-Mar-34 19:29:43 GMT; path=/; domain=.baidu.com Cache-Control: max-age=86400 Expires: Wed, 24 Mar 2004 19:29:43 GMT Last-Modified: Wed, 17 Mar 2004 18:05:00 GMT ETag: &quot;3979-11ae-4058934c&quot; Accept-Ranges: bytes Content-Length: 4526 Connection: close Content-Type: text/html<br/>&#039;;<br/>$reg = &#039;&#124;HTTP/1.1 200 OK Date: Tue, 23 Mar 2004 19:29:43 GMT Server: Apache/1.3.22 Set-Cookie: BAIDUID=6E67AA72C34CD67E19FA4F0C8A58C9CB; expires=Tue, 23-Mar-34 19:29:43 GMT; path=/; domain=.baidu.com Cache-Control: max-age=86400 Expires: Wed, 24 Mar 2004 19:29:43 GMT Last-Modified: Wed, 17 Mar 2004 18:05:00 GMT ETag: &quot;3979-11ae-4058934c&quot; Accept-Ranges: bytes Content-Length: 4526 Connection: close Content-Type: text/html&#124;&#039;;<br/><br/>$abc = preg_replace($reg,&quot;&quot;,$str);<br/>echo $abc;<br/>?&gt;<br/>______________________________________________________________________________________________<br/>答2：<br/>先把返回的值放到变量中，设为$text<br/>则<br/>$text = preg_replace(&quot;&#124;^.+Content-Type: text/html&#124;&quot;,&quot;&quot;,$text);<br/>______________________________________________________________________________________________<br/>答3：<br/>我前面贴的只是一个例子,但是有些网站返回的header信息不一定是那个样子的,而且也未必是Content-Type: text/html结尾.<br/>如果是那样子就确实比较简单了.<br/>继续求救~~~<br/>curl函数虽然是可以,但是不是系统默认支持的函数库,也不好:(<br/>______________________________________________________________________________________________<br/>答4：<br/>Accept-Ranges: bytes Content-Length: 4526 <br/>表示数据体的长度，你自己分析一下<br/>______________________________________________________________________________________________<br/>答5：<br/>to 唠叨<br/>分析content length确实可以<br/>不过有些网站可能用了类似 ob_start() ... 之类的函数<br/>它的header返回的信息是 Transfer-Encoding: chunked ,而没有content length的!<br/>______________________________________________________________________________________________<br/>答6：<br/>还有一个奇怪的问题是,如果对方网站真的用了ob_start() ... 之类的函数<br/>get回来的网页内容每一小段中间会自己插入一个 1000 在里面,估计是1k个字节打印一个1000出来(有兴趣的网友可以试试).<br/>不过这个东西把原来的网页排版都给搅乱了~~!<br/>______________________________________________________________________________________________<br/>答7：<br/>如上所说，高手快点来<br/>______________________________________________________________________________________________<br/>答8：<br/>顶<br/>______________________________________________________________________________________________<br/>答9：<br/>没人帮忙解答吗？<br/>______________________________________________________________________________________________<br/>答10：<br/>顶<br/>来源：http://study.qqcf.com/web/723/281740.htm<br/>
]]>
</description>
</item><item>
<link>http://jackxiang.com/post//#blogcomment</link>
<title><![CDATA[[评论] 如何去掉网页header信息]]></title> 
<author> &lt;user@domain.com&gt;</author>
<category><![CDATA[评论]]></category>
<pubDate>Thu, 01 Jan 1970 00:00:00 +0000</pubDate> 
<guid>http://jackxiang.com/post//#blogcomment</guid> 
<description>
<![CDATA[ 
	
]]>
</description>
</item>
</channel>
</rss>