不知道为什么现在各大搜索引擎编码居然不一样.当然不是GB2312就是UTF-8了.编码问题是比较头疼的问题...头疼的不要命...
<br>我们获得关键词,一般是通过来访页面的url进行分析的.比如
<br>http://www.google.com/search?hl=zh-CN&q=%E5%AD%A4%E7%8B%AC&lr=
<br>各位肯定知道这个是通过urlencode编码的.
<br>我们得到其中的信息,需要进行2步.第一步是进行urldecode,在我们普通参数活得的时候,这个是由ASP自己来进行的,但是现在我们不得不进行手工解码.
<br>网上函数很多,但都是针对于GB2312页面解GB2312.UTF-8的.对于这个,我们可以很轻松的先进行解码,然后根据搜索引擎判断它的编码,如果是UTF-8就再转换为GB2312.
<br>但是由于我的网站是UTF-8页面的.而UTF-8页面我找到的只有解UTF-8字符的urldecode编码的.在这里停顿了很久,最后我只能用最糟糕的方法,把拆分出来的关键词用xmlhttp提交到一个GB2312的ASP页面,然后活得乱码(GB2312)后再进行GB2312 to UTF-8的转换.
<br>下面主要实现代码.
<br>Public Function GetSearchKeyword(RefererUrl) 搜索关键词
<br> if RefererUrl="" or len(RefererUrl)<1 then exit function
<br><br> on error resume next
<br><br> Dim re
<br> Set re = New RegExp
<br> re.IgnoreCase = True
<br> re.Global = True
<br> Dim a,b,j
<br> 模糊查找关键词,此方法速度较快,范围也较大
<br> re.Pattern = "(word=([^&]*)|q=([^&]*)|p=([^&]*)|query=([^&]*)|name=([^&]*)|_searchkey=([^&]*)|baidu.*?w=([^&]*))"
<br> Set a = re.Execute(RefererUrl)
<br> If a.Count>0 then
<br> Set b = a(a.Count-1).SubMatches
<br> For j=1 to b.Count
<br> If Len(b(j))>0 then
<br> if instr(1,RefererUrl,"google",1) then
<br> GetSearchKeyword=Trim(U8Decode(b(j)))
<br> elseif instr(1,refererurl,"yahoo",1) then
<br> GetSearchKeyword=Trim(U8Decode(b(j)))
<br> elseif instr(1,refererurl,"yisou",1) then
<br> GetSearchKeyword=Trim(getkey(b(j)))
<br> elseif instr(1,refererurl,"3721",1) then
<br> GetSearchKeyword=Trim(getkey(b(j)))
<br> else
<br> GetSearchKeyword=Trim(getkey(b(j)))
<br> end if
<br> Exit Function
<br> end if
<br> Next
<br> End If
<br> if err then
<br> err.clear
<br> GetSearchKeyword = RefererUrl
<br> else
<br> GetSearchKeyword = ""
<br> end if
<br> End Function
<br><br> Function URLEncoding(vstrIn)
<br> dim strReturn,i,thischr
<br> strReturn = ""
<br> For i = 1 To Len(vstrIn)
<br> ThisChr = Mid(vStrIn,i,1)
<br> If Abs(Asc(ThisChr)) < &HFF Then
<br> strReturn = strReturn & ThisChr
<br> Else
<br> innerCode = Asc(ThisChr)
<br> If innerCode < 0 Then
<br> innerCode = innerCode + &H10000
<br> End If
<br> Hight8 = (innerCode And &HFF00)\ &HFF
<br> Low8 = innerCode And &HFF
<br> strReturn = strReturn & "%" & Hex(Hight8) & "%" & Hex(Low8)
<br> End If
<br> Next
<br> URLEncoding = strReturn
<br>End Function
<br>function getkey(key)
<br>dim oreq
<br>set oreq = CreateObject("MSXML2.XMLHTTP")
<br>oReq.open "POST","http://&qu |
|