https 远程获取内容的编码问题 - CNode技术社区

var https = require(‘https’); var punycode = require(‘punycode’) https.get({ host: ‘sp0.baidu.com’, path: ‘/8aQDcjqpAAV3otqbppnN2DJv/api.php?query=202.194.101.150&co=&resource_id=6006’ }, function(res) { res.on(‘data’, function(d) { d.toString(‘utf8’,0,d.length) });

}).on(‘error’, function(e) { console.error(e.toString()); });

网页的代码返回的是个 gbk的编码 , res.on 返回的是个 buffer 还不知道是个上面编码，utf8 转出来中文乱码，求如何解决，还是必须用其他的爬虫模块

alsotang 1楼•10 年前

用 request 这个库，直接使用 https 模块太烦了

dingyong666 2楼•10 年前作者

不用了，明显是百度的接口坑爹！！！换api了。

dingyong666 3楼•10 年前作者

@alsotang request 换个这个后还是乱码！！百度真是坑爹，就不能把中文转一下码。。

magicdawn 4楼•10 年前

用 superagent 然后加一个 superagent-charset

.charset(‘gbk’)

alsotang 5楼•10 年前

@magicdawn 用 superagentparse…

superagentparse 可以获取 buffer
superagentparse 用的是 superagent 原生的 parse 方法。。。

https://github.com/alsotang/superagentparse

magicdawn 6楼•10 年前

klesh 7楼•10 年前

用 iconv-lite 转码

回到顶部