Usage and code examples of the us.codecraft.webmagic.Page.setRequest() method

x33g5p2x, reposted on 2022-01-26 in Other

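This article collects code examples of the Java method us.codecraft.webmagic.Page.setRequest() to show how Page.setRequest() is used in practice. The examples come mainly from platforms such as GitHub, Stack Overflow, and Maven, and were extracted from selected projects, so they should serve as a useful reference. Details of Page.setRequest() are as follows:
Package path: us.codecraft.webmagic.Page
Class name: Page
Method name: setRequest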

Page.setRequest introduction

None available.
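As a rough illustration of the pattern shared by the examples on this page, the sketch below attaches the originating request to a freshly built page so that later stages can read it back. It uses simplified stand-in classes: SketchRequest, SketchPage, and SetRequestSketch are hypothetical names introduced here, not the WebMagic classes, and the real Page.setRequest() may do more than store the reference.

```java
// Simplified stand-ins for illustration only; these are NOT the WebMagic classes.
class SketchRequest {
    private final String url;

    SketchRequest(String url) {
        this.url = url;
    }

    String getUrl() {
        return url;
    }
}

class SketchPage {
    private SketchRequest request;
    private String rawText;
    private int statusCode;

    // Records which request produced this page.
    void setRequest(SketchRequest request) {
        this.request = request;
    }

    SketchRequest getRequest() {
        return request;
    }

    void setRawText(String rawText) {
        this.rawText = rawText;
    }

    String getRawText() {
        return rawText;
    }

    void setStatusCode(int statusCode) {
        this.statusCode = statusCode;
    }

    int getStatusCode() {
        return statusCode;
    }
}

class SetRequestSketch {
    // Hypothetical downloader step: wrap a response body in a page
    // and attach the request that was used to fetch it.
    static SketchPage toPage(SketchRequest request, String body) {
        SketchPage page = new SketchPage();
        page.setRawText(body);
        page.setRequest(request);
        page.setStatusCode(200);
        return page;
    }

    public static void main(String[] args) {
        SketchRequest request = new SketchRequest("https://example.com/list");
        SketchPage page = toPage(request, "<html></html>");
        // A later stage can recover the URL from the page alone.
        System.out.println(page.getRequest().getUrl());
    }
}
```

Keeping the request on the page means code that only receives the page, such as a processing step, does not need the request passed to it separately.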

Code examples

Code example source: origin: code4craft/webmagic

protected Page handleResponse(Request request, String charset, HttpResponse httpResponse, Task task) throws IOException {
  byte[] bytes = IOUtils.toByteArray(httpResponse.getEntity().getContent());
  String contentType = httpResponse.getEntity().getContentType() == null ? "" : httpResponse.getEntity().getContentType().getValue();
  Page page = new Page();
  page.setBytes(bytes);
  if (!request.isBinaryContent()){
    if (charset == null) {
      charset = getHtmlCharset(contentType, bytes);
    }
    page.setCharset(charset);
    page.setRawText(new String(bytes, charset));
  }
  page.setUrl(new PlainText(request.getUrl()));
  page.setRequest(request);
  page.setStatusCode(httpResponse.getStatusLine().getStatusCode());
  page.setDownloadSuccess(true);
  if (responseHeader) {
    page.setHeaders(HttpClientUtils.convertHeaders(httpResponse.getAllHeaders()));
  }
  return page;
}

Code example source: origin: code4craft/webmagic

@Override
public Page download(Request request, Task task) {
  if (logger.isInfoEnabled()) {
    logger.info("downloading page: " + request.getUrl());
  }
  String content = getPage(request);
  if (content.contains("HTTP request failed")) {
    for (int i = 1; i <= getRetryNum(); i++) {
      content = getPage(request);
      if (!content.contains("HTTP request failed")) {
        break;
      }
    }
    if (content.contains("HTTP request failed")) {
      // All retries failed: return a Page that carries only the request
      Page page = new Page();
      page.setRequest(request);
      return page;
    }
  }
  Page page = new Page();
  page.setRawText(content);
  page.setUrl(new PlainText(request.getUrl()));
  page.setRequest(request);
  page.setStatusCode(200);
  return page;
}
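The retry-then-fail flow in the example above can be sketched in isolation. This is a minimal sketch under stated assumptions: RetryRequest, RetryPage, RetryDownloadSketch, and the fetcher parameter are hypothetical stand-ins introduced for illustration, not WebMagic types, and the loop is a simplified equivalent of the one in the original method.

```java
// Simplified stand-ins for illustration only; these are NOT the WebMagic classes.
class RetryRequest {
    private final String url;

    RetryRequest(String url) {
        this.url = url;
    }

    String getUrl() {
        return url;
    }
}

class RetryPage {
    private RetryRequest request;
    private String rawText;   // stays null when the download failed
    private int statusCode;   // stays 0 when the download failed

    void setRequest(RetryRequest request) { this.request = request; }
    RetryRequest getRequest() { return request; }
    void setRawText(String rawText) { this.rawText = rawText; }
    String getRawText() { return rawText; }
    void setStatusCode(int statusCode) { this.statusCode = statusCode; }
    int getStatusCode() { return statusCode; }
}

class RetryDownloadSketch {
    static final String FAILURE_MARKER = "HTTP request failed";

    // Fetches once, then retries up to retryNum times while the content
    // still contains the failure marker.
    static RetryPage download(RetryRequest request,
                              java.util.function.Function<RetryRequest, String> fetcher,
                              int retryNum) {
        String content = fetcher.apply(request);
        for (int i = 0; i < retryNum && content.contains(FAILURE_MARKER); i++) {
            content = fetcher.apply(request);
        }

        RetryPage page = new RetryPage();
        // The request is attached on both paths, so the caller can always
        // tell which request a page belongs to.
        page.setRequest(request);
        if (content.contains(FAILURE_MARKER)) {
            return page; // failure: no text, no status code
        }
        page.setRawText(content);
        page.setStatusCode(200);
        return page;
    }
}
```

Because the failed page still carries its request, the caller can identify what failed and decide whether to reschedule it.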

Code example source: origin: code4craft/webmagic

page.setHtml(new Html(content, request.getUrl()));
page.setUrl(new PlainText(request.getUrl()));
page.setRequest(request);
webDriverPool.returnToPool(webDriver);
return page;

Code example source: origin: us.codecraft/webmagic-core

protected Page handleResponse(Request request, String charset, HttpResponse httpResponse, Task task) throws IOException {
  byte[] bytes = IOUtils.toByteArray(httpResponse.getEntity().getContent());
  String contentType = httpResponse.getEntity().getContentType() == null ? "" : httpResponse.getEntity().getContentType().getValue();
  Page page = new Page();
  page.setBytes(bytes);
  if (!request.isBinaryContent()){
    if (charset == null) {
      charset = getHtmlCharset(contentType, bytes);
    }
    page.setCharset(charset);
    page.setRawText(new String(bytes, charset));
  }
  page.setUrl(new PlainText(request.getUrl()));
  page.setRequest(request);
  page.setStatusCode(httpResponse.getStatusLine().getStatusCode());
  page.setDownloadSuccess(true);
  if (responseHeader) {
    page.setHeaders(HttpClientUtils.convertHeaders(httpResponse.getAllHeaders()));
  }
  return page;
}

Code example source: origin: QiuMing/zhihuWebSpider

protected Page handleResponse(Request request, String charset, HttpResponse httpResponse, Task task) throws IOException {
  String content = getContent(charset, httpResponse);
  Page page = new Page();
  page.setRawText(content);
  page.setUrl(new PlainText(request.getUrl()));
  page.setRequest(request);
  page.setStatusCode(httpResponse.getStatusLine().getStatusCode());
  return page;
}

Code example source: origin: seawaylee/doubanWebSpider

protected Page handleResponse(Request request, String charset, HttpResponse httpResponse, Task task) throws IOException {
  String content = getContent(charset, httpResponse);
  Page page = new Page();
  page.setRawText(content);
  page.setUrl(new PlainText(request.getUrl()));
  page.setRequest(request);
  page.setStatusCode(httpResponse.getStatusLine().getStatusCode());
  return page;
}
