Usage and code examples of the edu.uci.ics.crawler4j.url.WebURL.getParentUrl() method

x33g5p2x, reposted on 2022-02-03 in Other

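This article collects code examples for the Java method edu.uci.ics.crawler4j.url.WebURL.getParentUrl() and shows how WebURL.getParentUrl() is used in practice. The examples were extracted from selected projects on platforms such as GitHub, Stack Overflow, and Maven, and are intended as reference material. Details of the WebURL.getParentUrl() method:
Fully qualified class name: edu.uci.ics.crawler4j.url.WebURL
Class name: WebURL
Method name: getParentUrl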

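About WebURL.getParentUrl

No description is available. Judging from the examples below, the method returns, as a String, the URL of the page on which the link was found.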

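Code examples

Code example source: yasserg/crawler4j (also published as the Maven artifact edu.uci.ics/crawler4j)

// Method of a TupleBinding<WebURL> implementation: serializes a WebURL field by field.
// Any code that reads the entry back must consume the fields in exactly this order.
@Override
public void objectToEntry(WebURL url, TupleOutput output) {
  output.writeString(url.getURL());
  output.writeInt(url.getDocid());
  output.writeInt(url.getParentDocid());
  output.writeString(url.getParentUrl());
  output.writeShort(url.getDepth());
  output.writeByte(url.getPriority());
  output.writeString(url.getAnchor());
}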
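The binding above depends on a fixed field order: whatever reads the entry must consume the fields in the same sequence they were written. The following is a minimal sketch of that round trip using only java.io streams instead of Berkeley DB's TupleOutput and TupleInput. The names UrlRecord and UrlRecordCodec are hypothetical stand-ins, not part of crawler4j, and the presence flag for nullable strings is an assumption of this sketch (DataOutputStream.writeUTF does not accept null).

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

public class UrlRecordCodec {

    /** Hypothetical stand-in for the WebURL fields written by the binding. */
    public static final class UrlRecord {
        public String url;
        public int docid;
        public int parentDocid;
        public String parentUrl; // may be null in this sketch
        public short depth;
        public byte priority;
        public String anchor;    // may be null in this sketch
    }

    // writeUTF rejects null, so a presence flag precedes each nullable string.
    private static void writeNullable(DataOutputStream out, String s) throws IOException {
        out.writeBoolean(s != null);
        if (s != null) {
            out.writeUTF(s);
        }
    }

    private static String readNullable(DataInputStream in) throws IOException {
        return in.readBoolean() ? in.readUTF() : null;
    }

    /** Writes the fields in the same order as the objectToEntry example. */
    public static byte[] encode(UrlRecord r) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeUTF(r.url);
            out.writeInt(r.docid);
            out.writeInt(r.parentDocid);
            writeNullable(out, r.parentUrl);
            out.writeShort(r.depth);
            out.writeByte(r.priority);
            writeNullable(out, r.anchor);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    /** Reads the fields back in exactly the order encode() wrote them. */
    public static UrlRecord decode(byte[] data) {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(data))) {
            UrlRecord r = new UrlRecord();
            r.url = in.readUTF();
            r.docid = in.readInt();
            r.parentDocid = in.readInt();
            r.parentUrl = readNullable(in);
            r.depth = in.readShort();
            r.priority = in.readByte();
            r.anchor = readNullable(in);
            return r;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Swapping any two reads in decode() would silently corrupt the record, which is why the write and read methods are best kept side by side.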

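Code example source: yasserg/crawler4j (also published as the Maven artifact edu.uci.ics/crawler4j)

// Build the WebURL for a redirect target. It inherits the parent doc ID,
// parent URL, and depth of the URL that redirected to it (curURL).
webURL.setURL(movedToUrl);
webURL.setParentDocid(curURL.getParentDocid());
webURL.setParentUrl(curURL.getParentUrl());
webURL.setDepth(curURL.getDepth());
webURL.setDocid(-1);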

Code example source: biezhi/java-library-examples

// Logs every non-200 response. For a 404, getParentUrl() identifies the page
// that contains the broken link.
@Override
protected void handlePageStatusCode(WebURL webUrl, int statusCode, String statusDescription) {
  if (statusCode != HttpStatus.SC_OK) {
    if (statusCode == HttpStatus.SC_NOT_FOUND) {
      logger.warn("Broken link: {}, this link was found in page: {}", webUrl.getURL(),
          webUrl.getParentUrl());
    } else {
      // The original message had no placeholder for the description, so it was never logged.
      logger.warn("Non success status for link: {} status code: {}, description: {}",
          webUrl.getURL(), statusCode, statusDescription);
    }
  }
}
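The status handler above logs each broken link together with the page it was found on. If a summary per parent page is more useful than individual log lines, the pairs can be collected instead. The sketch below is one possible way to do that. BrokenLinkReport and its methods are hypothetical names, not part of crawler4j, and it assumes the parent URL may be null (for example, for a URL that was not discovered on another page). The methods are synchronized on the assumption that several crawler threads may report at once.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class BrokenLinkReport {

    /** Key used when a broken URL has no known parent page (assumption of this sketch). */
    public static final String NO_PARENT = "(no parent)";

    // Parent page URL -> broken links found on that page, in insertion order.
    private final Map<String, List<String>> brokenByParent = new LinkedHashMap<>();

    /** Records one broken link under the page that linked to it. */
    public synchronized void record(String parentUrl, String brokenUrl) {
        String key = (parentUrl == null) ? NO_PARENT : parentUrl;
        brokenByParent.computeIfAbsent(key, k -> new ArrayList<>()).add(brokenUrl);
    }

    /** Returns a copy of the broken links recorded for the given parent page. */
    public synchronized List<String> brokenLinksOn(String parentUrl) {
        String key = (parentUrl == null) ? NO_PARENT : parentUrl;
        List<String> links = brokenByParent.getOrDefault(key, Collections.emptyList());
        return new ArrayList<>(links);
    }

    /** Returns the number of parent pages that have at least one broken link. */
    public synchronized int parentCount() {
        return brokenByParent.size();
    }
}
```

Inside the SC_NOT_FOUND branch, a call such as report.record(webUrl.getParentUrl(), webUrl.getURL()) would feed the report, where report is a shared BrokenLinkReport instance supplied by the surrounding application.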

Code example source: biezhi/java-library-examples

String path = page.getWebURL().getPath();
String subDomain = page.getWebURL().getSubDomain();
String parentUrl = page.getWebURL().getParentUrl();
String anchor = page.getWebURL().getAnchor();
