org.apache.spark.api.java.JavaPairRDD.takeSample()方法的使用及代码示例

x33g5p2x  于2022-01-21 转载在 其他  
字(1.5k)|赞(0)|评价(0)|浏览(83)

本文整理了Java中org.apache.spark.api.java.JavaPairRDD.takeSample()方法的一些代码示例,展示了JavaPairRDD.takeSample()的具体用法。这些代码示例主要来源于Github/Stackoverflow/Maven等平台,是从一些精选项目中提取出来的代码,具有较强的参考意义,能在一定程度帮忙到你。JavaPairRDD.takeSample()方法的具体详情如下:
包路径:org.apache.spark.api.java.JavaPairRDD
类名称:JavaPairRDD
方法名:takeSample

JavaPairRDD.takeSample介绍

暂无

代码示例

代码示例来源:origin: mahmoudparsian/data-algorithms-book

static List<Vector> getInitialCentroids(
    JavaPairRDD<String, Vector> data, 
    final int K) {
  //
  List<Tuple2<String, Vector>> centroidTuples = data.takeSample(false, K, 42);
  final List<Vector> centroids = new ArrayList<Vector>();
  for (Tuple2<String, Vector> t : centroidTuples) {
    centroids.add(t._2());
  }
  return centroids;
}

代码示例来源:origin: ypriverol/spark-java8

static List<Vector> getInitialCentroids(
    JavaPairRDD<String, Vector> data,
    final int K) {
  //
  List<Tuple2<String, Vector>> centroidTuples = data.takeSample(false, K, 42);
  final List<Vector> centroids = new ArrayList<>();
  for (Tuple2<String, Vector> t : centroidTuples) {
    centroids.add(t._2());
  }
  return centroids;
}

代码示例来源:origin: unchartedsoftware/ensemble-clustering

private Map<String, Instance> initKMeans(SparkDataSet ds) {
  Map<String, Instance> kmeans = new HashMap<String, Instance>(k);
  List<Tuple2<String, Instance>> kpoints = ds.getRDD().takeSample(false, k, (new Random()).nextInt());
  
  for (Tuple2<String, Instance> point : kpoints) {
    Cluster c = this.createCluster();
    c.add(point._2);
    kmeans.put(c.getId(), c);
  }
  return kmeans;
}

相关文章

微信公众号

最新文章

更多

JavaPairRDD类方法