本文整理了Java中org.apache.spark.api.java.JavaPairRDD.takeSample()
方法的一些代码示例,展示了JavaPairRDD.takeSample()
的具体用法。这些代码示例主要来源于Github
/Stackoverflow
/Maven
等平台,是从一些精选项目中提取出来的代码,具有较强的参考意义,能在一定程度帮忙到你。JavaPairRDD.takeSample()
方法的具体详情如下:
包路径:org.apache.spark.api.java.JavaPairRDD
类名称:JavaPairRDD
方法名:takeSample
暂无
代码示例来源:origin: mahmoudparsian/data-algorithms-book
static List<Vector> getInitialCentroids(
JavaPairRDD<String, Vector> data,
final int K) {
//
List<Tuple2<String, Vector>> centroidTuples = data.takeSample(false, K, 42);
final List<Vector> centroids = new ArrayList<Vector>();
for (Tuple2<String, Vector> t : centroidTuples) {
centroids.add(t._2());
}
return centroids;
}
代码示例来源:origin: ypriverol/spark-java8
static List<Vector> getInitialCentroids(
JavaPairRDD<String, Vector> data,
final int K) {
//
List<Tuple2<String, Vector>> centroidTuples = data.takeSample(false, K, 42);
final List<Vector> centroids = new ArrayList<>();
for (Tuple2<String, Vector> t : centroidTuples) {
centroids.add(t._2());
}
return centroids;
}
代码示例来源:origin: unchartedsoftware/ensemble-clustering
private Map<String, Instance> initKMeans(SparkDataSet ds) {
Map<String, Instance> kmeans = new HashMap<String, Instance>(k);
List<Tuple2<String, Instance>> kpoints = ds.getRDD().takeSample(false, k, (new Random()).nextInt());
for (Tuple2<String, Instance> point : kpoints) {
Cluster c = this.createCluster();
c.add(point._2);
kmeans.put(c.getId(), c);
}
return kmeans;
}
内容来源于网络,如有侵权,请联系作者删除!