Build the union of a list of RDDs - Python code example

Last modified: 2022-03-11 14:45:06.678000             Author: Mango

Code example 1
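# Build the union of a list of RDDs
# Setup added so the snippet runs on its own; it assumes a local PySpark install.
import os
import tempfile

from pyspark import SparkContext

sc = SparkContext.getOrCreate()
tempdir = tempfile.mkdtemp()

# Write a one-line text file and load it as an RDD
path = os.path.join(tempdir, "union-text.txt")
with open(path, "w") as testFile:
    _ = testFile.write("Hello")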
textFile = sc.textFile(path)
textFile.collect()
# ['Hello']
parallelized = sc.parallelize(["World!"])
sorted(sc.union([textFile, parallelized]).collect())
# ['Hello', 'World!']