@PublicEvolving public class NonSplittingRecursiveEnumerator extends Object implements FileEnumerator
This FileEnumerator enumerates all files under the given paths recursively.
Each file becomes one split; this enumerator does not split files into smaller "block" units.
The default instantiation of this enumerator filters out files whose names start with one of the common hidden-file prefixes '.' and '_'. A custom file filter can be specified instead.
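The hidden-file rule described above can be illustrated with a plain `java.util.function.Predicate`. This is a minimal sketch, not Flink's implementation: it uses `java.nio.file.Path` as a stand-in for `org.apache.flink.core.fs.Path` so that it runs without Flink on the classpath, and it assumes the predicate returns `true` for paths that should be kept. The class name `HiddenFileFilterSketch` and the `CSV_ONLY` filter are hypothetical names introduced for illustration.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.function.Predicate;

/** Illustrative stand-in for a file filter; not the Flink implementation. */
final class HiddenFileFilterSketch {

    /** Keeps a path unless its file name starts with '.' or '_'. */
    static final Predicate<Path> DEFAULT_FILTER = path -> {
        Path fileName = path.getFileName();
        if (fileName == null) {
            return true; // a root path such as "/" has no file name
        }
        String name = fileName.toString();
        return !(name.startsWith(".") || name.startsWith("_"));
    };

    /** Hypothetical custom filter: visible files that also end in ".csv". */
    static final Predicate<Path> CSV_ONLY = DEFAULT_FILTER.and(path -> {
        Path fileName = path.getFileName();
        return fileName != null && fileName.toString().endsWith(".csv");
    });

    public static void main(String[] args) {
        String[] candidates = {
            "/data/part-0001.csv", "/data/_SUCCESS", "/data/.part.crc", "/data/part.json"
        };
        for (String candidate : candidates) {
            Path path = Paths.get(candidate);
            System.out.println(candidate
                    + " default=" + DEFAULT_FILTER.test(path)
                    + " csvOnly=" + CSV_ONLY.test(path));
        }
    }
}
```

With Flink available, a predicate of the same shape over `org.apache.flink.core.fs.Path` would be passed to the one-argument constructor listed in the constructor summary.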
Nested classes/interfaces inherited from interface FileEnumerator: FileEnumerator.Provider

| Constructor and Description |
|---|
| NonSplittingRecursiveEnumerator(): Creates a NonSplittingRecursiveEnumerator that enumerates all files except hidden files. |
| NonSplittingRecursiveEnumerator(java.util.function.Predicate<org.apache.flink.core.fs.Path> fileFilter): Creates a NonSplittingRecursiveEnumerator that uses the given predicate as a filter for file paths. |
| Modifier and Type | Method and Description |
|---|---|
| protected void | convertToSourceSplits(org.apache.flink.core.fs.FileStatus file, org.apache.flink.core.fs.FileSystem fs, List<FileSourceSplit> target) |
| Collection<FileSourceSplit> | enumerateSplits(org.apache.flink.core.fs.Path[] paths, int minDesiredSplits): Generates all file splits for the relevant files under the given paths. |
| protected String | getNextId() |
public NonSplittingRecursiveEnumerator()
public NonSplittingRecursiveEnumerator(java.util.function.Predicate<org.apache.flink.core.fs.Path> fileFilter)
public Collection<FileSourceSplit> enumerateSplits(org.apache.flink.core.fs.Path[] paths, int minDesiredSplits) throws IOException
Description copied from interface: FileEnumerator
Generates all file splits for the relevant files under the given paths. The minDesiredSplits is an optional hint indicating how many splits would be necessary to exploit parallelism properly.
Specified by: enumerateSplits in interface FileEnumerator
Throws: IOException
protected void convertToSourceSplits(org.apache.flink.core.fs.FileStatus file, org.apache.flink.core.fs.FileSystem fs, List<FileSourceSplit> target) throws IOException
Throws: IOException
protected final String getNextId()
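The behavior documented above (recursive traversal, a path filter, one split per file, and a fresh ID per split) can be sketched with the JDK's `java.nio.file` API. This is an illustration under stated assumptions, not the Flink source: `RecursiveEnumerationSketch` and its nested `Split` type are hypothetical stand-ins for the enumerator and `FileSourceSplit`, the filter is assumed to apply to directories as well as files, and the `minDesiredSplits` hint is left out because a non-splitting enumerator produces exactly one split per file regardless.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

/** Illustrative sketch of non-splitting recursive enumeration; not Flink code. */
final class RecursiveEnumerationSketch {

    /** Hypothetical split type: one whole file plus a unique ID. */
    static final class Split {
        final String id;
        final Path file;
        final long length;

        Split(String id, Path file, long length) {
            this.id = id;
            this.file = file;
            this.length = length;
        }
    }

    private final Predicate<Path> fileFilter;
    private int nextId = 1;

    RecursiveEnumerationSketch(Predicate<Path> fileFilter) {
        this.fileFilter = fileFilter;
    }

    /** Walks every given path and collects one split per accepted file. */
    List<Split> enumerateSplits(Path[] paths) throws IOException {
        List<Split> target = new ArrayList<>();
        for (Path path : paths) {
            addSplitsForPath(path, target);
        }
        return target;
    }

    private void addSplitsForPath(Path path, List<Split> target) throws IOException {
        if (!fileFilter.test(path)) {
            return; // rejected paths (and everything below them) are skipped
        }
        if (Files.isDirectory(path)) {
            try (DirectoryStream<Path> children = Files.newDirectoryStream(path)) {
                for (Path child : children) {
                    addSplitsForPath(child, target);
                }
            }
        } else {
            // The whole file becomes a single split; it is never cut into blocks.
            target.add(new Split(getNextId(), path, Files.size(path)));
        }
    }

    private String getNextId() {
        return String.valueOf(nextId++);
    }

    /** Builds a small temporary directory tree and enumerates it. */
    static List<Split> demo() {
        try {
            Path root = Files.createTempDirectory("enum-sketch");
            Path sub = Files.createDirectory(root.resolve("sub"));
            Files.write(root.resolve("a.txt"), new byte[] {1, 2, 3});
            Files.write(sub.resolve("b.txt"), new byte[] {4, 5});
            Files.write(sub.resolve("_hidden"), new byte[] {6});
            Files.write(root.resolve(".dot"), new byte[] {7});
            Predicate<Path> visible = p -> {
                String name = p.getFileName().toString();
                return !(name.startsWith(".") || name.startsWith("_"));
            };
            return new RecursiveEnumerationSketch(visible).enumerateSplits(new Path[] {root});
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        for (Split split : demo()) {
            System.out.println(split.id + " " + split.file + " (" + split.length + " bytes)");
        }
    }
}
```

The order of the resulting splits depends on the directory listing order of the file system, so callers of a sketch like this should not rely on it.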
Copyright © 2014–2020 The Apache Software Foundation. All rights reserved.