Google Cloud Dataflow SDK for Java, version 1.9.1
Package com.google.cloud.dataflow.sdk.values
Defines PCollection and other classes for representing data in a Pipeline.
Interface Summary
PCollectionView<T> - A PCollectionView<T> is an immutable view of a PCollection as a value of type T that can be accessed as a side input to a ParDo transform.
PInput - The interface for things that might be input to a PTransform.
POutput - The interface for things that might be output from a PTransform.
PValue - The interface for values that can be input to and output from PTransforms.
Class Summary
KV<K,V> - An immutable key/value pair.
KV.OrderByKey<K extends Comparable<? super K>,V> - A Comparator that orders KVs by the natural ordering of their keys.
KV.OrderByValue<K,V extends Comparable<? super V>> - A Comparator that orders KVs by the natural ordering of their values.
PBegin
PCollection<T> - A PCollection<T> is an immutable collection of values of type T.
PCollectionList<T> - A PCollectionList<T> is an immutable list of homogeneously typed PCollection<T>s.
PCollectionTuple - A PCollectionTuple is an immutable tuple of heterogeneously-typed PCollections, "keyed" by TupleTags.
PDone
POutputValueBase - A POutputValueBase is the abstract base class of PTransform outputs.
PValueBase - A PValueBase is an abstract base class that provides sensible default implementations for methods of PValue.
TimestampedValue<V> - An immutable pair of a value and a timestamp.
TimestampedValue.TimestampedValueCoder<T> - A Coder for TimestampedValue.
TupleTag<V> - A TupleTag is a typed tag to use as the key of a heterogeneously typed tuple, like PCollectionTuple.
TupleTagList - A TupleTagList is an immutable list of heterogeneously typed TupleTags.
TypeDescriptor<T> - A description of a Java type, including actual generic parameters where possible.
TypedPValue<T> - A TypedPValue<T> is the abstract base class of things that store some number of values of type T.
Enum Summary
PCollection.IsBounded - The enumeration of cases for whether a PCollection is bounded.
Package com.google.cloud.dataflow.sdk.values Description
Defines PCollection and other classes for representing data in a Pipeline.
In particular, see these collection abstractions:
PCollection - an immutable collection of values of type T and the main representation for data in Dataflow.
PCollectionView - an immutable view of a PCollection that can be accessed as a side input of a ParDo PTransform.
PCollectionTuple - a heterogeneous tuple of PCollections used in cases where a PTransform takes or returns multiple PCollections.
PCollectionList - a homogeneous list of PCollections used, for example, as input to Flatten.
And these classes for individual values play particular roles in Dataflow:
KV - a key/value pair that is used by keyed transforms, most notably GroupByKey.
TimestampedValue - a timestamp/value pair that is used for windowing and handling out-of-order data in streaming execution.
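As a rough illustration of why keyed transforms work on key/value pairs, the following plain-Java sketch groups a small list of pairs by key. It deliberately does not use the Dataflow SDK: java.util.Map.Entry stands in for KV, and the helper names kv and groupByKey, along with the class GroupByKeySketch, are hypothetical names chosen for this example. It only mirrors, in memory, the general shape of what a transform such as GroupByKey produces (one entry per key, holding that key's values). It makes no claim about how the SDK implements grouping.

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, SDK-free sketch: Map.Entry is a stand-in for the SDK's KV class.
final class GroupByKeySketch {

    // Builds an immutable key/value pair (stand-in for a KV; not the SDK API).
    public static <K, V> Map.Entry<K, V> kv(K key, V value) {
        return new SimpleImmutableEntry<>(key, value);
    }

    // Collects all values that share a key into one list per key.
    // A TreeMap is used only so the result prints in a predictable key order.
    public static <K extends Comparable<K>, V> Map<K, List<V>> groupByKey(
            List<Map.Entry<K, V>> pairs) {
        Map<K, List<V>> grouped = new TreeMap<>();
        for (Map.Entry<K, V> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> pairs =
                Arrays.asList(kv("a", 1), kv("b", 2), kv("a", 3));
        // prints {a=[1, 3], b=[2]}
        System.out.println(groupByKey(pairs));
    }
}
```

In this in-memory sketch the values for each key keep their input order. That is a property of the sketch only and should not be assumed of a distributed pipeline.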
For further details, see the documentation for each class in this package.