Overview
Utilice las siguientes opciones de configuración para configurar el inicio del conector de origen MongoDB Kafka para convertir colecciones MongoDB en eventos Change Stream.
Tip
Para ver un ejemplo del uso de la función de copia existente, consulte la Ejemplode uso de copia de datos existentes.
Para obtener una lista de las configuraciones del conector de origen organizadas por categoría, consulte la guía sobre Propiedades de configuración del conector de origen.
Configuraciones
Nombre | Descripción | |
|---|---|---|
startup.mode | Type: string Description: Specifies how the connector should start up when there is no
source offset available. Resuming a change stream requires a
resume token, which the connector gets from the source offset.
If no source offset is available, the connector may either
ignore all or some of the existing source data, or may at first
copy all existing source data and then continue with processing
new data. If startup.mode=latest, the connector ignores all existing
source data.If startup.mode=timestamp, the connector
actuates startup.mode.timestamp.* properties. If no
properties are configured, timestamp is equivalent to
latest.If startup.mode=copy_existing, the connector
copies all existing source data to Change Stream events. This
setting is equivalent to the deprecated setting copy.existing=true.Si algún sistema modifica los datos de la base de datos mientras el conector de origen convierte los datos existentes, MongoDB puede generar eventos de flujo de cambios duplicados para reflejar los cambios más recientes. Dado que los eventos de flujo de cambios en los que se basa la copia de datos son idempotentes, los datos copiados son consistentes, cumpliendo con la garantía de entrega "al menos una vez". Default: latestAccepted Values: latest, timestamp, copy_existing | |
startup.mode.timestamp.start.at.operation.time | Type: string Description: Actuated only if startup.mode=timestamp. Specifies the
starting point for the change stream.To learn more about Change Stream parameters, see
$changeStream (aggregation)
in the MongoDB manual. Default: ""Accepted Values:
| |
startup.mode.copy.existing.namespace.regex | Type: string Description: Regular expression the connector uses to match namespaces from
which to copy data. A namespace describes the MongoDB database name
and collection separated by a period (for example, databaseName.collectionName).For example, the following regular-expression setting matches
collections that start with "page" in the stats database:The \ character in the example above escapes the . character
that follows it in the regular expression. For more information on
how to build regular expressions, see
Patterns
in the Java API documentation.Default: ""Accepted Values: A valid regular expression | |
startup.mode.copy.existing.pipeline | Type: string Description: An inline array of pipeline operations
the connector runs when copying existing data. You can use this
setting to filter the source collection and improve the use of
indexes in the copying process. For example, the following setting uses the $match
aggregation operator to instruct the connector to copy only
documents that contain a closed field with a value of false.Default: ""Accepted Values: Valid aggregation pipeline stages | |
startup.mode.copy.existing.max.threads | Type: int Description: The maximum number of threads the connector can use to copy data. Default: number of processors available in the environment Accepted Values: An integer | |
startup.mode.copy.existing.queue.size | Type: int Description: The size of the queue the connector can use when copying data. Default: 16000Accepted Values: An integer | |
startup.mode.copy.existing.allow.disk.use | Type: boolean Description: When set to true, the connector uses temporary disk storage
for the copy existing aggregation.Default: trueAccepted Values: true or false |