Organiza tus páginas con colecciones
Guarda y categoriza el contenido según tus preferencias.
Puedes convertir facturas en datos estructurados en Cloud Data Fusion con el complemento Invoice Parser, que se basa en Document AI. Los datos
estructurados se almacenan en BigQuery.
Antes de comenzar
Para analizar facturas, necesitas una instancia de Cloud Data Fusion que se ejecute en la versión 6.4.1 o una posterior. Para obtener más información, consulta
Cómo actualizar instancias de Cloud Data Fusion.
Crea un procesador
En la Google Cloud consola, ve a la página Procesadores
de Document AI.
Asegúrate de que la instancia deseada se haya actualizado a la versión 6.4.1 o una posterior. Para versiones anteriores, actualiza la instancia.
Haz clic en Ver instancia.
Se abrirá la IU de Cloud Data Fusion.
Haz clic en Hub.
Haz clic en GCP y, luego, implementa los complementos de GCP.
Haz clic en DocAI y, luego, implementa los complementos de Doc AI.
Haz clic en el Instructivo de Invoice Parser>Crear.
Para personalizar tu canalización, ingresa el ID del procesador de Invoice Parser, la ruta de acceso del bucket de Cloud Storage y los detalles de la tabla de BigQuery.
Implementa y ejecuta la canalización
Las facturas analizadas se almacenan en la tabla de salida de BigQuery.
Los metadatos de las facturas se almacenan en la tabla Metadata y, entre ellos, se incluyen el estado de análisis, la ruta de acceso de Cloud Storage y la marca de tiempo de carga de la factura sin procesar. Los registros de las tablas de salida y metadatos se pueden unir con la clave invoice_uuid.
[[["Fácil de comprender","easyToUnderstand","thumb-up"],["Resolvió mi problema","solvedMyProblem","thumb-up"],["Otro","otherUp","thumb-up"]],[["Difícil de entender","hardToUnderstand","thumb-down"],["Información o código de muestra incorrectos","incorrectInformationOrSampleCode","thumb-down"],["Faltan la información o los ejemplos que necesito","missingTheInformationSamplesINeed","thumb-down"],["Problema de traducción","translationIssue","thumb-down"],["Otro","otherDown","thumb-down"]],["Última actualización: 2025-09-04 (UTC)"],[[["\u003cp\u003eThe Invoice Parser plugin in Cloud Data Fusion, powered by Document AI, allows you to convert invoices into structured data.\u003c/p\u003e\n"],["\u003cp\u003eParsed data from invoices is stored in BigQuery, with invoice metadata stored in a separate table for additional information.\u003c/p\u003e\n"],["\u003cp\u003eTo use the Invoice Parser plugin, you must have a Cloud Data Fusion instance running version 6.4.1 or later and you need to have created an Invoice Parser processor in Document AI.\u003c/p\u003e\n"],["\u003cp\u003eThe pipeline is customizable, allowing you to specify the Invoice Parser processor ID, Cloud Storage bucket path, and BigQuery table details during configuration.\u003c/p\u003e\n"]]],[],null,["# Parse invoices\n\nYou can convert invoices into structured data in Cloud Data Fusion\nusing the Invoice Parser plugin, which is powered by Document AI. The\nstructured data gets stored in BigQuery.\n\nBefore you begin\n----------------\n\nTo parse invoices, you need a Cloud Data Fusion instance running in\nversion 6.4.1 or later. For more information, see\n[Upgrading Cloud Data Fusion instances](/data-fusion/docs/how-to/upgrading#upgrade-instances).\n\nCreate a processor\n------------------\n\n1. In the Google Cloud console, go to the Document AI **Processors**\n page.\n\n [Go to Processors](https://console.cloud.google.com/ai/document-ai/processors)\n2. [Create a processor](/document-ai/docs/create-processor). Select **Invoice\n parser** as the type of processor.\n\n | **Note:** When you create the processor, copy the processor ID for your plugin configurations.\n\nConfigure the invoice parser plugin\n-----------------------------------\n\n1. In the Google Cloud console, go to the Cloud Data Fusion **Instances**\n page.\n\n [Go to Instances](https://console.cloud.google.com/data-fusion/locations/-/instances)\n2. Ensure that the desired instance has been upgraded to version 6.4.1 or\n later. For earlier versions, [upgrade the instance](/data-fusion/docs/how-to/upgrading).\n\n3. Click **View instance**.\n The Cloud Data Fusion UI opens.\n\n4. Click **Hub**.\n\n5. Click **GCP** , and then deploy **GCP Plugins**.\n\n6. Click **DocAI** , and then deploy the **Doc AI Plugins**.\n\n7. Click the **Invoice Parser Quickstart** \\\u003e **Create**.\n\n8. Customize your pipeline by entering the Invoice Parser processor ID,\n Cloud Storage bucket path, and BigQuery table details.\n\n9. Deploy and run the pipeline.\n\nParsed invoices are stored in the output table in BigQuery.\nMetadata from the invoices is stored in the `Metadata` table and includes\nparsing status, Cloud Storage path, and upload timestamp of the raw\ninvoice. Records in the output and metadata tables can be joined with the\n`invoice_uuid` key."]]