Mail merge with the Google Docs API

This guide explains how to use the Google Docs API to perform a mail merge.

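Introduction

A mail merge takes values from the rows of a spreadsheet or other data source and inserts them into a template document. This lets you maintain a single primary document (the template) and generate many similar documents from it, each customized with the merged data. The result isn't necessarily used for mail or form letters; it can serve any purpose, such as generating a batch of customer invoices.

Mail merge has existed for as long as there have been spreadsheets and word processors, and it is part of many business workflows today. The convention is to organize the data as one record per row, with the columns representing the fields of each record, as shown in the following table: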

    Name       Address        Zone
1   UrbanPq    123 1st St.    West
2   Pawxana    456 2nd St.
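As a minimal sketch of this convention, the following Python snippet pairs each row with the header so that every record becomes a dictionary of field names and values. The values come from the table above; the `rows_to_records` helper is a hypothetical name used for illustration and is not part of the sample app.

```python
def rows_to_records(header, rows):
  """Pairs each row with the header so every record is a field->value dict."""
  return [dict(zip(header, row)) for row in rows]


header = ["Name", "Address", "Zone"]
rows = [
    ["UrbanPq", "123 1st St.", "West"],
    ["Pawxana", "456 2nd St."],  # a short row simply lacks the trailing field
]
records = rows_to_records(header, rows)
print(records[0]["Zone"])
```

Because `zip` stops at the shorter sequence, a row with fewer cells than the header yields a record without the trailing keys rather than raising an error.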

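The sample app on this page shows how you can use the Google Docs, Sheets, and Drive APIs to abstract away the details of how a mail merge is performed, shielding users from implementation concerns. More information about this Python sample is available in the sample's GitHub repo.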

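Sample application

This sample app copies your primary template and then merges values from your designated data source into each of the copies. To try the sample app, first set up your template: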

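  1. Create a Docs file. Choose the template you want to use.
  2. Note the document ID of the new file. For more information, see Document ID.
  3. Set the DOCS_FILE_ID variable to the document ID.
  4. Replace the contact information with template placeholder variables that the app will merge with the selected data.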
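If you have the document's URL rather than its ID, a small helper can pull the ID out for step 3. This is a sketch that assumes the URL has the shape docs.google.com/document/d/<ID>/..., which is the same shape the sample app prints for merged letters; `extract_document_id` is a hypothetical helper, not part of the sample.

```python
import re

# Assumption: the ID is the path segment that follows /document/d/.
_DOC_ID_RE = re.compile(r"/document/d/([a-zA-Z0-9_-]+)")


def extract_document_id(url):
  """Returns the document ID embedded in a Docs URL, or None if absent."""
  match = _DOC_ID_RE.search(url)
  return match.group(1) if match else None


url = (
    "https://docs.google.com/document/d/"
    "195j9eDD3ccgjQRttHhJPymLJUCOUjs-jmwTrekvdjFE/edit"
)
DOCS_FILE_ID = extract_document_id(url)
print(DOCS_FILE_ID)
```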

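The sample letter template contains placeholders that can be merged with real data from a source such as plain text or Sheets. In the template, each placeholder is an uppercase variable name wrapped in double curly braces, such as {{TO_NAME}}.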
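Before running a merge, it can help to check that every placeholder in the template has a value to fill it. The following sketch works on plain text that you paste in from the template; it assumes placeholders are uppercase names in double curly braces, and both `find_placeholders` and the sample template text are hypothetical.

```python
import re

# Assumption: placeholders look like {{NAME}}, using uppercase letters,
# digits, and underscores.
PLACEHOLDER_RE = re.compile(r"\{\{([A-Z0-9_]+)\}\}")


def find_placeholders(text):
  """Returns the set of placeholder names found in text, lowercased."""
  return {name.lower() for name in PLACEHOLDER_RE.findall(text)}


# Illustrative template text, not the real sample template.
template_text = (
    "{{DATE}}\n\nDear {{TO_NAME}},\n\n{{BODY}}\n\nSincerely,\n{{MY_NAME}}"
)
available = {"date", "to_name", "body"}  # keys you plan to supply
missing = find_placeholders(template_text) - available
print(sorted(missing))
```

Any name left in `missing` would remain as a literal placeholder in the merged document.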

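Next, choose either plain text or Sheets as your data source by setting the SOURCE variable. The sample defaults to plain text, in which case the data comes from the TEXT_SOURCE_DATA variable. To read data from Sheets instead, set the SOURCE variable to 'sheets' and point the app at our sample sheet (or your own) by setting the SHEETS_FILE_ID variable.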
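The sample's text source notes that it can be altered to read from a CSV file. One possible shape for such a source is sketched below. The `_get_csv_data` function is hypothetical; to wire it into the sample you would also need to add the new source name to SOURCES and decide how the file is passed in, since the sample calls its data functions without arguments.

```python
import csv
import io

COLUMNS = ["to_name", "to_title", "to_company", "to_address"]


def _get_csv_data(csv_file):
  """Returns data rows from an open CSV file, skipping the header row."""
  reader = csv.reader(csv_file)
  next(reader, None)  # drop the header row, as the Sheets source does
  return [tuple(row) for row in reader]


# In-memory stand-in for a file opened with open(path, newline="").
sample = io.StringIO(
    "to_name,to_title,to_company,to_address\n"
    'Ms. Lara Brown,Googler,Google NYC,"111 8th Ave\nNew York, NY  10011-5201"\n'
)
rows = _get_csv_data(sample)
print(dict(zip(COLUMNS, rows[0])))
```

Quoting the address field lets it keep its embedded line break, matching the multi-line addresses in TEXT_SOURCE_DATA.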

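The sheet starts with a header row, followed by one row per recipient. The app drops the header row and reads the columns in the order given by COLUMNS: to_name, to_title, to_company, to_address.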

Try the app with our sample data, then adapt it to your own data and use case. The command-line application works as follows:

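  • Setup
  • Fetch the data from the data source
  • Loop through each row of data
    • Create a copy of the template
    • Merge the copy with the data
    • Output a link to the newly merged document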
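The control flow in the list above can be sketched offline before any API is involved. In the snippet below, every `fake_*` function is a hypothetical stand-in for the corresponding Drive, Docs, or data-source step, and the file IDs are made up; only the loop structure mirrors the sample app.

```python
import itertools

COLUMNS = ["to_name", "to_title", "to_company", "to_address"]
_ids = itertools.count(1)


def fake_get_data():
  """Stand-in for fetching rows from the data source."""
  return [
      ("Ms. Lara Brown", "Googler", "Google NYC", "111 8th Ave"),
      ("Mr. Jeff Erson", "Googler", "Google NYC", "76 9th Ave"),
  ]


def fake_copy_template():
  """Stand-in for the Drive copy step; returns a made-up file ID."""
  return f"fake-copy-{next(_ids)}"


def fake_merge(copy_id, values):
  """Stand-in for the Docs merge step; records which keys would be merged."""
  return {"id": copy_id, "replaced": sorted(values)}


def run_merge():
  merge = {"my_name": "Ayme A. Coder"}  # setup: values shared by all letters
  results = []
  for row in fake_get_data():  # loop through each row of data
    merge.update(dict(zip(COLUMNS, row)))
    copy_id = fake_copy_template()  # create a copy of the template
    results.append(fake_merge(copy_id, merge))  # merge the copy with the data
    print(f"Merged letter: {copy_id}")  # output a reference to the new copy
  return results


results = run_merge()
```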

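All newly merged letters also appear in the user's My Drive. In a merged letter, each placeholder is replaced by the corresponding value from the data source.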
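To get a rough idea of what a merged letter will contain without calling any API, you can apply the same substitution to a plain string. This is only a local approximation: `render_preview` and the template text are hypothetical, and the real merge is performed by the Docs API on the document itself.

```python
def render_preview(template_text, values):
  """Replaces each {{KEY}} placeholder in a plain string with its value."""
  for key, value in values.items():
    template_text = template_text.replace("{{%s}}" % key.upper(), str(value))
  return template_text


# Illustrative template text, not the real sample template.
template_text = "Dear {{TO_NAME}},\n\n{{BODY}}\n\nSincerely,\n{{MY_NAME}}"
preview = render_preview(
    template_text,
    {"to_name": "Ms. Lara Brown", "body": "Hello.", "my_name": "Ayme A. Coder"},
)
print(preview)
```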

Source code

Python

docs/mail-merge/docs_mail_merge.py
import time

import google.auth
from googleapiclient.discovery import build
from googleapiclient.errors import HttpError

# Fill-in IDs of your Docs template & any Sheets data source
DOCS_FILE_ID = "195j9eDD3ccgjQRttHhJPymLJUCOUjs-jmwTrekvdjFE"
SHEETS_FILE_ID = "11pPEzi1vCMNbdpqaQx4N43rKmxvZlgEHE9GqpYoEsWw"

# authorization constants

SCOPES = (  # iterable or space-delimited string
    "https://www.googleapis.com/auth/drive",
    "https://www.googleapis.com/auth/documents",
    "https://www.googleapis.com/auth/spreadsheets.readonly",
)

# application constants
SOURCES = ("text", "sheets")
SOURCE = "text"  # Choose one of the data SOURCES
COLUMNS = ["to_name", "to_title", "to_company", "to_address"]
TEXT_SOURCE_DATA = (
    (
        "Ms. Lara Brown",
        "Googler",
        "Google NYC",
        "111 8th Ave\nNew York, NY  10011-5201",
    ),
    (
        "Mr. Jeff Erson",
        "Googler",
        "Google NYC",
        "76 9th Ave\nNew York, NY  10011-4962",
    ),
)

# fill-in your data to merge into document template variables
merge = {
    # sender data
    "my_name": "Ayme A. Coder",
    "my_address": "1600 Amphitheatre Pkwy\nMountain View, CA  94043-1351",
    "my_email": "http://google.com",
    "my_phone": "+1-650-253-0000",
    # - - - - - - - - - - - - - - - - - - - - - - - - - -
    # recipient data (supplied by 'text' or 'sheets' data source)
    "to_name": None,
    "to_title": None,
    "to_company": None,
    "to_address": None,
    # - - - - - - - - - - - - - - - - - - - - - - - - - -
    "date": time.strftime("%Y %B %d"),
    # - - - - - - - - - - - - - - - - - - - - - - - - - -
    "body": (
        "Google, headquartered in Mountain View, unveiled the new "
        "Android phone at the Consumer Electronics Show. CEO Sundar "
        "Pichai said in his keynote that users love their new phones."
    ),
}

creds, _ = google.auth.default()
# pylint: disable=maybe-no-member

# service endpoints to Google APIs

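# Drive API v3 is used because _copy_template() sets the copy's "name" field;
# in v2 the equivalent field is "title".
DRIVE = build("drive", "v3", credentials=creds)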
DOCS = build("docs", "v1", credentials=creds)
SHEETS = build("sheets", "v4", credentials=creds)


def get_data(source):
  """Gets mail merge data from chosen data source."""
  if source not in SOURCES:
    raise ValueError(
        f"ERROR: unsupported source {source}; choose from {SOURCES}"
    )
  try:
    return SAFE_DISPATCH[source]()
  except HttpError as error:
    print(f"An error occurred: {error}")
    return []  # no rows, so the caller's loop simply does nothing


def _get_text_data():
  """(private) Returns plain text data; can alter to read from CSV file."""
  return TEXT_SOURCE_DATA


def _get_sheets_data(service=SHEETS):
  """(private) Returns data from Google Sheets source. It gets all rows of
  'Sheet1' (the default Sheet in a new spreadsheet), but drops the first
  (header) row. Use any desired data range (in standard A1 notation).
  """
  rows = (
      service.spreadsheets()
      .values()
      .get(spreadsheetId=SHEETS_FILE_ID, range="Sheet1")
      .execute()
      .get("values", [])  # an empty range has no "values" key
  )
  return rows[1:]  # skip header row


# data source dispatch table [better alternative vs. eval()]
SAFE_DISPATCH = {k: globals().get(f"_get_{k}_data") for k in SOURCES}


def _copy_template(tmpl_id, source, service):
  """(private) Copies letter template document using Drive API then
  returns file ID of (new) copy.
  """
  # Any HttpError propagates to the caller, which reports it; this avoids
  # handing back an error object where a file ID is expected.
  body = {"name": f"Merged form letter ({source})"}
  return (
      service.files()
      .copy(body=body, fileId=tmpl_id, fields="id")
      .execute()
      .get("id")
  )


def merge_template(tmpl_id, source, service):
  """Copies template document and merges data into newly-minted copy then
  returns its file ID.
  """
  try:
    # copy template and set context data struct for merging template values
    copy_id = _copy_template(tmpl_id, source, service)
    context = merge.items()

    # "search & replace" API requests for mail merge substitutions
    reqs = [
        {
            "replaceAllText": {
                "containsText": {
                    "text": "{{%s}}" % key.upper(),  # {{VARS}} are uppercase
                    "matchCase": True,
                },
                "replaceText": value,
            }
        }
        for key, value in context
    ]

    # send requests to Docs API to do actual merge
    DOCS.documents().batchUpdate(
        body={"requests": reqs}, documentId=copy_id, fields=""
    ).execute()
    return copy_id
  except HttpError as error:
    print(f"An error occurred: {error}")
    return None


if __name__ == "__main__":
  # get row data, then loop through & process each form letter
  data = get_data(SOURCE)  # get data from data source
  for i, row in enumerate(data):
    merge.update(dict(zip(COLUMNS, row)))
    doc_id = merge_template(DOCS_FILE_ID, SOURCE, DRIVE)
    if doc_id is None:
      continue  # merge failed; the error was already printed
    print(
        "Merged letter %d: docs.google.com/document/d/%s/edit"
        % (i + 1, doc_id)
    )
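The sample passes fields="" to batchUpdate, so it receives no details about the replacements. If you omit that argument, the response should include one reply per request, and a replaceAllText reply can carry an occurrencesChanged count. The sketch below uses those replies to list placeholders that matched nothing in the template. It assumes replies arrive in request order and that a missing count means zero; `unmatched_placeholders` is a hypothetical helper, and the response shown is hand-written rather than real API output.

```python
def unmatched_placeholders(reqs, response):
  """Returns placeholder texts whose request reported zero replacements."""
  unmatched = []
  for req, reply in zip(reqs, response.get("replies", [])):
    # Assumption: an absent occurrencesChanged field means no matches.
    changed = reply.get("replaceAllText", {}).get("occurrencesChanged", 0)
    if not changed:
      unmatched.append(req["replaceAllText"]["containsText"]["text"])
  return unmatched


reqs = [
    {"replaceAllText": {"containsText": {"text": "{{TO_NAME}}"},
                        "replaceText": "Ms. Lara Brown"}},
    {"replaceAllText": {"containsText": {"text": "{{TO_FAX}}"},
                        "replaceText": "n/a"}},
]
# Hand-written stand-in for a batchUpdate response.
response = {
    "replies": [
        {"replaceAllText": {"occurrencesChanged": 1}},
        {"replaceAllText": {}},
    ]
}
print(unmatched_placeholders(reqs, response))
```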

For more information, see the README file and the full application source code in the sample app's GitHub repo.