Stay organized with collections
Save and categorize content based on your preferences.
Cloud Search indexes all items that are sent, regardless of file type
(MIME or content-type). Indexing is performed on a file's metadata data and,
if supported, its content. Following is a list of file types for which content
indexing is supported.
Microsoft Word (DOC)
Microsoft Word (DOCX)
Microsoft Excel (XLS)
Microsoft Excel (XLSX)
Microsoft Powerpoint (PPT)
Microsoft Powerpoint (PPTX)
Adobe’s Portable Document Format (PDF)
Rich Text Format (RTF)
Text Format (TXT)
Hypertext Markup Language (HTML)
Extensible Markup Language (XML)
In addition to these file types, Cloud Search supports indexing of content
within any plain text file.
Optical Character Recognition (OCR) file types and characteristics
Google Cloud Search also uses OCR to extract text from the following file types:
File type
Maximum size
Joint Photographic Experts Group (JPG)
10 MB
Graphic Interchange Format (GIF)
10 MB
Tagged Image File Format (TIFF)
10 MB
Scalable Vector Graphics (SVG)
10 MB
PostScript Image Format (PS)
10 MB
Portable Document Format (PDF)
30 MB
OCR also works on files with these characteristics:
Hand-written documents. Documents in Latin script, Japanese, and Korean yield
the best results.
Vertically-written documents, such as those in Japanese.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2024-09-03 UTC."],[[["Cloud Search indexes metadata for all file types and content for specific supported formats like Microsoft Office, PDF, RTF, TXT, HTML, and XML, as well as any plain text file."],["Cloud Search uses Optical Character Recognition (OCR) to extract text from image file types such as JPG, GIF, TIFF, SVG, PS, and PDFs (under certain conditions and size limits)."],["OCR technology in Cloud Search supports various document characteristics, including handwritten documents (Latin, Japanese, Korean), vertically written documents (e.g., Japanese), and right-to-left written documents (e.g., Hebrew)."]]],[]]