AI
Make AI Content Extractor
22 min
the make ai content extractor app is currently in beta there may be changes to functionality or pricing make ai content extractor is a built in app that extracts structured text and metadata from files like pdfs, word documents, images, and audio recordings—directly within your {{product name}} {{scenario plural lowercase}} it also converts unstructured inputs into a clean format suitable for ai apps, all without relying on third party services make ai content extractor ensures consistent output that integrates seamlessly with ai modules and ai based automations, making it ideal for tasks like pulling invoice fields from a scanned pdf, transcribing a voice note, or analyzing whatsapp image attachments key benefits no external accounts no need to connect or manage a separate account everything works within your existing {{product name}} environment privacy focused your data is never stored or used for ai training instant setup start using the app immediately in your {{pl}} there is no need to create any connection structured output no matter the file type or layout, the app returns your data in the same, perfectly organized format every time credit usage the make ai content extractor app has a specific credit usage for each module you can find the details below extract text from a document this module uses 1 0 credits for each page processed example if you extract text from a document with 3 pages, it costs 30 credits extract information from an invoice this module uses 1 0 credits for each operation example if you extract information from two invoices, the module will complete two operations and use 20 credits extract information from a receipt this module uses 1 0 credits for each operation example if you extract information from two receipts, the module will complete two operations and use 20 credits generate a caption for an image this module uses 2 credits for each operation example if you generate captions for two images, the module will complete two operations and use 4 credits generate captions for an image (advanced) this module uses 2 credits for each operation example if you generate multiple captions for two images, the module will complete two operations and use 4 credits describe an image this module uses 2 credits for each operation example if you describe two images, the module will complete two operations and use 4 credits extract text from an image this module uses 2 credits for each operation example if you extract text from two images, the module will complete two operations and use 4 credits generate image tags this module uses 2 credits for each operation example if you generate tags for two images, the module will complete two operations and use 4 credits detect objects in an image this module uses 2 credits for each operation example if you detect objects in two images, the module will complete two operations and use 4 credits transcribe an audio file this module uses 20 credits for each minute of audio processed example if you transcribe an audio file with 3 5 minutes, it costs 70 credits translate an audio file this module uses 20 credits for each minute of audio processed example if you translate an audio file with 3 5 minutes, it costs 70 credits for more detailed information about credits, refer to the credits documentation make ai content extractor modules all modules support two ways to upload a file via a publicly accessible url you can paste a publicly accessible url directly into the module to extract information from the file mapping a file from your cloud storage if your file is stored on a cloud service like google drive, onedrive, or similar, first use a module to download the file (e g , google drive download a file) next, add a required make ai content extractor module and choose to extract by file you can use the following types of modules to build your {{scenario plural lowercase}} the current features and functionality are still in beta and are actively being improved as such, pricing is subject to change and will be updated as the app evolves document extract text from a document extracts text from any type of document field description document source select how you want to provide the content for extraction file url supported formats pdf , jpeg/jpg , png , bmp , tiff , heif , docx , xlsx , pptx , html limitation for pdf and tiff, up to 2,000 pages can be processed the maximum file size is 500 mb image dimensions must be between 50 x 50 and 10,000 x 10,000 pixels if your pdfs are password locked, you must remove the lock before submission file select or map the file you want to extract data from file name enter (map) the name of the file you want to extract data enter (map) the file data document url enter the publicly accessible url for the document page ranges enter specific page numbers or ranges to extract information from for example 1,3 6 limit enter the maximum number of pages to extract leave this field empty to return all pages in the document if page ranges is defined, this field will be ignored to see the available languages, refer to the microsoft learn language support page example this example shows the output (extracted information) generated from the provided pdf file extract information from an invoice extracts the details from an invoice field description invoice source select how you want to provide the invoice file file url supported formats pdf , jpeg/jpg , png , bmp , tiff , heif , docx , xlsx , pptx , html limitation for pdf and tiff, up to 2,000 pages can be processed the maximum file size is 500 mb image dimensions must be between 50 x 50 and 10,000 x 10,000 pixels if your pdfs are password locked, you must remove the lock before submission file select or map the file you want to extract data from file name enter (map) the name of the file you want to extract data enter (map) the file data invoice url enter the publicly accessible url for the invoice to see the available languages, refer to the microsoft learn language support page example this example shows the output (extracted invoice details) generated from the provided invoice extract information from a receipt extracts the details from a receipt field description extract by select how you want to provide the receipt file url supported formats pdf , jpeg/jpg , png , bmp , tiff , heif , docx , xlsx , pptx , html limitation for pdf and tiff, up to 2,000 pages can be processed the maximum file size is 500 mb image dimensions must be between 50 x 50 and 10,000 x 10,000 pixels if your pdfs are password locked, you must remove the lock before submission file select or map the file you want to extract data from file name enter (map) the name of the file you want to extract data enter (map) the file data receipt url enter the publicly accessible url for the receipt to see the available languages, refer to the microsoft learn language support page example this example shows the output (extracted receipt details) generated from the provided receipt image generate a c aption for an image generates a one sentence caption describing an image's content field description image source select how you want to provide the image file url supported formats jpeg/jpg , png , gif , bmp , webp , ico , tiff , mpo limitation images must be less than 20 mb and have dimensions between 50 x 50 and 16,000 x 16,000 pixels file select or map the file you want to caption file name enter (map) the name of the file you want to caption data enter (map) the file data image url enter a public url to the image you want to caption gender neutral caption select whether to generate captions using gender neutral terms for example, terms such person and child will be used instead of woman or man and boy or girl in english example this example shows the output (image caption) generated from the provided image url generate c aptions for an image (advanced) generates up to 10 captions describing different parts of an image each caption will be a separate bundle and processed individually in the rest of the scenario field description image source select how you want to provide the image file url supported formats jpeg/jpg , png , gif , bmp , webp , ico , tiff , mpo limitation images must be less than 20 mb and have dimensions between 50 x 50 and 16,000 x 16,000 pixels file select or map the file you want to caption file name enter the name of the file you want to caption data enter (map) the file data image url enter a public url to the image you want to caption gender neutral caption select whether to generate captions using gender neutral terms for example, terms such person and child will be used instead of woman or man and boy or girl in english example this example shows the output (image captions) generated from the provided image url describe an image describes an image in detail field description image source select how you want to provide the image file url limitation images must be less than 4 mb and have dimensions smaller than 33 megapixels file select or map the file you want to describe file name enter (map) the name of the file you want to caption data enter (map) the file data image url enter a public url to the image you want to caption the file size of the image must be less than 20 mb the dimensions of the image must be less than 33 megapixels prompt enter custom instructions to guide how the ai should describe the image for example, specify the writing style of the image description or provide background context temperature adjust how creative or predictable the response is lower values make the output more focused, while higher values make it more diverse must be lower than or equal to 1 top p limit the response to the most likely words a lower value makes the output more focused by narrowing the range of possible words must be lower than or equal to 1 example this example shows the output (image description) generated from the provided image url extract text from an image extracts printed or handwritten text from an image for other formats, use the extract text from a document module instead field description image source select how you want to provide the image file url s upported formats jpeg/jpg , png , gif , bmp , webp , ico , tiff , mpo limitation images must be less than 20 mb and have dimensions between 50 x 50 and 16,000 x 16,000 pixels supported formats include file select or map the file you want to describe file name enter (map) the name of the image you want to extract text from data enter (map) the file data image url enter a public url to the image you want to extract text from limit enter the maximum number of text blocks to return leave this empty to return all recognized text blocks in the image example this example shows the output (text) generated from the provided image url generate image tags generates a list of words related to the image field description image source select how you want to provide the image file url supported formats jpeg/jpg , png , gif , bmp , webp , ico , tiff , mpo limitation images must be less than 20 mb and have dimensions between 50 x 50 and 16,000 x 16,000 pixels file select or map the image you want to receive tags from file name enter (map) the name of the image you want to receive tags from data enter (map) the file data image url enter a public url to the image you want to receive tags from limit enter the maximum number of returned tags leave this field empty to return all recognized tags for the image example this example shows the output (image tags) generated from the provided image url detect objects in an image detects different objects in an image, including their approximate location within the image field description image source select how you want to provide the image file url supported formats jpeg/jpg , png , gif , bmp , webp , ico , tiff , mpo limitation images must be less than 20 mb and have dimensions between 50 x 50 and 16,000 x 16,000 pixels file select or map the image you want to detect objects from file name enter (map) the name of the image you want to detect objects from data enter (map) the file data image url enter a public url to the image you want to detect objects from ignore duplicate objects select whether you want to ignore duplicate objects in the output limit enter the maximum number of returned tags leave this empty to return all recognized objects on the image provided example this example shows the output (objects in an image) generated from the provided image url speech transcribe an audio file transcribes an audio file field description audio source select how you want to provide the audio file url supported formats wav , mp3 , opus/ogg , flac , wma , aac , alaw in wav container, mulaw in wav container, amr , webm , speex limitation audios must be less than 2 hours long and 300 mb in size file select or map the audio file you want to transcribe file name enter (map) the name of the audio file you want to transcribe data enter (map) the file data language select one or more expected languages for the audio profanity filter mode select how the transcription handles profane language channels enter (map) the indices of audio channels to transcribe separately up to two channels are supported unless diarization is enabled by default, this tool merges all input channels into a single channel and then performs the transcription if this isn't desirable, channels can be transcribed independently without merging diarization enable this to let the tool identify and distinguish between different speakers if you enable diarization, you can also set the expected maximum number of speakers example this example shows the output (audio transcript) generated from the provided audio url translate an audio file translates an audio file to english field description audio source select how you want to provide the audio file url supported formats flac , mp3 , mp4 , mpeg , mpga , m4a , ogg , wav , webm limitation audios must be less than 25 mb in size file select or map the audio file you want to translate file name enter (map) the name of the audio file you want to translate data enter (map) the file data audio url enter a public url to the audio you want to translate file name the file name is taken from the url by default if the original name is missing a supported audio extension (like mp3 or wav ) or causes an error, please enter a new name to help our ai correctly identify the file type prompt enter a prompt to guide the model's style or specify how to spell unfamiliar words this is limited to 224 tokens temperature enter the randomness of the translation it controls the randomness of the response and the likeliness of the ai selecting less probable words or phrases for translations, we recommend the default value of 0 must be lower than or equal to 1 example this example shows the output (translation) generated from the provided audio url templates you can look for make ai content extractor templates in make's template gallery , where you'll find thousands of pre created {{scenario plural lowercase}}