Content Conversion (Preview)

A service that allows content to be converted from one format to another.

This connector is available in the following products and regions:

Service | Class | Regions
Logic Apps | Standard | All Logic Apps regions except the following:
     -   Azure China regions
     -   US Department of Defense (DoD)
Power Automate | Standard | All Power Automate regions except the following:
     -   China Cloud operated by 21Vianet
Power Apps | Standard | All Power Apps regions except the following:
     -   China Cloud operated by 21Vianet

Contact
Name | Microsoft
URL | Microsoft LogicApps Support
    | Microsoft Power Automate Support
    | Microsoft Power Apps Support

Connector Metadata
Publisher | Microsoft

Known Issues and Limitations

Unexpected line breaks, incorrect formatting, and missing hyperlinks are by design: the Content Conversion connector does not preserve the formatting of the HTML when it converts to plain text, and it removes the hyperlinks. To work around this issue, try using string functions in expressions to replace the affected content. For more details, please refer to this page.

This connector can no longer be used to create new implementations in US Government (GCC) or US Government (GCC High) regions. Existing implementations remain unaffected.

Element-specific behavior of Content Conversion Service
  • The maximum line length is 80 characters; a line break is inserted after that.
  • For link elements that follow the structure <a href='link'>text</a>, the result becomes text[link]. If text and link are the same, only text will be present.
  • Headers (<h1>, <h2>, etc.) are uppercased.
  • Heading cells (<th>) are uppercased.
  • Empty lines will be trimmed as a space-saving measure.
  • Unordered lists will use * as prefix.
  • There will be 3 spaces between data table columns.
  • There will be 0 empty lines between data table rows.
  • Links with href='#...' will be ignored.
  • New lines \n from the input HTML will be collapsed into a space, like any other HTML whitespace characters.
  • As a result of the above, when the content contains consecutive new line characters \n, the output might contain a blank space between them, for example \n\n \n\n.
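
Two of the rules above can be illustrated with a short Python sketch. It is not the service's implementation; it only mirrors the documented link rule and shows one possible way to clean up the stray space between consecutive new lines in the returned text. The function names render_link and normalize_blank_lines are hypothetical.

```python
import re


def render_link(text: str, href: str) -> str:
    """Mirror the documented link rule: text[link], or just text if both match.

    Illustration only. Links with href='#...' are not modeled here.
    """
    if text == href:
        return text
    return f"{text}[{href}]"


def normalize_blank_lines(plain_text: str) -> str:
    """Remove whitespace-only lines and collapse runs of new lines.

    Assumption: at most one empty line between paragraphs is wanted.
    """
    # Turn "\n \n" (a line containing only spaces or tabs) into "\n\n".
    cleaned = re.sub(r"\n[ \t]+\n", "\n\n", plain_text)
    # Collapse three or more consecutive new lines into exactly two.
    return re.sub(r"\n{3,}", "\n\n", cleaned)
```

A cleanup step like normalize_blank_lines would run on the connector's output, outside the connector itself; inside a flow, the same idea would be expressed with string functions in expressions.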

General Limits

Name | Value
Maximum content length (in MB) | 5
Maximum allowed depth in the HTML DOM tree | 70
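
A caller may want to check a document against these limits before sending it. The following Python sketch is one way to do that under stated assumptions: that the content length is measured in UTF-8 bytes with 1 MB = 1,048,576 bytes, and that depth is counted as the nesting level of open non-void elements. How the service actually measures either value is not documented here, so treat the result as an estimate. The name check_limits is hypothetical.

```python
from html.parser import HTMLParser

MAX_CONTENT_BYTES = 5 * 1024 * 1024  # assumption: 1 MB = 1,048,576 bytes
MAX_DOM_DEPTH = 70

# Void elements never have a closing tag, so they do not add nesting.
VOID_ELEMENTS = {
    "area", "base", "br", "col", "embed", "hr", "img",
    "input", "link", "meta", "source", "track", "wbr",
}


class _DepthTracker(HTMLParser):
    """Track the deepest nesting level of open elements."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.max_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in VOID_ELEMENTS:
            return
        self.depth += 1
        self.max_depth = max(self.max_depth, self.depth)

    def handle_endtag(self, tag):
        if tag in VOID_ELEMENTS:
            return
        if self.depth > 0:
            self.depth -= 1


def check_limits(html: str) -> list:
    """Return a list of human-readable problems; empty if none were found."""
    problems = []
    size = len(html.encode("utf-8"))  # assumption: size is counted in UTF-8 bytes
    if size > MAX_CONTENT_BYTES:
        problems.append(f"content is {size} bytes, limit is {MAX_CONTENT_BYTES}")
    tracker = _DepthTracker()
    tracker.feed(html)
    tracker.close()
    if tracker.max_depth > MAX_DOM_DEPTH:
        problems.append(f"DOM depth is {tracker.max_depth}, limit is {MAX_DOM_DEPTH}")
    return problems
```

For example, check_limits("<p>hi</p>") returns an empty list, while a document with 71 nested <div> elements yields one depth problem.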

Throttling Limits

Name | Calls | Renewal Period
API calls per connection | 600 | 60 seconds
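
A client that issues many conversions over one connection can pace itself to stay under this limit. The sketch below is a minimal client-side sliding-window limiter in Python. It assumes that allowing at most 600 calls in any 60-second window is a safe reading of the limit; whether the service uses a fixed or a sliding renewal window is not stated here. The class name SlidingWindowLimiter is hypothetical.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side guard: at most max_calls calls in any period-second window."""

    def __init__(self, max_calls=600, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self._max_calls = max_calls
        self._period = period
        self._clock = clock    # injectable so the limiter can be tested
        self._sleep = sleep
        self._calls = deque()  # timestamps of recent calls, oldest first

    def acquire(self):
        """Block until another call fits in the window, then record it."""
        while True:
            now = self._clock()
            # Forget calls that have aged out of the window.
            while self._calls and now - self._calls[0] >= self._period:
                self._calls.popleft()
            if len(self._calls) < self._max_calls:
                self._calls.append(now)
                return
            # Wait until the oldest recorded call leaves the window.
            self._sleep(self._period - (now - self._calls[0]))
```

Calling acquire() before each Html to text request on a given connection keeps the client at or below the documented rate; it does not replace handling of throttling responses from the service.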

Actions

Html to text

Converts an HTML document to plain text.

Parameters

Name | Key | Required | Type | Description
Content | Content | True | html | The HTML document to be converted to plain text.

Returns

The plain text content.

Definitions

HtmlToTextResponse

The plain text content.

string