Data Machines
...
Models Supported by Data Machi...
NLP Models

Remove HTML

4min

This model removes HTML tags and attributes from a given block of HTML content and provides a plain text output. This model is useful in conjunction with downstream text analysis to extract entities, relationships and other attributes from a text input by removing the HTML characters.

Model Input Parameters

Parameter Name

Parameter Type

Required

input

Text

Yes

Rest API Input Example

JS


Model Output Result

Parameter Name

Parameter Type

html content

Text

plain text

Text

Rest API Output Example

JSON


Standard Output Parameters

Every model execution output consists of the following standard output parameters

  • input
    • The input string required for the model to extract the categories
  • original input
    • This is the input provided to the first step in model which is retained across multiple steps in a Data Machine workflow.
  • final result
    • The result of the model executed in the final step of the Data Machine workflow
  • sessionid
    • A unique session id that is generated for every execution of a Data Machine which can be used to retain results across multiple sessions
  • status
    • The result of the Data Machine execution. If all of the steps in a sequence are successfully executed, a value of "Completed" is provided. If the execution is interrupted at any point, a value of "Terminated" is provided with the reason for Termination.