你当前正在访问 Microsoft Azure Global Edition 技术文档网站。 如果需要访问由世纪互联运营的 Microsoft Azure 中国技术文档网站,请访问 https://docs.azure.cn。
DocumentWord Class
A word object consisting of a contiguous sequence of characters. For non-space delimited languages, such as Chinese, Japanese, and Korean, each character is represented as its own word.
- Inheritance
-
builtins.objectDocumentWord
Constructor
DocumentWord(**kwargs: Any)
Methods
from_dict |
Converts a dict in the shape of a DocumentWord to the model itself. |
to_dict |
Returns a dict representation of DocumentWord. |
from_dict
Converts a dict in the shape of a DocumentWord to the model itself.
from_dict(data: Dict) -> DocumentWord
Parameters
Name | Description |
---|---|
data
Required
|
A dictionary in the shape of DocumentWord. |
Returns
Type | Description |
---|---|
DocumentWord |
to_dict
Attributes
confidence
Confidence of correctly extracting the word.
confidence: float
content
Text content of the word.
content: str
polygon
Bounding polygon of the word.
polygon: Sequence[Point]
span
Location of the word in the reading order concatenated content.
span: DocumentSpan