- Print
- DarkLight
- PDF
parser_type
usage: required
The parser_type tells the Tag.bio system how to instantiate a data loading function. There are 40+ options for parser_type
. The most commonly used are categorical
and numeric
, which share their designation with the two primary data types represented within the Tag.bio system.
{
"parser_type": "ptpt",
...
}
table_alias
usage: required
The table_alias
attribute tells the system how to assign this parser to a table object as a data loading function for its source data table.
{
"parser_type": "ptpt",
"table_alias": "tata",
...
}
collection
usage: required
The collection
attribute will rename the data processed by a parser to something more useful and human-readable. It is best practice to create legible names, but, if a collection
is not specified, the collection
name will be automatically assigned the column
name.
{
"parser_type": "ptpt",
"table_alias": "tata",
"column": "cccc",
"collection": "ccc1"
}
nesting a parser
It's possible to specify an inner parser as the value of the collection
or variable
attributes. Doing so will define the collection
or variable
names by values in a different column.
{
"parser_type": "ptpt",
"table_alias": "tata",
"column": "cccc",
"collection": {
"parser_type": "ptpt",
"column": "ccc1"
}
}
{
"parser_type": "ptpt",
"table_alias": "tata",
"column": "cccc",
"collection": "ccc1",
"variable": {
"parser_type": "ptpt",
"column": "ccc2"
}
}
where
usage: optional
The where
utilizes a conditional parser to determine which rows should be processed by the parent parser.
Rows that fail to evaluate as true will be ignored.
{
"parser_type": "ptpt",
"table_alias": "tata"
"where": {
"parser_type": "categorical-match",
"column": "city",
"operator": "=",
"value": "San Francisco"
},
...
}
null_indicators
usage: optional
The null_indicators
attribute will specify which values, if found in the source data, will be considered as null.
Values in the array can be categorical
or numeric
.
{
"parser_type": "ptpt",
"table_alias": "tata",
"column": "cccc",
"collection": "ccc1",
"null_indicators": [
"iiii",
####
]
}
null_value
usage: optional
This null_value
attribute is used to replace any null values with a categorical
, numeric
, or nested parser you specify.
{
"parser_type": "tttt",
"table_alias": "tata",
"column": "cccc",
"collection": "ccc1",
"null_indicators": [
"iiii",
####
],
"null_value": "unavailable"
}
annotation
usage: optional
The annotation
will specify an array of inner parsers which will parse other columns to annotate the variables generated by the parent parser.
{
{
"parser_type": "tttt",
"table_alias": "tata",
"column": "cccc",
"collection": "ccc1",
"annotation": {
"parser_type": "tttt",
"column": "ccc2"
}
}
}
groups
usage: optional
When groups
is specified, it will indicate to the system that all collections
and variables
produced by the parser will only be accessible for authorized user groups.
{
"parser_type": "tttt",
"table_alias": "tata",
"column": "cccc",
"collection": "ccc1",
"groups": [
"admin"
]
}