xml

Milestone: 1

XML filter. Takes a field that contains XML and expands it into an actual datastructure.

Synopsis

This is what it might look like in your config file:
filter {
  xml {
    add_field => ... # hash (optional), default: {}
    add_tag => ... # array (optional), default: []
    remove_field => ... # array (optional), default: []
    remove_tag => ... # array (optional), default: []
    source => ... # string (optional)
    store_xml => ... # boolean (optional), default: true
    target => ... # string (optional)
    xpath => ... # hash (optional), default: {}
  }
}

Details

add_field

  • Value type is hash
  • Default value is {}

If this filter is successful, add any arbitrary fields to this event. Field names can be dynamic and include parts of the event using the %{field} Example:

filter {
  xml {
    add_field => { "foo_%{somefield}" => "Hello world, from %{host}" }
  }
}

# You can also add multiple fields at once:

filter {
  xml {
    add_field => { 
      "foo_%{somefield}" => "Hello world, from %{host}"
      "new_field" => "new_static_value"
    }
  }
}

If the event has field “somefield” == “hello” this filter, on success, would add field “foo_hello” if it is present, with the value above and the %{host} piece replaced with that value from the event. The second example would also add a hardcoded field.

add_tag

  • Value type is array
  • Default value is []

If this filter is successful, add arbitrary tags to the event. Tags can be dynamic and include parts of the event using the %{field} syntax. Example:

filter {
  xml {
    add_tag => [ "foo_%{somefield}" ]
  }
}

# You can also add multiple tags at once:
filter {
  xml {
    add_tag => [ "foo_%{somefield}", "taggedy_tag"]
  }
}

If the event has field “somefield” == “hello” this filter, on success, would add a tag “foo_hello” (and the second example would of course add a “taggedy_tag” tag).

exclude_tags DEPRECATED

  • DEPRECATED WARNING: This config item is deprecated. It may be removed in a further version.
  • Value type is array
  • Default value is []

Only handle events without all/any (controlled by exclude_any config option) of these tags. Optional.

remove_field

  • Value type is array
  • Default value is []

If this filter is successful, remove arbitrary fields from this event. Fields names can be dynamic and include parts of the event using the %{field} Example:

filter {
  xml {
    remove_field => [ "foo_%{somefield}" ]
  }
}

# You can also remove multiple fields at once:

filter {
  xml {
    remove_field => [ "foo_%{somefield}" "my_extraneous_field" ]
  }
}

If the event has field “somefield” == “hello” this filter, on success, would remove the field with name “foo_hello” if it is present. The second example would remove an additional, non-dynamic field.

remove_tag

  • Value type is array
  • Default value is []

If this filter is successful, remove arbitrary tags from the event. Tags can be dynamic and include parts of the event using the %{field} syntax. Example:

filter {
  xml {
    remove_tag => [ "foo_%{somefield}" ]
  }
}

# You can also remove multiple tags at once:

filter {
  xml {
    remove_tag => [ "foo_%{somefield}", "sad_unwanted_tag"]
  }
}

If the event has field “somefield” == “hello” this filter, on success, would remove the tag “foo_hello” if it is present. The second example would remove a sad, unwanted tag as well.

source

  • Value type is string
  • There is no default value for this setting.

Config for xml to hash is:

source => source_field

For example, if you have the whole xml document in your @message field:

filter {
  xml {
    source => "message"
  }
}

The above would parse the xml from the @message field

store_xml

  • Value type is boolean
  • Default value is true

By default the filter will store the whole parsed xml in the destination field as described above. Setting this to false will prevent that.

tags DEPRECATED

  • DEPRECATED WARNING: This config item is deprecated. It may be removed in a further version.
  • Value type is array
  • Default value is []

Only handle events with all/any (controlled by include_any config option) of these tags. Optional.

target

  • Value type is string
  • There is no default value for this setting.

Define target for placing the data

for example if you want the data to be put in the ‘doc’ field:

filter {
  xml {
    target => "doc"
  }
}

XML in the value of the source field will be expanded into a datastructure in the “target” field. Note: if the “target” field already exists, it will be overridden Required

type DEPRECATED

  • DEPRECATED WARNING: This config item is deprecated. It may be removed in a further version.
  • Value type is string
  • Default value is ""

Note that all of the specified routing options (type,tags.exclude_tags,include_fields,exclude_fields) must be met in order for the event to be handled by the filter. The type to act on. If a type is given, then this filter will only act on messages with the same type. See any input plugin’s “type” attribute for more. Optional.

xpath

  • Value type is hash
  • Default value is {}

xpath will additionally select string values (.to_s on whatever is selected) from parsed XML (using each source field defined using the method above) and place those values in the destination fields. Configuration:

xpath => [ “xpath-syntax”, “destination-field” ]

Values returned by XPath parsring from xpath-synatx will be put in the destination field. Multiple values returned will be pushed onto the destination field as an array. As such, multiple matches across multiple source fields will produce duplicate entries in the field

More on xpath: http://www.w3schools.com/xpath/

The xpath functions are particularly powerful: http://www.w3schools.com/xpath/xpath_functions.asp


This is documentation from lib/logstash/filters/xml.rb