MMI Labs is working with KNIME and its active community to bring marketing methodologies to its offering, extending its capabilities from its more traditional uses in statistics and general data analysis. To download our nodes, download the KNIME platform. Once KNIME is installed, you can access our public nodes under the community nodes section of the nodes repository.

MMI Nodes

Keyword Density

This node leverages Jsoup to parse HTML content and Lucene to tokenize terms and generate a list of term frequency. You can also configure it to filter out certain keywords that should not be excluded in the report.

 

Clean HTML Retriever

This node takes a URL and retrieves its content (assuming it is in HTML form) for parsing within KNIME. If HTML content is available in another column, it can use the HTML content directly. The content is then parsed and cleaned using HtmlCleaner to output it into XHTML form. The results can be configured to output in either String or XML type to read content from the page for analysis or storage.

 

XML Sitemap Reader

This node accepts a sitemap URL input and produces sitemap entries in an output table. It also supports sitemap index. Note: Sitemap content must be in XML form and compatible with the specifications found in http://www.sitemaps.org/.

 

Google PageSpeed Insight

This node submits URL(s) to Google’s PageSpeed Insights service to pull PageSpeed scores and other metrics provided by the PageSpeed API.

 

Moz Metrics

This node uses Mozscape API Connector to pull link and URL metrics data about specific pages on a domain.

 

Random Data Generator

This node creates a data table with random data based on definitions specified for each field column. Up to 1,000 field columns can be created. The number of rows generated can be configured.

 

Twitter Trends

This node uses Twitter’s API Connector in order to pull trends data. It will first make a call to trends/available to download a list of available cities/countries and populated at the Place drop down. Select a country or city and it will then make a call to trends/place to pull down Twitter Trends data for that specific city or country.

 

W3C CSS Validator

This node submits URL(s) to W3C CSS Validator service in order to evaluate the quality of the CSS source files. It uses W3C’s host http://jigsaw.w3.org/css-validator/validator by default. Note: An unpublicized quota limit on W3C’s host can result in an IP address being banned from W3C’s website. To avoid exceeding the quota limit, MMI advises users to set up their own validator host.

 

W3C HTML Validator

This node submits URL(s) to W3C Nu HTML Validator service to evaluate the quality of the HTML source. It uses W3C’s host https://validator.w3.org/nu/ by default. Note: There is an unpublicized quota limit on W3C’s host. Sending a large batch of URLs through this node to W3C’s host can result in an IP being banned from W3C’s website. To avoid exceeding the quota limit, MMI advises users to set up their own validator host.