MMI Labs is working with KNIME and its active community to bring marketing methodologies to the platform, extending it beyond its more traditional uses in statistics and general data analysis. To use our nodes, first download and install the KNIME platform. Once KNIME is installed, you can access our public nodes under the community nodes section of the node repository.
This node leverages Jsoup to parse HTML content and Lucene to tokenize terms and generate a list of term frequencies. You can also configure it to filter out keywords that should not be included in the report.
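To make the parse, tokenize, count, and filter steps concrete, here is a minimal sketch in Python. It is not the node's implementation: the node uses Jsoup and Lucene, while this sketch substitutes the standard library's `html.parser` and a simple regular-expression tokenizer. The names `TextExtractor`, `term_frequencies`, and `excluded` are hypothetical.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the bodies of <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def term_frequencies(html, excluded=()):
    """Return a Counter of lowercase terms, omitting the excluded keywords."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    # Simplified tokenizer (an assumption): runs of ASCII letters and digits.
    terms = re.findall(r"[a-z0-9]+", text)
    blocked = {word.lower() for word in excluded}
    return Counter(term for term in terms if term not in blocked)


sample = (
    "<html><body><h1>Data Analysis</h1>"
    "<p>Marketing data and the analysis of data.</p>"
    "<script>var data = 1;</script></body></html>"
)
counts = term_frequencies(sample, excluded={"and", "the", "of"})
for term, count in counts.most_common():
    print(term, count)
```

A real tokenizer such as Lucene's also handles stemming, Unicode, and language-specific rules, which this sketch leaves out.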
Clean HTML Retriever
This node takes a URL and retrieves its content (assuming it is in HTML form) for parsing within KNIME. If HTML content is available in another column, it can use the HTML content directly. The content is then parsed and cleaned using HtmlCleaner to output it into XHTML form. The results can be configured to output in either String or XML type to read content from the page for analysis or storage.
XML Sitemap Reader
This node accepts a sitemap URL input and produces sitemap entries in an output table. It also supports sitemap index. Note: Sitemap content must be in XML form and compatible with the specifications found in http://www.sitemaps.org/.
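As an illustration of how sitemap entries map to table rows, the following sketch parses sitemap XML with Python's standard library. It is an assumption-laden stand-in, not the node's code: the function name `read_sitemap` and the row layout are hypothetical, the input is an in-memory string rather than a fetched URL, and entries of a sitemap index are returned as rows without following them to the child sitemaps.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def read_sitemap(xml_text):
    """Return (kind, rows), where kind is 'urlset' or 'sitemapindex'."""
    root = ET.fromstring(xml_text)
    kind = root.tag.rsplit("}", 1)[-1]  # drop the "{namespace}" prefix
    if kind == "urlset":
        entry_tag = "sm:url"
    elif kind == "sitemapindex":
        entry_tag = "sm:sitemap"
    else:
        raise ValueError(f"unexpected root element: {kind}")

    fields = ("loc", "lastmod", "changefreq", "priority")
    rows = []
    for entry in root.findall(entry_tag, NS):
        row = {}
        for field in fields:
            value = entry.findtext(f"sm:{field}", default=None, namespaces=NS)
            row[field] = value.strip() if value else None
        rows.append(row)
    return kind, rows


sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>http://www.example.com/</loc>
    <lastmod>2016-01-01</lastmod>
    <priority>0.8</priority>
  </url>
  <url>
    <loc>http://www.example.com/about</loc>
  </url>
</urlset>"""

kind, rows = read_sitemap(sample)
for row in rows:
    print(row)
```

Optional elements that are absent from an entry come back as `None`, which corresponds to a missing value in the output table.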
Google PageSpeed Insight
This node submits URL(s) to Google’s PageSpeed Insights service to pull PageSpeed scores and other metrics provided by the PageSpeed API.
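For readers who want to see what such a request can look like, here is a rough sketch that builds a PageSpeed Insights request URL and reads a score from a response. It assumes the v5 endpoint and its Lighthouse-based response shape, which may differ from the API version the node was built against. The helper names and the sample response are hypothetical, and no network call is made.

```python
from urllib.parse import urlencode

# Assumption: the v5 endpoint of the PageSpeed Insights API.
PAGESPEED_ENDPOINT = "https://www.googleapis.com/pagespeedonline/v5/runPagespeed"


def build_request_url(page_url, strategy="mobile", api_key=None):
    """Build the GET URL for one page; api_key is optional."""
    params = {"url": page_url, "strategy": strategy}
    if api_key:
        params["key"] = api_key
    return PAGESPEED_ENDPOINT + "?" + urlencode(params)


def performance_score(response):
    """Return the performance score on a 0-100 scale, or None if absent."""
    score = (
        response.get("lighthouseResult", {})
        .get("categories", {})
        .get("performance", {})
        .get("score")
    )
    return None if score is None else round(score * 100)


request_url = build_request_url("https://www.example.com/", strategy="desktop")
print(request_url)

# Hand-written, heavily trimmed stand-in for a response (illustrative only).
sample_response = {"lighthouseResult": {"categories": {"performance": {"score": 0.87}}}}
print(performance_score(sample_response))
```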
This node uses the Mozscape API Connector to pull link and URL metrics for specific pages on a domain.
Random Data Generator
This node creates a data table with random data based on definitions specified for each field column. Up to 1,000 field columns can be created. The number of rows generated can be configured.
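The idea of per-column definitions driving random output can be sketched as follows. This is only an illustration: the definition format (`type`, `min`, `max`, `length`, `values`) and the function names are hypothetical and do not reflect the node's actual configuration dialog. The 1,000-column limit mirrors the one described above.

```python
import random
import string

MAX_COLUMNS = 1000  # limit stated in the node description


def random_value(rng, spec):
    """Draw one value according to a column definition."""
    kind = spec["type"]
    if kind == "int":
        return rng.randint(spec["min"], spec["max"])
    if kind == "float":
        return rng.uniform(spec["min"], spec["max"])
    if kind == "string":
        letters = string.ascii_lowercase
        return "".join(rng.choice(letters) for _ in range(spec["length"]))
    if kind == "choice":
        return rng.choice(spec["values"])
    raise ValueError(f"unknown column type: {kind}")


def generate_table(columns, row_count, seed=None):
    """Return a list of rows, each a dict keyed by column name."""
    if len(columns) > MAX_COLUMNS:
        raise ValueError("too many columns")
    rng = random.Random(seed)  # a fixed seed makes the table reproducible
    return [
        {name: random_value(rng, spec) for name, spec in columns.items()}
        for _ in range(row_count)
    ]


columns = {
    "age": {"type": "int", "min": 18, "max": 65},
    "spend": {"type": "float", "min": 0.0, "max": 500.0},
    "code": {"type": "string", "length": 6},
    "channel": {"type": "choice", "values": ["search", "social", "email"]},
}
table = generate_table(columns, row_count=5, seed=42)
for row in table:
    print(row)
```

Passing a seed is a design choice for this sketch; it keeps test data stable between runs.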
This node uses Twitter’s API Connector to pull trends data. It first calls trends/available to download a list of available cities and countries, which populates the Place drop-down. After you select a country or city, it calls trends/place to pull Twitter Trends data for that location.
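The two-step flow (list places, then request trends for one place) can be sketched as below. It assumes the Twitter REST API v1.1 paths and the WOEID-based `id` parameter, and it omits the authentication those calls require. The helper names are hypothetical, and the `available` list is a hand-written, trimmed stand-in for a trends/available response.

```python
from urllib.parse import urlencode

# Assumption: base URL of the Twitter REST API v1.1.
API_BASE = "https://api.twitter.com/1.1"


def place_options(available):
    """Map a drop-down label to the WOEID that trends/place expects."""
    options = {}
    for place in available:
        label = place["name"]
        # Qualify towns with their country to keep labels unambiguous.
        if place["placeType"]["name"] == "Town" and place.get("country"):
            label = f"{place['name']}, {place['country']}"
        options[label] = place["woeid"]
    return dict(sorted(options.items()))


def trends_place_url(woeid):
    """Build the trends/place request URL for one selected place."""
    return f"{API_BASE}/trends/place.json?" + urlencode({"id": woeid})


# Illustrative subset of a trends/available response.
available = [
    {"name": "Worldwide", "placeType": {"name": "Supername"}, "country": "", "woeid": 1},
    {"name": "United States", "placeType": {"name": "Country"},
     "country": "United States", "woeid": 23424977},
    {"name": "New York", "placeType": {"name": "Town"},
     "country": "United States", "woeid": 2459115},
]

options = place_options(available)
selected = "New York, United States"
print(trends_place_url(options[selected]))
```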
W3C CSS Validator
This node submits URL(s) to the W3C CSS Validator service to evaluate the quality of the CSS source files. It uses W3C’s host http://jigsaw.w3.org/css-validator/validator by default. Note: An unpublicized quota limit on W3C’s host can result in an IP address being banned from W3C’s website. To avoid exceeding the quota limit, MMI advises users to set up their own validator host.
W3C HTML Validator
This node submits URL(s) to the W3C Nu HTML Validator service to evaluate the quality of the HTML source. It uses W3C’s host https://validator.w3.org/nu/ by default. Note: W3C’s host has an unpublicized quota limit, and sending a large batch of URLs through this node can result in an IP address being banned from W3C’s website. To avoid exceeding the quota limit, MMI advises users to set up their own validator host.
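To show how a configurable validator host fits in, here is a small sketch that builds a Nu HTML Checker request URL and tallies the messages in its JSON output. It assumes the checker's `doc` and `out=json` query parameters and its `messages` array with `type` and `subType` fields. The helper names, the local host address, and the sample result are hypothetical, and no request is sent.

```python
from collections import Counter
from urllib.parse import urlencode

DEFAULT_HOST = "https://validator.w3.org/nu/"


def validator_request_url(page_url, host=DEFAULT_HOST):
    """Build a GET URL asking the checker to fetch and validate page_url."""
    return host + "?" + urlencode({"doc": page_url, "out": "json"})


def summarize(result):
    """Count errors, warnings, and informational messages in a result."""
    counts = Counter()
    for message in result.get("messages", []):
        if message.get("type") == "error":
            counts["errors"] += 1
        elif message.get("subType") == "warning":
            counts["warnings"] += 1
        else:
            counts["info"] += 1
    return dict(counts)


# Hypothetical self-hosted checker, used here to stay clear of W3C's quota.
own_host = "http://localhost:8888/"
request_url = validator_request_url("http://www.example.com/", host=own_host)
print(request_url)

# Hand-written stand-in for a checker response (illustrative only).
sample_result = {
    "messages": [
        {"type": "error", "message": "Stray end tag."},
        {"type": "info", "subType": "warning", "message": "Consider adding a lang attribute."},
    ]
}
summary = summarize(sample_result)
print(summary)
```

Keeping the host as a parameter means a batch of URLs can be pointed at a private validator instance without changing the rest of the workflow.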
February 26, 2016: Benjamin Spiegel Speaks at KNIME’s Spring Summit in Berlin
October 19, 2015: Correlations and Why They Matter in Advertising
September 16, 2015: MMI Announces New Partnership with KNIME.com AG