Thanks Lee! Here is the video link:
What is TopMeta?
When you import either your own data or live social media feeds into DiscoverText, that data often includes various “metadata,” providing a wealth of revealing information about the Tweet, Facebook post, public comment, or survey response you will be analyzing. “TopMeta Explorer” is the function in DiscoverText that allows you to view the number of most (or least) frequently occurring metadata items and filter your data according to that metadata. Considering the wealth of metadata that may be within your data, the ability to easily organize and filter such metadata may turn out to be the difference between substantive and inadequate research.
Metadata is Power
When might the organization of metadata come in handy, you may also ask? It’s easy to imagine the answer to this question when you consider the kinds of metadata you may collect from live feeds such as the public Twitter API or the GNIP PowerTrack. From those feeds alone, you may collect any of the following metadata (depending on your search method):
1) The time & date of a Tweet, 2) the account name of the tweet’s sender, 3) the real name of the tweet’s sender, 4) the “hashtags” in a tweet, 5) the account name(s) “mentioned” in a tweet, 6) the shortened URL in a tweet, 7) the expanded URL in a tweet, 8) a link to the tweet itself, 9) a direct link to the media in a tweet, 10) the geo-coordinates from which a tweet is sent, 11) the number of “followers” of a tweet’s sender, 12) the number of those “following” a tweet’s sender, 13) the date that a tweet sender’s account was created, 14) the city of the tweet sender, and 15) the “Klout” score of the tweet’s sender.
Until now, the “TopMeta Explorer” function has allowed users to easily sort this kind of metadata within DiscoverText.
As of this week, this metadata can now be exported as a .CSV file, empowering Enterprise DiscoverText users to more seamlessly utilize the capabilities of DiscoverText, in tandem with their other research tools. We’ll continue to keep you posted about exciting new developments in DiscoverText as they are launched. If you are interested in trying DiscoverText for yourself, sign-up at discovertext.thrivehivesite.com and email me at firstname.lastname@example.org. I’ll be happy to get you started.
The sign up remains open. Jump in and let us know if you like our Enterprise solution for social media analytics.
Texifter is launching a second beta test period using “Power Track for Twitter” fire hose filtering a service provided by GNIP. We have streamlined the process of providing Enterprise class access to the beta test. This beta includes access to an expanding set of tools for archiving, filtering, coding, validating and machine classifying text. You can train a custom machine classifier in about 30 minutes.
The GNIP Power Track, in partnership with Twitter, provides users with unrestricted, real-time filtering of the Twitter fire hose. This enriched feature for DiscoverText provides a valuable analytical tool to our users. Not only will the GNIP Power Track provide users with access to the full stream of fire hose data, it will also provide Klout scores, language data, re-tweet frequency, geographic coordinates, and all #hashtags where available in the results. Taken together, this quantity of data and rich metadata fields will allow users to perform valuable social media analysis within DiscoverText.
For more information: info@DiscoverText.com
DiscoverText is rolling-out an addition to its analytical toolkit: random sampling. The Web-service already offers an array of tools for text analytics and rigorous, team-based qualitative data analysis. These functions include the ability to code and annotate text, measure inter-rater reliability, adjudicate coder validity, attach memos to text, cluster duplicate and near-duplicate documents, share documents, and to classify text using an active-learning Naive-Bayesian classifier. While still in beta, random sampling is a key new addition.
After DiscoverText users amass extraordinary amounts of social media data (for example via the Public Twitter API, the GNIP Powertrack, or the Facebook Social Graph), they can now more easily extract a random sample for analysis. The size of the sample is decided by the user in order to accommodate to iteration, experimentation and other scientific methods. The option is streamlined into the dataset creation process. On the new dataset creation page, you see a sample size prompt.
This additional method for data prep and analysis augments current information retrieval techniques, such as search with advanced filtering. It also builds up our framework for expanding available NLP methods from straightforward Bayesian classification, which aims to analyze substantial quantities of data in their original bulk-form, to a menu of computationally intensive methods that can iterate more quickly and effectively against random data samples. For example, the LDA topic model tool we are releasing will be faster and more effective against smaller random samples.
This new feature accommodates both an additional analytical approach as well as the opportunity to easily compare results between competing (or complimentary) analytic methods. We look forward to experimenting with this new tool and hearing about how random sampling will enhance the research of our users and users to come.
Special Note to DT Users: We need to turn this feature on one account at a time while we are testing it. Drop us a line if you want to try the tool.
We’ll keep you posted on the launch as more dataset modifications are pushed live. As always, if you have any questions, feel free to email us anytime at email@example.com. Your feedback is crucial. Sign up and try it out for yourself at discovertext.thrivehivesite.com.
We have been delighted with the response to our call for beta testers to try the GNIP-enabled PowerTrack for Twitter. You can still sign up. Round 1 of the beta test concludes on October 31, 2011. Even just testing the system’s data filtering and collecting capabilities for 1 or 2 days, or as few as 1-2 hours, may convert you to a devoted GNIP via DiscoverText user. As part of taking beta tester applications, we asked folks to tell us something about how they planned to use the beta test opportunity. Thanks to “Wordle” we can visualize an answer to the question: “Why do people want to take part in the GNIP beta test via DiscoverText?”
This is an 11-minute tutorial covering how you get started using the GNIP Power Track for Twitter (the “full firehose”) to capture large numbers of Tweets for analysis.
This short video talks about some of the advantages when using the GNIP-enabled Power Track for gathering Tweets via DiscoverText.
DiscoverText is preparing to launch a short and exclusive beta test period using “PowerTrack for Twitter Firehose Filtering” a service provided by GNIP. Compared to the “rate limited” service offered by DiscoverText through the public Twitter API, the “Full Firehose” is 50-100 times the volume with powerful Klout, language and keyword filters.
If you would like to participate in this trial, please leave us your contact information and tell us a little bit about your work. We will not be able to offer this trial service to everyone, so please make the case for the value you or your organization will add as beta testers.