You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
## DUALIST: Utility for Active Learning with Instances and Semantic Terms ##
3
2
4
-
Burr Settles
5
-
Carnegie Mellon University
6
-
bsettles@cs.cmu.edu
3
+
_Hooray for recursive acronyms!_
7
4
8
-
Version 0.3
9
-
March 08, 2012
5
+
Version 0.3 / March 08, 2012
10
6
11
-
DUALIST is an interactive machine learning system for building classifiers
12
-
quickly. It does so by asking "questions" of the user in the form of both data
13
-
instances (e.g., text documents) and features (e.g., words or phrases). It
14
-
utilizes active and semi-supervised learning to quickly train a multinomial
15
-
naive Bayes classifier for this setting.
7
+
DUALIST is an interactive machine learning system for quickly building classifiers for text processing tasks. It does so by asking "questions" of a human "teacher" in the form of both data instances (e.g., text documents) and features (e.g., words or phrases). It uses [active learning](http://www.cs.cmu.edu/~bsettles/pub/settles.activelearning.pdf) and [semi-supervised learning](http://www.cs.wisc.edu/~jerryzhu/pub/ssl_survey.pdf) to build text-based classifiers at interactive speed.
16
8
17
-
NOTICE: This is currently "research-grade" code. It is provided AS-IS without
18
-
any warranties of any kind, expressed or implied, including but not limited to
19
-
the implied warranties of merchantability and fitness for a particular purpose
20
-
and those arising by statute or otherwise in law or from a course of dealing
21
-
or usage of trade. *Whew!*
9
+
Research related to DUALIST is described in these publications:
22
10
23
-
See LICENSE.txt for licensing information.
24
-
See CHANGELOG.txt for a history of updates.
11
+
* B. Settles. [Closing the Loop: Fast, Interactive Semi-Supervised Annotation With Queries on Features and Instances](http://aclweb.org/anthology/D/D11/D11-1136.pdf). In _Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)_, pages 1467-1478. ACL, 2011. ([addendum](http://www.cs.cmu.edu/~bsettles/pub/settles.emnlp11addendum.pdf))
12
+
* B. Settles and X. Zhu. [Behavioral Factors in Interactive Training of Text Classifiers](http://www.cs.cmu.edu/~bsettles/pub/settles.naacl12short.pdf). In _Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT)_, pages 563-567. ACL, 2012.
25
13
26
-
Citation information and technical details:
14
+
Watch a [demonstration video](http://vimeo.com/21671958) of DUALIST in action!
27
15
28
-
B. Settles. Closing the Loop: Fast, Interactive Semi-Supervised Annotation
29
-
With Queries on Features and Instances. In Proceedings of the Conference
30
-
on Empirical Methods in Natural Language Processing (EMNLP), to appear.
31
-
ACL Press, 2011.
16
+
----
32
17
33
-
34
-
35
-
PURPOSE & GOAL
36
-
--------------
18
+
### Purpose & Goal ###
37
19
38
20
The purpose of DUALIST is threefold:
39
21
@@ -51,25 +33,26 @@ than the multinomial naive Bayes classifier currently used.
51
33
combine multiple "beyond supervised learning" strategies. This ICML workshop
52
34
is related: https://sites.google.com/site/comblearn/
53
35
36
+
See `LICENSE.txt` for licensing information.
37
+
See `CHANGELOG.md` for a history of updates.
54
38
55
39
56
-
INTALLATION + RUNNING THE GUI
57
-
-----------------------------
40
+
### Installation + Running the Web-Based GUI ###
58
41
59
-
DUALIST requires Java 1.6 and Python 2.5 to work properly. It ships with most
60
-
of the dependencies it needs to work, the only exception being the Play! web
42
+
DUALIST requires Java 1.6 and Python 2.5 to work properly. It ships with most
43
+
of the dependencies it needs to work, the only exception being the Play! web
61
44
framework for Java v1.1+, which can be downloaded here:
This work is supported in part by DARPA (under contract numbers FA8750-08-1-0009 and AF8750-09-C-0179), the National Science Foundation (IIS-0968487), and Google. Any opinions, findings and conclusions or recommendations expressed in this material are the authors' and do not necessarily reflect those of the sponsors.
0 commit comments