Bentley Like Beckham wrote:Can someone give me an overview of exactly what we're trying to accomplish here?
ezubaric wrote:at Caltech we tried to have an online system called Jerome where people just filled in templates and then it got categorized and sent to an editor, but it never really caught on. I used it for myself for editing, but people couldn't be compelled to use it (even though you got real-time feedback from editors and got to see your questions move through the editing process). I'm not saying that Jerome is the answer (he's old, creaky, and built on old technology), but I think the idea was a decent one.
grapesmoker wrote:Actually, I think as far as collaborative editing goes, a system that either utilizes Google Docs or some CMS system like Drupal would be the way to go. I'm exploring the possibility of using Drupal for this but I haven't had much experience with it so I don't know how feasible it is.
Bentley Like Beckham wrote:Can someone give me an overview of exactly what we're tryiing to accomplish here?
As someone who vaguely remembers submitting questions via Jerome, I think the hardest part was that you didn't know what parts of the distribution you had filled already. I don't remember all that much about the process, so this might be wrong, but (1) I don't think there was any code that said "this is RMP, this is Literature, etc." and the system just tried to guess by answer selection, and (2) I don't remember whether you could see all the questions your team wrote or just all the questions you submitted; if the latter, then that's a huge problem. You also had to submit questions one at a time, rather than as a complete file, and I'm more sure on this because I don't have anything saved to my computer from that tournament except a largely incomplete packet, and I know I submitted at least 3/4 of the full packet via Jerome. This system seemed much more inefficient from the packet-submission end than from an editing aspect.ezubaric wrote:I also second the idea of a quiz bowl question editor that would produce packets in whatever format was desired; at Caltech we tried to have an online system called Jerome where people just filled in templates and then it got categorized and sent to an editor, but it never really caught on. I used it for myself for editing, but people couldn't be compelled to use it (even though you got real-time feedback from editors and got to see your questions move through the editing process). I'm not saying that Jerome is the answer (he's old, creaky, and built on old technology), but I think the idea was a decent one.
dschafer wrote:Regarding automatic scoring: there was a room at ICT where this was done, and it was awesome. It had a scoreboard, what tossup we were on, countdown timers for bonus parts, and even a little indicator for which player had buzzed. It was pretty amazing.
cvdwightw wrote:As someone who vaguely remembers submitting questions via Jerome, I think the hardest part was that you didn't know what parts of the distribution you had filled already. I don't remember all that much about the process, so this might be wrong, but (1) I don't think there was any code that said "this is RMP, this is Literature, etc." and the system just tried to guess by answer selection, and (2) I don't remember whether you could see all the questions your team wrote or just all the questions you submitted; if the latter, then that's a huge problem. You also had to submit questions one at a time, rather than as a complete file, and I'm more sure on this because I don't have anything saved to my computer from that tournament except a largely incomplete packet, and I know I submitted at least 3/4 of the full packet via Jerome. This system seemed much more inefficient from the packet-submission end than from an editing aspect.
dschafer wrote:We develop a universally agreed-upon XML quizbowl schema, probably similar to or based on the XML output from xmlize.pl. We use that script (or a variant) to convert current or old Word packets to this XML format. Additionally, we develop a standard plain-text markup system for those of us that would prefer that, and create a script to convert the plain-text to the XML format. Basically, any packet could be easily converted into this standard XML format.
cvdwightw wrote:I'm not by any means the go-to guy for markup languages and all that fancy stuff, but I think there's a bit of an issue between how people are submitting packets and how packets come out. For instance, Jerry's algorithm searches for "tossups", then treats everything as a tossup. This doesn't work if instead I submit my packet with "Literature 5/5" and then put five tossups and five bonuses, then "History 5/5" with five tossups and five bonuses, etc. It works great for finished packets, but not so well with submissions. On the other hand, if we make a LaTex shell like Andrew suggests, then each tournament can customize it to their distribution by using whatever number of tossup and bonus blocks they want, along with a \category{} line that the editor fills in. The advantages of this are great - uniformity, for one thing - and there's not a super-high barrier to entry for new writers, because you've already given them everything that they need, all they need to do is type the question, put the answer in the appropriate spot, put the prompts/do not accepts in the appropriate spot, and send. At this point, maybe I'm dreaming, but there's another shell that allows even a LaTex-illiterate editor to create a full packet from the edited original version, and then the editor just exports to .pdf file.
Deesy Does It wrote:I feel like openly joining Ryan Westbrook's movement of quizbowl luddites. Please people,, things are pretty much good I think. We have questions to play on, and quite often they are well written. That's really all you need to have good quizbowl. I don't want to play on computer-read quizbowl that automatically scores itself or anything, I want to have a moderator who reads me questions. I don't want to have to learn any kind of computer script to write my packets, especialyl considering my poor experience trying to learn LaTex, and beyond Jerry's searchable database, I don't see any real reason to come up with all kinds of codes for quizbowl (especially since it sounds to me like it's not too hard to convert packets that are already ACF formatted into searchable texts for the QBDB). Beyond that, what is so necessary about all of this? It looks good? I'm honestly not following.
NoahMinkCHS wrote:Also, for the self-described Luddites: AFT is actually less newfangled than Word. It's platform-independent plain text! Beyond robot readers and databases, there are plenty of reasons to eschew a proprietary, error-laden file format that only complicates the writing, submission, editing, formatting, and reading steps by making dumb guesses about what the writer might be trying to do. Plain text has a much lower "ZOMG fear new technology!" factor than Word. I've never edited anything more advanced than a house-written HS tournament or my team's packet for a submission tournament and even I've seen a multitude of ways Word makes life harder. Now that we have to deal with DOCX format from Word 2007, not to mention people submitting files that were created in all 103 previous versions of Word, plus OpenOffice, plus Mac versions, that's only going to get worse. Meanwhile, everybody has Notepad...
ezubaric wrote:The take home lesson of this is that any proposed system must not impose any additional burdens on the end-user. Even it makes life much easier for editors, it won't get adopted unless it also somehow helps the people that will have to interact with it.
grapesmoker wrote:Obviously in the end we'd like to end up with something that is intuitive for the end user and avoids any need to manually do markups. I recognize the shortcomings of my algorithm, which is why at the moment it's only useful for importing compiled packets.
people become the brunt of yet another round of jokes at your expense.Matt Weiner wrote:"you should use Linux/emacs/a magnetized needle that you manually change bits in your RAM with"
Matt Weiner wrote:NoahMinkCHS wrote:Also, for the self-described Luddites: AFT is actually less newfangled than Word. It's platform-independent plain text! Beyond robot readers and databases, there are plenty of reasons to eschew a proprietary, error-laden file format that only complicates the writing, submission, editing, formatting, and reading steps by making dumb guesses about what the writer might be trying to do. Plain text has a much lower "ZOMG fear new technology!" factor than Word. I've never edited anything more advanced than a house-written HS tournament or my team's packet for a submission tournament and even I've seen a multitude of ways Word makes life harder. Now that we have to deal with DOCX format from Word 2007, not to mention people submitting files that were created in all 103 previous versions of Word, plus OpenOffice, plus Mac versions, that's only going to get worse. Meanwhile, everybody has Notepad...
Please remember to avoid the inherent fallacy of all "you should use Linux/emacs/a magnetized needle that you manually change bits in your RAM with" arguments, and do not assume that everyone is coming into this from the state of nature. People already know how to use Word, and anything else is something they have to learn. It's not an all-things-equal situation.
grapesmoker wrote:dschafer wrote:We develop a universally agreed-upon XML quizbowl schema, probably similar to or based on the XML output from xmlize.pl. We use that script (or a variant) to convert current or old Word packets to this XML format. Additionally, we develop a standard plain-text markup system for those of us that would prefer that, and create a script to convert the plain-text to the XML format. Basically, any packet could be easily converted into this standard XML format.
Much of this work is already be done (though I wouldn't be so presumptuous as to suggest that my particular approach is "universally agreed-upon.") What really needs to happen, and hasn't yet, is a single script, that end-to-end transforms a packet into XML and maybe even contacts the server to upload the packet, to make it easier for people to work with.
dschafer wrote:Can you create a place (on the qbwiki, maybe) where you outline the XML schema you use? The one thing that has been puzzling me is how to deal with 30-20-10, "5 for one, 10 for two..." and "FFPE and a five point bonus" type bonuses, which (though they may be uncommon), should still be addressable by the XML format.
[10] For five points for one, and ten for two, identify Jerry Vinokurov's two least favorite pieces of Microsoft software.
BuzzerZen wrote:One piece of software that I suspect would be interesting to some here is Prince, which transforms X(HT)ML into PostScript/PDF documents using ordinary CSS with some extensions. So it should be possible to transform QBML into PDF packets without using LaTeX as an intermediary. It's proprietary software, and use on a server technically requires an expensive server license, and the free version embeds a watermark on the first page of every document...on the other hand, it's very nifty.
Edit: Can I suggest that "QBML" not be used as the name, as this is the same acronym used by NAQT (according to their new writer's kit) and thus might lead to confusion.
grapesmoker wrote:edit: Quick rundown of QBML now available for those interested.
ezubaric wrote:grapesmoker wrote:edit: Quick rundown of QBML now available for those interested.
Nice.
Some more things that we had in Jerome (obviously more geared toward editing):
1. author for each question
2. editor for each question
3. status of each question (newly submitted, needs editing, finished)
4. category of each question
Also, why are questions within tossup and bonus tags? Why not just have a type for each? It would make submitting questions easier (if you have four people, you could just copy and paste submissions into a single file rather than separating out tossups and bonuses).
grapesmoker wrote:Can you give an example of what you mean? I'm not sure I totally understand this.
ezubaric wrote:
How would "ANSWER: _J_ohn Fitzgerald _Kennedy_" be encoded?
<answer><req>J</req>ohn Fitzgerald <req>Kennedy</req></answer><clue>FAQTP, identify this curved <power value="15" />yellow fruit.</clue>metsfan001 wrote:What about _Philip_ _II_ _of Macedon_, where either Philip II or Philip of Macedon is acceptable?
<answer>
<req>Philip II</req> of Macedon
<accept><req>Philip</req> II of <req>Macedon</req></accept>
<prompt>Philip</prompt>
</answer>
grapesmoker wrote:There's no real need to have the "accept" portion be a separate tag; this needlessly complicates parsing. It's enough to just specify the alternately acceptable answer and mark the parts that are acceptable with the <req> tag. It's what we do now anyway and I'm not sure there's need for a special "accept" or "prompt" tag.
<answer>
<req>Arsenal</req> F.C.
<prompt>Gunners</prompt>
</answer>
dschafer wrote:I'm not sure I follow; how would you do Philip II?
<answer>
<req>Philip II</req> or <req>Philip of Macedon</req>
</answer>
<answer>
<req>Arsenal</req> F.C. (prompt on <req>Gunners</req>)
</answer>
dschafer wrote:Okay, I see. The benefit of those extra answer tags would be for an IRC bot (or something similar) using the CQML as its question source; with <accept>,<prompt> and <dna> tags, the bot can easily figure out what to do with a given answer, without having to do any sort of non-XML parsing.
grapesmoker wrote:dschafer wrote:Okay, I see. The benefit of those extra answer tags would be for an IRC bot (or something similar) using the CQML as its question source; with <accept>,<prompt> and <dna> tags, the bot can easily figure out what to do with a given answer, without having to do any sort of non-XML parsing.
Ah, that makes sense. It might be tough to work out how to automatically capture prompts and alternate answers from text, since, again, people are not consistent in how they indicate those parts.
BuzzerZen wrote:I think a sensible tack to take would be to, in general, just put everything into an answer line, and trust people who are competent and interested enough to write IRC bots to be competent and interested enough to convert human-readable answer lines into machine-interpretable ones.
dschafer wrote:Okay, I see. The benefit of those extra answer tags would be for an IRC bot (or something similar) using the CQML as its question source; with <accept>,<prompt> and <dna> tags, the bot can easily figure out what to do with a given answer, without having to do any sort of non-XML parsing.
ezubaric wrote:Since we don't quite have the kitchen sink in there, another nice markup would be in the question itself; imagine if all of the novels, place names, people, dates, etc. were marked in a question. Then you could easily do a search for "ALL PEOPLE MENTIONED IN QUESTIONS ABOUT MANIFEST DESTINY, ORDERED BY FREQUENCY". That would be awesome. Again, I'm not saying that we should make this the standard, but having the schema support this kind of thing would be cool (and make checking for repeats easier if it did exist ... it could even be done semi-automatically, as NE recognition is getting pretty good).
grapesmoker wrote:Not sure what the utility of such a search would be. You can already find every question on manifest destiny in the database, what would be the point of having proper names marked up specially? We could certainly throw that in at some further point if we're feeling adventurous, I guess.
ezubaric wrote:Well, seeing that 74% of questions on Manifest Destiny mention Horace Greeley without having to read through all of them would be cool.
fleurdelivre wrote:ezubaric wrote:Well, seeing that 74% of questions on Manifest Destiny mention Horace Greeley without having to read through all of them would be cool.
Isn't that when you just query the results of the first and do some quick arithmetic, though? I mean, it would be cool, but if we don't start with something practical /simple this will never get off the ground.
In an <ANNOTATION TYPE="POWER">editorial</ANNOTATION>, <ANNOTATION TYPE="PERSON">Horace Greeley</ANNOTATION>'s paper ...
ezubaric wrote:
- Code: Select all
In an <ANNOTATION TYPE="POWER">editorial</ANNOTATION>, <ANNOTATION TYPE="PERSON">Horace Greeley</ANNOTATION>'s paper ...
So rather than have a specific power tag, we could just have a general "ANNOTATION" that could denote special properties of individual phrases in the text. People could just throw away the annotations they didn't care about. This would allow for flagging other stuff like adult language, meta, etc.
BuzzerZen wrote:{quotes removed for brevity}
I think a sensible tack to take would be to, in general, just put everything into an answer line, and trust people who are competent and interested enough to write IRC bots to be competent and interested enough to convert human-readable answer lines into machine-interpretable ones.
Matt Weiner wrote:You could implement that as a second session of tinkering with the packets when they are added to the database. I've already talked to Jerry about adding a tagging system that would let trusted users mark questions as "history" "social science" "geography" etc, and then again by subcategory. You could mark out people, places, years, and whatnot in the same way.
Matt Weiner wrote:I've already talked to Jerry about adding a tagging system that would let trusted users mark questions as "history" "social science" "geography" etc, and then again by subcategory.
You could mark out people, places, years, and whatnot in the same way.
Return to College area archives
Users browsing this forum: No registered users and 2 guests