PyCon is a 100% Volunteer-run Conference Organized by Members of the Python Community.
Details of Talk
#47: Query-directed Data Mining using Python and Parallel Processing
Presented: Thu Mar 24, 2005, 03:30 pm-04:00 pm, Sprinting Room 307
Author(s): Christopher Gillett / Compete, Inc.
Items: audio-no, handouts-yes, released-yes, video-no
Abstract:
Compete, Inc. has built a query processing system using Python which forms the basis for all query-directed data mining within our company. The system accepts queries written in "near SQL" syntax and performs parsing and analysis of the queries. Query decomposition allows exploitation of parallel computing resources and allows queries involving large amounts of data to be answered comparatively quickly.
The system was built entirely using Python and is used in conjunction with the Portable Batch System and various Unix utilities.
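The decomposition idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the partitioned rows, the names run_subquery and run_decomposed, and the use of a local thread pool as a stand-in for a batch scheduler such as the Portable Batch System. It is not the system described in the talk, only one way a query over a large data set might be split into per-partition sub-queries whose partial results are merged.

```python
from concurrent.futures import ThreadPoolExecutor


def run_subquery(partition, predicate):
    """Answer the query for one partition: count and sum of matching rows."""
    matches = [row["hits"] for row in partition if predicate(row)]
    return len(matches), sum(matches)


def run_decomposed(partitions, predicate, workers=4):
    """Fan the sub-queries out to workers, then merge the partial results."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(lambda p: run_subquery(p, predicate), partitions))
    count = sum(c for c, _ in partials)
    total = sum(s for _, s in partials)
    return count, total


# Hypothetical data: one partition per day of log rows.
PARTITIONS = [
    [{"site": "a.example", "hits": 3}, {"site": "b.example", "hits": 5}],
    [{"site": "a.example", "hits": 7}],
    [{"site": "b.example", "hits": 1}, {"site": "a.example", "hits": 2}],
]

# Roughly: SELECT COUNT(*), SUM(hits) FROM log WHERE site = 'a.example'
count, total = run_decomposed(PARTITIONS, lambda row: row["site"] == "a.example")
print(count, total)
```

Counts and sums merge by simple addition, which is why this kind of aggregate splits cleanly across partitions; an average would need to be rebuilt from the merged count and sum rather than averaged per partition.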
Item(s):
Note: Talk recordings have come from different donors and vary in quality.
A suffix reflecting the donor has been added to the basename of each recording. For eventual upload to
a repository such as archive.org, the files follow a formal naming convention:
pycon-{date}-{track}-{timeslot}-{talkno}-{donor}.mp3
For those who prefer a more human-meaningful name, the recordings carry embedded tag information
(ID3 for MP3, the equivalent tags for Ogg and FLAC), so a simple Python script could rename your
collection to a {title}-{author} form.
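A rename script of the kind the note mentions might look like the sketch below. It is an assumption-laden outline, not a tool shipped with the recordings: the names safe, target_name, and rename_collection are made up here, and reading the embedded tags is left to a caller-supplied read_tags function, since that step depends on whichever tag library is available.

```python
import re
from pathlib import Path


def safe(text):
    """Replace characters that are awkward in filenames with underscores."""
    return re.sub(r"[^\w.\- ]+", "_", text).strip()


def target_name(title, author, suffix):
    """Build a {title}-{author} filename, keeping the original extension."""
    return f"{safe(title)}-{safe(author)}{suffix}"


def rename_collection(folder, read_tags, dry_run=True):
    """Plan (and optionally perform) renames for every pycon-* file in folder.

    read_tags is a hypothetical callable: read_tags(path) -> (title, author).
    With dry_run=True nothing is renamed; the planned pairs are only returned.
    """
    plan = []
    for path in sorted(Path(folder).glob("pycon-*")):
        title, author = read_tags(path)
        new_path = path.with_name(target_name(title, author, path.suffix))
        plan.append((path, new_path))
        if not dry_run:
            path.rename(new_path)
    return plan
```

Running it first with dry_run=True and inspecting the returned pairs is a cheap way to catch missing tags or name collisions before any file is touched.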