User Controls

  1. 1
  2. 2
  3. 3
  4. ...
  5. 308
  6. 309
  7. 310
  8. 311
  9. 312
  10. 313
  11. ...
  12. 638
  13. 639
  14. 640
  15. 641

Posts by gadzooks

  1. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    Originally posted by Shrooms

    Dude sounds mad about something.
  2. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  3. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  4. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    Originally posted by Nil

    I think that song might be about my ex-girlfriends' legs.
  5. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  6. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  7. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  8. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  9. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  10. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    Originally posted by -SpectraL Obviously it would have to only output publicly available data for you to piss all over, but that's probably cleaner than fucking parsing the markup and trying to extract the son-of-a-bitch that way.

    I don't even mind parsing markup.

    I actually kinda like doing it from time to time.

    I've spent quite a bit of time knee deep in the NiS HTML/CSS when making my GreaseMonkey script for blocking infinityshock.

    The general structure of NiS templates is pretty familiar to me.
  11. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    I don't know django right now.

    I could prolly figure it out if I was a bit more sober and clear headed and had a few hours, maybe even days, to go over the docs.

    But right now, my script is doing it's thing, and not too inefficiently from what I can tell.

    It's actually just about to reach 30% done.
  12. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    Originally posted by -SpectraL ie:

    Yeah, go for it, you fucker. If you want to write a management command(django's mechanism for scripts that don't happen as part of the request/response cycle), but I'll bet you're too dumb for it, to pull it straight from the DB and dump that god damned garbage into some CSV files or something I wouldn't mind running it and just sending your idiotic ass the output instead of you having to scrape everything using that shovel nose of yours. Obviously it would have to only output publicly available data for you to piss all over, but that's probably cleaner than fucking parsing the markup and trying to extract the son-of-a-bitch that way.

    Wut?

    Nigga I'm drinking scotch, snorting k, and smoking crack right now.



    I'm pretty much that right now.

    Except my script I wrote while (mostly) sober, so it works.
  13. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    Originally posted by Lanny Yeah, go for it. If you want to write a management command (django's mechanism for scripts that don't happen as part of the request/response cycle) to pull it straight from the DB and dump it into some CSV files or something I wouldn't mind running it and just sending you the output instead of you having to scrape everything. Obviously it would have to only output publicly available data but that's probably cleaner than parsing the markup and trying to extract content that way.

    I appreciate the offer.

    I've got a simple script just downloading each HTML response for every iterative thread request (i.e: "thread 1".html, "thread 2.html", etc) up until i == 10,000 for now). That would be close to a third of the entire content. I started running it with a half second interval between requests.

    I have some very simple exception handling (the bare essentials), and so my PyCharm console is showing me that it's currently at thread number 511xx. And it's also catching a very vaguely defined exception (on my part) and telling me that every few dozen threads there is a thread not saving... I imagine some threads have been deleted over the course of the years for one reason or another.

    But it keeps on truckin'. It's looking like it should actually be totally done the entire site before the night ends, which is pretty good. I'm just brute-saving full HTML files for each thread, and I'll use Beautiful Soup to parse the files locally in some fashion and make some kind of database.

    Panny was on my ass the other day about a word cloud I promised I'd make him, so it kinda lit the fire under my ass to just archive all the publicly viewable posts in one fell swoop so that I can analyze and process the data locally.

    The only other thing I might want to run a separate script for is extracting member post info stats (basically just username, reg date, post count, and thanks given and thanks received) for each user.

    I don't think it will necessitate downloading entire posts all over again. I'll optimize it as best I can.
  14. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  15. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  16. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    Originally posted by gadzooks

    WHAT DEMON FROM BEYOND THE REALM OF ALL INNOCENT SOULS WENT AND POSTED THIS TRUNCATED VIDEO!?



    When a track cuts short right in the swing of the beat, a homicidal rage starts to brew within me.
  17. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  18. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  19. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
  20. gadzooks Dark Matter [keratinize my mild-tasting blossoming]
    Originally posted by -SpectraL It was written in 1999, recorded in 2000, and released in 2001.

    Good enough for me.
  1. 1
  2. 2
  3. 3
  4. ...
  5. 308
  6. 309
  7. 310
  8. 311
  9. 312
  10. 313
  11. ...
  12. 638
  13. 639
  14. 640
  15. 641
Jump to Top