screeley.com

A Faster Python Script for Extracting Excerpts from Articles

July 1

A couple of weeks ago David Ziegler posted an article on how to extract excerpts from articles using Python and BeautifulSoup. It works well, but I would like to suggest some improvements by using lxml instead. It's a fairly simple problem: get the title and the description out of the head, and if there is no description, try to pull some content out of the body. The first two are easy and the last one sucks, but Python has tools that make our lives easier. BeautifulSoup is the go-to for web scraping in Python, but it suffers when it comes to performance. lxml is definitely faster, in this case by about 3 times.
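As a rough illustration of the three steps described above, here is a minimal sketch using lxml. It is not the script from the full post: the function name extract_excerpt and the min_length cutoff are made up for this example, and the body fallback simply takes the first paragraph that is long enough.

```python
import lxml.html


def extract_excerpt(html, min_length=40):
    """Return (title, description) for an HTML page.

    extract_excerpt and min_length are hypothetical names for this sketch.
    """
    doc = lxml.html.fromstring(html)

    # Step 1: the title comes straight out of the head.
    title = (doc.findtext(".//title") or "").strip()

    # Step 2: so does the meta description, when the page has one.
    content = doc.xpath("//meta[@name='description']/@content")
    description = content[0].strip() if content else ""

    # Step 3: otherwise fall back to the first reasonably long paragraph.
    if not description:
        for p in doc.xpath("//body//p"):
            text = " ".join(p.text_content().split())
            if len(text) >= min_length:
                description = text
                break

    return title, description
```

Real pages are messier than this (descriptions in differently cased meta tags, content that lives in divs instead of paragraphs), so treat the fallback as a starting point only.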


Django Alfresco

June 15

I have a wheelhouse and it's integrating Django with a Java open source project. Today I get to announce the next one, Django Alfresco. We combined Alfresco's document management capabilities with Django's web tier components. I get mixed reactions when I tell people about this project, anywhere from "Why did you go and mess up a good thing?" to "This is amazing." I hear the former more than the latter, but I'm going to try to convince you that it is a really good idea to use this project. Jeff Potts, who is the ECM lead at Optaros and got the project to a place where it could be released, has a post on it and a screencast.


Django Daemon Command Extension

June 7

Recently I started to move Cubby Scott away from a cron job and towards a queue. It's hard to be real time when you wake up a cron job once every 3 minutes. Lame. I'm also in the process of adding screenshots and content retrieval, and both take a good amount of time to process. The queue part was easy after reading Rabbits and Warrens and Working with Python and RabbitMQ. The problem came when I started working on the consumer. No one ever talks about the consumer. Well, I'm going to give the consumer some love.
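To make the cron versus queue difference concrete, here is a minimal sketch of a consumer loop. It is an assumption-heavy stand-in: Python's standard library queue.Queue plays the part of RabbitMQ, and the consume function and its stop_token argument are hypothetical names, not the daemon command from the full post.

```python
import queue


def consume(jobs, handle, stop_token=None):
    """Handle jobs as they arrive until stop_token shows up.

    jobs is a queue.Queue standing in for a real message queue,
    handle is any callable that processes one job.
    """
    processed = 0
    while True:
        job = jobs.get()  # blocks until a job is available, no polling interval
        if job is stop_token:
            break
        handle(job)
        processed += 1
    return processed


jobs = queue.Queue()
for job in ("fetch-links", "take-screenshot"):
    jobs.put(job)
jobs.put(None)  # tell the consumer to shut down

handled = []
count = consume(jobs, handled.append)
```

The point of the blocking get() is that work starts the moment a message lands, instead of waiting for the next cron tick.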


CubbyScott.com | An experiment in 140 character requirements

May 8

Is there a 3rd party twitter app that builds a link page based on my follows? If not, someone should build it. It would be my start page.

Fred Wilson posted this tweet a few days ago, a pretty simple requirement. Get all users that Fred is following, parse, get the links and display them for Fred's viewing pleasure. Personally I really like this idea. The problem with an asymmetrical relationship is that you really only follow that person for the interesting links they post. I follow mostly tech people and honestly, their personal comments don't really do much for me. It would be great if I could get all those links into one feed and filter out all the noise.

So in the last 4 days I put together an application to do this. Personal web developer to Fred Wilson and hopefully a few others out there.
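The "parse, get the links" step of that requirement can be sketched in a few lines. This is only an illustration under stated assumptions: the tweets are plain strings that have already been fetched, the regular expression is a deliberately loose URL matcher, and collect_links is a hypothetical name, not code from the application.

```python
import re
from collections import Counter

# Loose pattern: anything starting with http:// or https:// up to whitespace.
URL_RE = re.compile(r"https?://[^\s<>\"']+")


def collect_links(tweets):
    """Return (url, count) pairs, most frequently shared links first."""
    counts = Counter()
    for text in tweets:
        for url in URL_RE.findall(text):
            # Drop punctuation that trails the link at the end of a sentence.
            counts[url.rstrip(".,;:!?)")] += 1
    return counts.most_common()


tweets = [
    "Great read http://example.com/a.",
    "Lunch was good today",
    "Also http://example.com/a and http://example.com/b",
]
links = collect_links(tweets)
```

Tweets without links simply drop out, which is the "filter out all the noise" part.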


Class Based Views and Reusable Apps

April 1

One of the major pain points of using reusable apps is modifying the view logic. No matter how many options you can pass into the view function, someone is going to want to do something different with it. I've been using Pinax for a little while now and modifying views is the only thing I don't love about it. Enter class-based views. There is currently a ticket out there to make them part of Django 1.2 and there is a great example out on djangosnippets.org. Instead of trying to deal with everything in the URLconf, you get a class object with methods that you can override. The simple user will not know the difference between the two, while the advanced user can create custom views easily, without repeating logic. Who likes DRY?
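The idea can be shown without Django at all. The sketch below is plain Python, not the Django ticket or the djangosnippets.org example: ListView, TagListView, and the dict used as a request are all hypothetical stand-ins. The base class is callable like a view function, and a subclass overrides one method instead of copying the whole view.

```python
class ListView:
    """A callable view whose steps are separate, overridable methods."""

    template = "{title}: {items}"
    title = "Items"

    def get_items(self, request):
        return []

    def get_context(self, request):
        return {"title": self.title, "items": ", ".join(self.get_items(request))}

    def __call__(self, request):
        # The "simple user" just calls the instance like a view function.
        return self.template.format(**self.get_context(request))


class TagListView(ListView):
    """The "advanced user" changes one step and reuses the rest."""

    title = "Tags"

    def get_items(self, request):
        return sorted(request.get("tags", []))


response = TagListView()({"tags": ["python", "django"]})
```

Only get_items and title differ in the subclass; the context building and rendering logic is written once.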


Denormalization and Managers

March 13

This is a "This is how I solved a problem, I know there is a better way, so someone please tell me" post. Denormalization is something that has gotten a bit of press in the Django community. David Cramer has a great post on a model to handle this, and there was some discussion on adding a DenormalisationField to Django core to handle it. I ended up needing to use a denormalized field on a project, but still wanted to use managers to handle the related field.


A Ride For Two

Feb. 22

Yesterday we took the curtains off A Ride for Two, which is the wedding website of my brother Brian and his fiancee Christine. There are a number of sites out there where you can create a wedding website, but honestly, where is the fun in that?


Drupy: A Drupal port built on Django.

Feb. 12

From the Drupy project home page:

Drupy is a Python port of the Drupal content management system built on Django.

Drupy had previously been a direct PHP-to-Python port of Drupal, but this had proven to be quite unmanageable. Drupy has taken a new direction in which it is being built on top of the Django Framework. This will offer maximum stability and a much more efficient development cycle.

I just have to ask why and cry on the inside. It's sort of like finding out your girlfriend is cheating on you with a mortal enemy.


I'm a developer out of Boston, MA, and I work for a consulting firm specializing in open source technologies.

This space will deal with the work I've participated in using the Django framework to build applications for enterprise clients.

Finally, I hate the word blog and Drupal.
