Google Crawls Into Source-Code Search

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
25 messages Options
12
Reply | Threaded
Open this post in threaded view
|

Re: Google Crawls Into Source-Code Search

Philippe Marschall
2006/10/8, Klaus D. Witzel <[hidden email]>:

> On Sun, 08 Oct 2006 19:37:29 +0200, Philippe Marschall wrote:
> > 2006/10/8, Klaus D. Witzel wrote:
> >> On Sun, 08 Oct 2006 13:45:36 +0200, Philippe Marschall wrote:
> >> > 2006/10/8, Giovanni Corriga wrote:
> >> >> Il giorno sab, 07/10/2006 alle 12.17 +0200, Philippe Marschall ha
> >> >> scritto:
> >> >> > continuation lang:smalltalk site:squeaksource.com
> >> >> > but it looks like site is not implemented (and mcz are not
> >> indexed) so
> >> >> > it doesn't work.
> >> >> >
> >> >> > But considering the amout of code available at SqueakSource, most
> >> >> > matches probably will be from there anyway (once it's implemented).
> >> >> >
> >> >> > But I think we could play dirty little tricks. List the mcz files
> >> as
> >> >> > zip too in the listing....
> >> >> > .... now that I think about it .... ;)
> >> >>
> >> >> What about sending a mail to the Google Code team
> >> >
> >> > Did that but you have to be realistic.
> >>
> >> Why would the google crawler index http://www.squeaksource.com/<put id
> >> here>/ , are there hyperlinks pointing to SqueakSource projects and/or
> >> is
> >> there a (session-less) SqueakSource page which presents hyperlinks of
> >> all
> >> projects to the crawler?
> >
> > Because google knows anything ;)
> > http://www.google.ch/search?q=shorecomponents
> > (2nd hit)
>
> Hhm, did the meaning of *all projects* change recently?

Pier is hosted at mc.lukas-renggli.ch (currently down)

But if it had to be building a google sitemap for squeaksource would be easy.

Philippe

> -
> http://www.google.com/search?q=pier+seaside+squeak+site%3Awww.squeaksource.com
>
> >> >> and working with them?
> >> >
> >> > How should that work? Submit a patch? Get a job at Google?
> ...
> > Maybe we should start marketing SqueakSource as Web 2.0 of source code
> > management.
>
> :) why not, if it works.
>
> /Klaus
>
> > I mean we have:
> > - wikis
> > - rss feeds
> > - tags
> > - collaboration
> > - user content
> > - run on a Mac (a real one, not a soulless MacPC) ;)
> > - dynamic languages
> > - continuations/seaside
> > - a vcs that beats the crap out of everything that is file based
> > - ...
> >
> > A blog post from Avi could make a real impact.
> >
> > Philippe
> >
> >
>
>
>
>

Reply | Threaded
Open this post in threaded view
|

Re: Google Crawls Into Source-Code Search

Klaus D. Witzel
On Sun, 08 Oct 2006 20:02:13 +0200, Philippe Marschall wrote:

> 2006/10/8, Klaus D. Witzel wrote:
>> On Sun, 08 Oct 2006 19:37:29 +0200, Philippe Marschall wrote:
>> > 2006/10/8, Klaus D. Witzel wrote:
>> >> On Sun, 08 Oct 2006 13:45:36 +0200, Philippe Marschall wrote:
>> >> > 2006/10/8, Giovanni Corriga wrote:
>> >> >> Il giorno sab, 07/10/2006 alle 12.17 +0200, Philippe Marschall ha
>> >> >> scritto:
>> >> >> > continuation lang:smalltalk site:squeaksource.com
>> >> >> > but it looks like site is not implemented (and mcz are not
>> >> indexed) so
>> >> >> > it doesn't work.
>> >> >> >
>> >> >> > But considering the amout of code available at SqueakSource,  
>> most
>> >> >> > matches probably will be from there anyway (once it's  
>> implemented).
>> >> >> >
>> >> >> > But I think we could play dirty little tricks. List the mcz  
>> files
>> >> as
>> >> >> > zip too in the listing....
>> >> >> > .... now that I think about it .... ;)
>> >> >>
>> >> >> What about sending a mail to the Google Code team
>> >> >
>> >> > Did that but you have to be realistic.
>> >>
>> >> Why would the google crawler index http://www.squeaksource.com/<put  
>> id
>> >> here>/ , are there hyperlinks pointing to SqueakSource projects  
>> and/or
>> >> is
>> >> there a (session-less) SqueakSource page which presents hyperlinks of
>> >> all
>> >> projects to the crawler?
>> >
>> > Because google knows anything ;)
>> > http://www.google.ch/search?q=shorecomponents
>> > (2nd hit)
>>
>> Hhm, did the meaning of *all projects* change recently?
>
> Pier is hosted at mc.lukas-renggli.ch (currently down)

:(

> But if it had to be building a google sitemap for squeaksource would be  
> easy.

I hereby volunteer to submitting the new sitemap's URL to the crawler(s)  
...

/Klaus

> Philippe
>
>> -
>> http://www.google.com/search?q=pier+seaside+squeak+site%3Awww.squeaksource.com
>>
>> >> >> and working with them?
>> >> >
>> >> > How should that work? Submit a patch? Get a job at Google?
>> ...
>> > Maybe we should start marketing SqueakSource as Web 2.0 of source code
>> > management.
>>
>> :) why not, if it works.
>>
>> /Klaus
>>
>> > I mean we have:
>> > - wikis
>> > - rss feeds
>> > - tags
>> > - collaboration
>> > - user content
>> > - run on a Mac (a real one, not a soulless MacPC) ;)
>> > - dynamic languages
>> > - continuations/seaside
>> > - a vcs that beats the crap out of everything that is file based
>> > - ...
>> >
>> > A blog post from Avi could make a real impact.
>> >
>> > Philippe
>> >
>>
>


Reply | Threaded
Open this post in threaded view
|

Re: Google Crawls Into Source-Code Search

Philippe Marschall
In reply to this post by Klaus D. Witzel
There are other fun queries:

"i hate windows"
http://www.google.com/codesearch?q=%22i+hate+windows%22&btnG=Search+Code
"i hate you"
http://www.google.com/codesearch?q=%22i+hate+you%22&btnG=Search+Code
"running in hell"
http://www.google.com/codesearch?hl=en&lr=&q=%22running+in+hell%22+.&btnG=Search

Philippe

2006/10/7, Klaus D. Witzel <[hidden email]>:

> You don't have to kill, there's nothing recorded with your (SqP) author
> initials ;-)
>
> - http://www.google.com/codesearch?q=lang:smalltalk+Craig
>
> /Klaus
>
> On Sat, 07 Oct 2006 08:14:22 +0200, Craig Latta wrote:
>
> >
> >      Ah, source code in files...
> >
> >      (Kill! Kill! Kill! :)
> >
> >
> > -C
> >
>
>
>
>

Reply | Threaded
Open this post in threaded view
|

Re: Google Crawls Into Source-Code Search

Philippe Marschall
In reply to this post by Klaus D. Witzel
2006/10/7, Klaus D. Witzel <[hidden email]>:
> Have a look at
>
> - http://www.google.com/codesearch?q=lang:smalltalk+continuation
>
> and enjoy Open Source!

There is even more fun:
http://www.kottke.org/06/10/google-code-search

Philippe

Reply | Threaded
Open this post in threaded view
|

Update: Google Crawls Into Source-Code Search

Klaus D. Witzel
In reply to this post by Klaus D. Witzel
Hi folks,

I've just finished posting .gz, .tar.gz and .zip URLs from the release  
related subdirectories of
- http://ftp.squeak.org/

using this form <phew/>
- http://www.google.com/codesearch/addcode

Ken: is there a way to find out what crawlers access http://ftp.squeak.org/

/Klaus

On Sat, 07 Oct 2006 07:19:04 +0200, Klaus D. Witzel  
<[hidden email]> wrote:

> Have a look at
>
> - http://www.google.com/codesearch?q=lang:smalltalk+continuation
>
> and enjoy Open Source!
>
> /Klaus
>
>
>



12