
  • Data warehousing Web server log files – Catholic Research Resources Alliance (CRRA) Blog
    ... database enabling people to query it. With this approach, counting the number of times anything occurs could be as easy as a single SQL (Structured Query Language) query, as opposed to tabulating tens of thousands of log file entries. To date the database is simple and defined by the following MySQL-specific SQL statement:

      CREATE TABLE IF NOT EXISTS crra_logs.transactions (
        id          INT           NOT NULL AUTO_INCREMENT PRIMARY KEY,
        host        VARCHAR(128)  NOT NULL,
        username    VARCHAR(16)   NOT NULL,
        password    VARCHAR(16)   NOT NULL,
        datetime    DATETIME      NOT NULL,
        timezone    VARCHAR(8)    NOT NULL,
        method      VARCHAR(8)    NOT NULL,
        request     VARCHAR(1024) NOT NULL,
        protocol    VARCHAR(8)    NOT NULL,
        statuscode  VARCHAR(8)    NOT NULL,
        bytessent   INT           NOT NULL,
        referrer    VARCHAR(1024) NOT NULL,
        useragent   VARCHAR(1024) NOT NULL,
        hosttype    VARCHAR(16)   DEFAULT 'unknown',
        requesttype VARCHAR(16)   DEFAULT 'unknown'
      );

    The astute Web server administrator will notice how the database's structure mirrors almost exactly an Apache "combined" log file, with the following exceptions:

      • id is a unique key
      • datetime is a reformulation of the time stamp found in Apache's logs
      • hosttype and requesttype are fields used to classify transactions (explained below)

    I then wrote a Perl script to read log files, parse each entry into fields, and stuff the result into the database. Thank goodness for regular expressions. Once this is done, it is almost trivial to answer questions like this:

      • How many different computers from the University of Notre Dame used the Portal?
        SELECT COUNT(host) AS c, host FROM transactions WHERE host LIKE '%.nd.edu' GROUP BY host ORDER BY c DESC;

      • What are the 100 most popular requests sent to the server?
        SELECT COUNT(request) AS c, request FROM transactions GROUP BY request ORDER BY c DESC LIMIT 100;

      • My computer's address is lib-1234.library.nd.edu. What requests did I make against the Portal on December 13, 2010, and in what order?
        SELECT datetime, request FROM transactions WHERE host = 'lib-1234.library.nd.edu' AND datetime LIKE '2010-12-13%' ORDER BY datetime ASC;

    Unfortunately, without some extra knowledge, answering Pat's questions is still problematic. For example, how does one count hits against the Portal when requests from Internet robots and spiders bloat the input? How does one accurately count searches for content and record views when so many of the requests include calls for images, JavaScript files, and cascading stylesheets? The answers lie in the use of classification as well as the hosttype and requesttype fields.

    Many, perhaps most, of the hits on the Portal come from a computer named googlebot.com. I know this is a robot, and I can flag database records accordingly with the following SQL:

      UPDATE transactions SET hosttype = 'robot' WHERE host LIKE '%googlebot.com';

    Once I do this for all the robots hitting the Portal, I can accurately answer the question, "What computers operated by humans use the Portal the most?"

      SELECT COUNT(host) AS c, host FROM transactions WHERE hosttype <> 'robot' GROUP BY host ORDER BY ...
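The parse, load, and classify steps described in this excerpt could be sketched roughly as follows. This is not the author's Perl script: it is a minimal Python illustration that uses an in-memory SQLite database in place of MySQL, and the regular expression, the function names (`parse_line`, `build_warehouse`, `flag_robots`, `human_hosts`), and the sample log lines are all assumptions made for the example. The column names follow the schema quoted above.

```python
import re
import sqlite3
from datetime import datetime

# Apache "combined" log format. The pattern is this sketch's own; the group
# names mirror the column names of the transactions table in the excerpt.
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<username>\S+) (?P<password>\S+) '
    r'\[(?P<datetime>[^\] ]+) (?P<timezone>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<request>\S+) (?P<protocol>[^"]+)" '
    r'(?P<statuscode>\d{3}) (?P<bytessent>\d+|-) '
    r'"(?P<referrer>[^"]*)" "(?P<useragent>[^"]*)"'
)

# Hypothetical sample entries, invented for illustration.
SAMPLE_LINES = [
    'crawl-1.googlebot.com - - [13/Dec/2010:08:15:02 -0500] '
    '"GET /robots.txt HTTP/1.1" 200 120 "-" "Googlebot/2.1"',
    'lib-1234.library.nd.edu - - [13/Dec/2010:09:00:00 -0500] '
    '"GET /portal/ HTTP/1.1" 200 5120 "-" "Mozilla/5.0"',
    'lib-1234.library.nd.edu - - [13/Dec/2010:09:00:05 -0500] '
    '"GET /portal/search?q=mary HTTP/1.1" 200 2048 "-" "Mozilla/5.0"',
]


def parse_line(line):
    """Split one log entry into fields, or return None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    fields = match.groupdict()
    # Reformulate Apache's time stamp as a sortable DATETIME-style string.
    stamp = datetime.strptime(fields["datetime"], "%d/%b/%Y:%H:%M:%S")
    fields["datetime"] = stamp.strftime("%Y-%m-%d %H:%M:%S")
    fields["bytessent"] = 0 if fields["bytessent"] == "-" else int(fields["bytessent"])
    return fields


def build_warehouse(lines):
    """Create the table in memory and load every parsable log entry."""
    db = sqlite3.connect(":memory:")
    db.execute(
        """CREATE TABLE transactions (
               id INTEGER PRIMARY KEY AUTOINCREMENT,
               host TEXT NOT NULL, username TEXT NOT NULL, password TEXT NOT NULL,
               datetime TEXT NOT NULL, timezone TEXT NOT NULL, method TEXT NOT NULL,
               request TEXT NOT NULL, protocol TEXT NOT NULL, statuscode TEXT NOT NULL,
               bytessent INTEGER NOT NULL, referrer TEXT NOT NULL, useragent TEXT NOT NULL,
               hosttype TEXT DEFAULT 'unknown', requesttype TEXT DEFAULT 'unknown')"""
    )
    for line in lines:
        fields = parse_line(line)
        if fields is not None:
            db.execute(
                """INSERT INTO transactions
                       (host, username, password, datetime, timezone, method, request,
                        protocol, statuscode, bytessent, referrer, useragent)
                   VALUES (:host, :username, :password, :datetime, :timezone, :method,
                           :request, :protocol, :statuscode, :bytessent, :referrer,
                           :useragent)""",
                fields,
            )
    return db


def flag_robots(db, host_patterns):
    """Classify every host matching one of the LIKE patterns as a robot."""
    for pattern in host_patterns:
        db.execute(
            "UPDATE transactions SET hosttype = 'robot' WHERE host LIKE ?", (pattern,)
        )


def human_hosts(db):
    """Count hits per host, leaving out hosts already flagged as robots."""
    return db.execute(
        "SELECT COUNT(host) AS c, host FROM transactions "
        "WHERE hosttype <> 'robot' GROUP BY host ORDER BY c DESC"
    ).fetchall()
```

Calling `build_warehouse(SAMPLE_LINES)`, then `flag_robots(db, ["%googlebot.com"])`, then `human_hosts(db)` leaves only the library workstation in the tally. Keeping the classification in a column rather than in each query's WHERE clause means the list of robots is maintained in one place.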

    Original URL path: http://www.catholicresearch.net/blog/2011/01/data-warehousing-web-server-log-files/ (2016-04-29)

  • Progress with statistics reporting – Catholic Research Resources Alliance (CRRA) Blog
    ... yesterday's log data. Otherwise it expects two inputs: 1) a beginning date, and 2) an ending date. If given these inputs, then I can drop the entire database and re-create it almost effortlessly. The script is called log-load.pl, and it is now running under cron, so the database gets updated daily.

    The next step is to create and automate reporting functions. I have already created a number of SQL queries. They are designed to be run from a shell script which outputs results to plain text files. These plain text files are presently put on the Web in a temporary location. This process is rather brain-dead. The next steps will include creating some sort of Web-based front end allowing readers (increasingly I don't use the word "users") to complete some sort of form and get real-time results. Wish me luck.

    Author: Eric Lease Morgan. I am a librarian first and a computer user second. My professional goal is to discover new ways to use computers to provide better library services. I use much of my time here at the University of Notre Dame developing and providing technical support for the Catholic Research Resources Alliance, the "Catholic Portal". Posted on September 15, 2011 / February 15, 2013. Categories: Tech Issues Tips.
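The input handling described for the loading script (yesterday's log data by default, otherwise a beginning and an ending date) might look something like the sketch below. It is a Python illustration only, not the Perl script itself; the function name `dates_to_load`, its parameters, and the choice to treat the range as inclusive are assumptions.

```python
from datetime import date, timedelta


def dates_to_load(begin=None, end=None, today=None):
    """Return the dates whose log data should be loaded.

    With no dates given, only yesterday is returned (the daily cron case).
    With both dates given, every day from begin through end is returned, which
    is what a full drop-and-reload of the database would need.
    """
    if begin is None and end is None:
        today = today or date.today()
        return [today - timedelta(days=1)]
    if begin is None or end is None:
        raise ValueError("give both a beginning date and an ending date, or neither")
    if end < begin:
        raise ValueError("the ending date must not precede the beginning date")
    span = (end - begin).days
    return [begin + timedelta(days=offset) for offset in range(span + 1)]
```

The optional `today` parameter exists only so the default case can be exercised with a fixed date; a cron job would simply call `dates_to_load()`.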

    Original URL path: http://www.catholicresearch.net/blog/2011/09/progress-with-statistics-reporting/ (2016-04-29)

  • Transforming schema-based EAD files – Catholic Research Resources Alliance (CRRA) Blog
    ... is full of two or three namespaces. None of my XPath statements worked. A number of people offered a number of suggestions. Some of them required the use of XSLT 2.0, which is not an option for me. Others thought I should update my existing stylesheets to accommodate the namespaces, but that would have been too complicated and not scalable. In the end I chose a different solution, which was alluded to by a number of other people: remove the namespaces. Each person offered a slightly different take on the problem, but in the end I went for a brute-force method I found in the TEI community Web space:

      <xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
        <xsl:output method="xml" indent="no"/>
        <xsl:template match="/|comment()|processing-instruction()">
          <xsl:copy>
            <xsl:apply-templates/>
          </xsl:copy>
        </xsl:template>
        <xsl:template match="*">
          <xsl:element name="{local-name()}">
            <xsl:apply-templates select="@*|node()"/>
          </xsl:element>
        </xsl:template>
        <xsl:template match="@*">
          <xsl:attribute name="{local-name()}">
            <xsl:value-of select="."/>
          </xsl:attribute>
        </xsl:template>
      </xsl:stylesheet>

    Consequently, my XML processing pipeline now looks like this:

      1. harvest EAD files
      2. validate them
      3. strip namespaces
      4. add unitids
      5. transform them into HTML
      6. index them
      7. done

    The next thing to do is improve Step 5, since the generic EAD-to-HTML transformation is just that, too generic.

    Author: Eric Lease Morgan. Posted on April 10, 2012 / February ...
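For readers who want to see the effect of namespace stripping without an XSLT processor, the same idea can be sketched in Python with the standard library's ElementTree. This is an illustration of the technique, not the stylesheet the author used; the function names and the sample EAD fragment are made up for the example, and unlike the stylesheet this version does not preserve comments or processing instructions, because ElementTree's default parser discards them.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily abbreviated EAD fragment with a default namespace
# and a prefixed (xlink) attribute.
SAMPLE_EAD = (
    '<ead xmlns="urn:isbn:1-931666-22-9" '
    'xmlns:xlink="http://www.w3.org/1999/xlink">'
    '<archdesc level="collection"><did>'
    '<unittitle>Sample papers</unittitle>'
    '<dao xlink:href="http://example.org/scan.jpg"/>'
    '</did></archdesc></ead>'
)


def local_name(qualified):
    """Drop ElementTree's leading '{namespace-uri}' from a tag or attribute name."""
    return qualified.rsplit("}", 1)[-1]


def strip_namespaces(xml_text):
    """Parse xml_text and rename every element and attribute to its local name."""
    root = ET.fromstring(xml_text)
    for element in root.iter():
        element.tag = local_name(element.tag)
        # Copy the attributes first, since the names are rewritten in place.
        attributes = list(element.attrib.items())
        element.attrib.clear()
        for name, value in attributes:
            element.set(local_name(name), value)
    return root
```

After `strip_namespaces(SAMPLE_EAD)`, a plain path such as `archdesc/did/unittitle` finds the title element, which is the property the unprefixed XPath statements in the existing stylesheets rely on.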

    Original URL path: http://www.catholicresearch.net/blog/2012/04/transforming-schema-based-ead-files/ (2016-04-29)

  • CRRA Update March 2012 – Catholic Research Resources Alliance (CRRA) Blog
    ... or Tuesday's meeting and/or lunch, so as to provide appropriately for a dinner reservation and for breaks and lunch on Tuesday. On Monday evening, June 25, we will gather for what is always an enjoyable dinner at a casual restaurant. We meet about 6:30. We will make a group reservation. We meet on Tuesday from 9:00 a.m. through 12:30 p.m., followed by lunch (optional). Our agenda is focused on mission support for the next year: identifying top priorities, ideas for forming local teams, and expanding our understanding of Catholic Studies. With the announcement that the Board has adopted a five-year strategic plan, we will be asking committees to develop their annual goals in this context and will be inviting all members to participate in identifying high priorities for the coming year.

    Agenda

      • Welcome: Janice Welburn, chair, Board of Directors
      • Annual goals, objectives and priorities: Moderator, Pat Lawton
      • Forming institutional teams: Panel discussion, TBA
      • Catholic Studies and challenges facing Catholic educators: Rev. James Heft, S.M., President, Institute for Advanced Catholic Studies at the University of Southern California, and Member, CRRA Leadership Council

    We look forward to meeting with as many of you as can be there. Please share this invitation with any others at your institution who may also be in Anaheim. Traditionally our meetings are open to others interested in our mission and activities. If you know of others who might like to attend, you can share this information or request that Pat or Jennifer do so. See you there. Jennifer Younger, CRRA Executive Director

    Diane Maher, Chair, CRRA Collections Committee

    We are delighted to announce that Diane Maher will serve as the Chair of the Collections Committee. Diane is the University Archivist and Special Collections Librarian at the University of San Diego (USD), liaison to the university's Art and Music Departments, and USD's CRRA Liaison. She holds an M.A. in English Literature from San Diego State University and an M.L.S. from UCLA. Before settling down (for the most part) in the Archives, she spent time experiencing other aspects of librarianship, from acquisitions and cataloging to serials and reference services. Diane has been involved with CRRA since attending the auspicious Boston College meeting of 2007.

    Establishing a Process for Contributing to the Portal: Member Perspectives

    CRRA recognizes that establishing a process for contributing member records to the Portal requires the institution to integrate this process into the existing workflow and its human and technical infrastructure. This takes thought and care, and how to make this happen is not always apparent. We will be presenting member stories of how member institutions integrate the task of including materials in the Catholic Portal as part of the institution's workflow and infrastructure. Fran Rice, Director of Information Systems and Digital Access at Roesch Library and CRRA Liaison, graciously accepted our invitation to begin this series of member stories. Thank you, Fran and the University of Dayton team, including Rachel Bilokonsky, Colleen Mahoney, Jillian Slater, and Dean of Libraries Kathleen Webb. (Pat)

    Selecting Materials for the Catholic Portal

    The University of Dayton University Libraries apply two approaches to the recommendation and selection of materials for inclusion in the Catholic Portal: collaboration and faculty recommendations.

    Selection of materials for the Catholic Portal is part of the digital planning process of our Digital Projects Advisory Committee. This committee includes representatives from the Marian Library/International Marian Research Institute and the U.S. Catholic Collection of Roesch Library. The Marian Library/International Marian Research Institute is recognized as the world's largest and most comprehensive collection of printed materials on Mary. The U.S. Catholic Collection preserves records of the Catholic Church and Catholic life in the United States and supports the University's new doctoral program in Religious Studies focusing on the U.S. Catholic experience. The Digital Projects Advisory Committee meets twice a year to select and prioritize collections for digitization. The committee considers the twelve collecting themes identified in the collection development policy statement of the CRRA to help identify materials appropriate for inclusion. Many of the themes, such as Catholic social action, religious orders, diocesan collections, and Catholic liturgy and devotion, are within the scope of the collections of the Marian Library and the U.S. Catholic Collection.

    Recommendations also come from our library faculty. Two of our faculty attended the Duquesne CRRA Symposium in November 2011. They returned from the symposium with a deeper appreciation for the Portal and quickly identified several rare research materials for inclusion. Both the Marian Library and the U.S. Catholic Collection have hired project archivists for a 1.5-year project. They have been tasked to inventory many unprocessed collections and create electronic finding aids (EAD). As the EADs are created, they are evaluated for inclusion in the Catholic Portal.

    Either one of these solutions might work at your institution as well.

    Fran Rice, Director of Information Systems and Digital Access, Roesch Library, University of Dayton; CRRA Liaison

    Note: The University of Dayton University Libraries joined the CRRA in 2010 and have 700 finding aids in the Catholic Portal. Please browse all of Dayton's Portal records by searching for all records in the Portal and limiting your results to Dayton.

    Now Available: Keynote Address from the CRRA Duquesne Symposium by Leslie Woodcock Tentler

    The Catholic Library Association has given CRRA permission to share the link to the full text of Dr. Tentler's inspiring keynote address presented at the November 2011 Symposium. The full-text article appears in the recent issue of Catholic Library World (CLW), vol. 82, issue 3, and is available here, courtesy of CLW: http www cathla org images stories clw 2012 tentler pdf. Thank you, CLW and CLA.

    CRRA Collections Spotlight: Saint Thomas More Library Collection at the University of San Diego

    In the midst of the Depression, the lifelong dream of Dr. Julia T. Metcalf, the first female graduate from Tufts School of Medicine, and her sister was ...

    Original URL path: http://www.catholicresearch.net/blog/2012/04/crra-update-march-2012/ (2016-04-29)